SlideShare una empresa de Scribd logo
1 de 32
Descargar para leer sin conexión
@I seek ‘fb.me’:
Identifying Users across Multiple
Online Social Networks
Workshop	
  on	
  Web	
  of	
  Linked	
  En11es	
  (WoLE)
Paridhi	
  Jain¶,	
  Ponnurangam	
  Kumaraguru¶,	
  Anupam	
  Joshi*
¶Indraprastha	
  Ins1tute	
  of	
  Informa1on	
  Technology	
  (IIIT-­‐Delhi)

*University	
  of	
  Maryland,	
  Bal1more	
  County	
  (UMBC)

1
Motivation
Multiple OSNs
Multiple Identities

Difficult to manage? Difficult to find?

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

2

2
Motivation
Multiple OSNs
Multiple Identities

Social Aggregation site

Difficult to manage? Difficult to find?

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

2

2
Motivation
Multiple OSNs
Multiple Identities

Social Aggregation site

Difficult to manage? Difficult to find?

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

Friend	
  Finder?
Malicious	
  user?
Influen1al	
  user?
User	
  of	
  interest?

2

2
Motivation
Multiple OSNs
Multiple Identities

Social Aggregation site

Difficult to manage? Difficult to find?

Friend	
  Finder?
Malicious	
  user?
Influen1al	
  user?
User	
  of	
  interest?

Identity Resolution Problem
13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

2

2
Identity Resolution
• For a user I, given a user identity IA on a social network A, find user
identity IB on social network B.

{IA}

Alice

13/05/13

{IB}

??

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

3
3
Identity Resolution =
Identity Search + Identity Matching
•

Identity Search
For a user I, given her identity IA on a social network A, and a search
parameter S, find the set of identities IBj on social network B such that
S(IA) ⋍ S(IB).

{IA,S}
•

{IB1, ... IBj, ... , IBN} = Q

Identity Matching
Given a user identity IA on a social network A, a set of candidate
identities Q on social network B, and a match function M, locate an
identity pair (IA, IBj) such that M(IA, IBj) = max{M(IA, IB1), M(IA, IBN)}

{IA, Q, M}
13/05/13

{IA, IBj}

{IB}

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

4
4
Research Gaps?
– Till	
  now,	
  focus	
  on	
  bePer	
  iden1ty	
  matching	
  algorithms
– Only	
  profile	
  aPributes	
  (private	
  and	
  public)	
  for	
  Iden1ty	
  Search
– Limita1ons	
  of	
  Profile	
  Search	
  -­‐
– Restric1ve	
  search,	
  owing	
  to	
  non-­‐availability	
  of	
  common	
  aPributes	
  across	
  
networks.	
  [Gender	
  on	
  Facebook,	
  but	
  not	
  on	
  TwiPer]
– Search	
  with	
  Limited	
  aPributes	
  →	
  Large	
  candidate	
  set	
  size	
  →	
  Intensive	
  
Iden1ty	
  Matching	
  computa1on
– Users	
  may	
  choose	
  different	
  profile	
  aPributes	
  →	
  Miss	
  out	
  correct	
  iden1ty	
  in	
  
the	
  candidate	
  set
– LiPle	
  research	
  on	
  using	
  content	
  and	
  network	
  aPributes	
  to	
  search	
  for	
  candidate	
  
iden11es
– Extensive	
  use	
  of	
  both	
  private	
  and	
  public	
  aPributes.	
  Need	
  user	
  authoriza1on	
  for	
  
iden1ty	
  search

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

5
5
Research Gaps?
– Till	
  now,	
  focus	
  on	
  bePer	
  iden1ty	
  matching	
  algorithms
– Only	
  profile	
  aPributes	
  (private	
  and	
  public)	
  for	
  Iden1ty	
  Search
– Limita1ons	
  of	
  Profile	
  Search	
  -­‐
– Restric1ve	
  search,	
  owing	
  to	
  non-­‐availability	
  of	
  common	
  aPributes	
  across	
  
networks.	
  [Gender	
  on	
  Facebook,	
  but	
  not	
  on	
  TwiPer]
– Search	
  with	
  Limited	
  aPributes	
  →	
  Large	
  candidate	
  set	
  size	
  →	
  Intensive	
  
Iden1ty	
  Matching	
  computa1on
– Users	
  may	
  choose	
  different	
  profile	
  aPributes	
  →	
  Miss	
  out	
  correct	
  iden1ty	
  in	
  
the	
  candidate	
  set
– LiPle	
  research	
  on	
  using	
  content	
  and	
  network	
  aPributes	
  to	
  search	
  for	
  candidate	
  
iden11es
– Extensive	
  use	
  of	
  both	
  private	
  and	
  public	
  aPributes.	
  Need	
  user	
  authoriza1on	
  for	
  
iden1ty	
  search

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

6
6
Research Gaps?
– Till	
  now,	
  focus	
  on	
  bePer	
  iden1ty	
  matching	
  algorithms
– Only	
  profile	
  aPributes	
  (private	
  and	
  public)	
  for	
  Iden1ty	
  Search
– Limita1ons	
  of	
  Profile	
  Search	
  -­‐
– Restric1ve	
  search,	
  owing	
  to	
  non-­‐availability	
  of	
  common	
  aPributes	
  across	
  
networks.	
  [Gender	
  on	
  Facebook,	
  but	
  not	
  on	
  TwiPer]
– Search	
  with	
  Limited	
  aPributes	
  →	
  Large	
  candidate	
  set	
  size	
  →	
  Intensive	
  
Iden1ty	
  Matching	
  computa1on
– Users	
  may	
  choose	
  different	
  profile	
  aPributes	
  →	
  Miss	
  out	
  correct	
  iden1ty	
  in	
  
the	
  candidate	
  set
– LiPle	
  research	
  on	
  using	
  content	
  and	
  network	
  aPributes	
  to	
  search	
  for	
  candidate	
  
iden11es
– Extensive	
  use	
  of	
  both	
  private	
  and	
  public	
  aPributes.	
  Need	
  user	
  authoriza1on	
  for	
  
iden1ty	
  search

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

7
7
Proposal
– Include	
  content	
  and	
  network	
  aPributes	
  as	
  search	
  parameters
– Access	
  only	
  publicly	
  accessible	
  aPributes
– Focus	
  on	
  two	
  popular	
  social	
  networks	
  -­‐	
  TwiPer	
  and	
  Facebook

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

8
8
Contribution
– Proposed	
  novel	
  iden1ty	
  search	
  methods	
  on	
  social	
  networks
– Our	
  iden1ty	
  resolu1on	
  methods	
  return	
  correct	
  Facebook	
  iden1ty	
  for	
  39%	
  
TwiPer	
  users	
  within	
  top-­‐2	
  ranks
– We	
  observe	
  an	
  increase	
  in	
  accuracy	
  of	
  iden1ty	
  resolu1on	
  by	
  11.6%	
  owing	
  to	
  
inclusion	
  of	
  content	
  and	
  network	
  iden1ty	
  search,	
  along	
  with	
  improvised	
  profile	
  
search

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

9
9
Methodology
?
?
?
?

Search

13/05/13

Candidate
Identities

If self-identified /
returned by
more than one
search method

Yes

No

Syntactic
and Image

Manual
Verification

Match

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

10
10
Identity Matching
– Syntac1c	
  Matching
– Jaro	
  Distance	
  comparison	
  between	
  username	
  and	
  name
– Example:	
  {alice123,	
  jane_alice},	
  {Alice	
  Naura,	
  Alice	
  N.	
  Janice}

– Image	
  Matching

where	
  hIA	
  and	
  hIBj	
  are	
  the	
  RGB	
  histograms	
  of	
  the	
  profile	
  image	
  and	
  Ns	
  represent	
  
histogram	
  size	
  of	
  IA

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

11
11
Profile Search
Self	
  -­‐	
  Iden1fica1on	
  

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

12
12
Profile Search
Self	
  -­‐	
  Iden1fica1on	
  

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

12
12
Content Search

13

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

13
Content Search

13

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

13
Self-mention Search

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

14
14
Self-mention Search

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

14
14
Network Search

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

15
15
Instance,

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

16
16
Instance,

Public	
  Friend	
  List	
  
of	
  a	
  user	
  extracted	
  
from	
  public	
  feeds

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

16
16
Integrated System -

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

17
17
Evaluation
Dataset

# of users

Social Graph API

543

Method (543 users)

% Accurate

Profile (P)

205

37.7

Content (C + SM)

34

6.3

Network (N)

1

0.2

Finding Nemo

13/05/13

# of users

212

39

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

18
18
Evaluation
Dataset

# of users

Social Graph API

543

Method (543 users)

% Accurate

Profile (P)

205

37.7

Content (C + SM)

34

6.3

Network (N)

1

0.2

Finding Nemo

212

39

Search Algorithm

# of users
identified

Accuracy

P (without URL)

149

27.4%

P (with URL) + C + N +
SM
13/05/13

# of users

149+56+6+1 =
149+71

27.4% +
11.6%

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

18
18
Mean Average Precision

↓

Matching algorithm
Image (profile image)

0.83

Syntactic (username)

0.76

Syntactic (name)

13/05/13

MAP Score

0.80

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

19
19
Demo

hPp://www.youtube.com/watch?v=-­‐AFsCtKwO0c

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

20
20
Take away

Inclusion	
  of	
  content	
  and	
  network	
  a9ributes	
  for	
  iden1ty	
  search	
  
not	
  only	
  improves	
  iden1ty	
  resolu1on	
  accuracy	
  but	
  returns	
  
correct	
  Facebook	
  iden1ty	
  within	
  top-­‐2	
  ranks	
  for	
  majority	
  of	
  the	
  
TwiPer	
  users.

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

21
21
Current and Future Work
– Extend	
  the	
  social	
  networks	
  to	
  search	
  for	
  a	
  given	
  iden1ty.	
  
Example,	
  Google+,	
  Foursquare,	
  etc.
– Extend	
  the	
  search	
  methods	
  to	
  include	
  social-­‐network	
  specific	
  
features
– Find	
  mul1ple	
  (fake)	
  iden11es	
  of	
  users	
  within	
  social	
  networks

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

22
22
Questions?
paridhij@iiitd.ac.in,	
  pk@iiitd.ac.in,	
  joshi@cs.umbc.edu
precog.iiitd.edu.in
Paper:	
  hPp://precog.iiitd.edu.in/publica1ons.html

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

23
23
For	
  any	
  further	
  informa1on,	
  please	
  write	
  to	
  
pk@iiitd.ac.in
precog.iiitd.edu.in

24

Más contenido relacionado

Más de IIIT Hyderabad

Beyond the Surface: A Computational Exploration of Linguistic Ambiguity
Beyond the Surface: A Computational Exploration of Linguistic AmbiguityBeyond the Surface: A Computational Exploration of Linguistic Ambiguity
Beyond the Surface: A Computational Exploration of Linguistic AmbiguityIIIT Hyderabad
 
Data Science for Social Good: #LegalNLP #AlgorithmicBias...
Data Science for Social Good:                      #LegalNLP #AlgorithmicBias...Data Science for Social Good:                      #LegalNLP #AlgorithmicBias...
Data Science for Social Good: #LegalNLP #AlgorithmicBias...IIIT Hyderabad
 
How to Write a (Good) Research Paper
How to Write a (Good) Research Paper How to Write a (Good) Research Paper
How to Write a (Good) Research Paper IIIT Hyderabad
 
Data Science for Social Good: #LegalNLP #AlgorithmicBias
Data Science for Social Good: #LegalNLP #AlgorithmicBiasData Science for Social Good: #LegalNLP #AlgorithmicBias
Data Science for Social Good: #LegalNLP #AlgorithmicBiasIIIT Hyderabad
 
Social Computing Research in India
Social Computing Research in IndiaSocial Computing Research in India
Social Computing Research in IndiaIIIT Hyderabad
 
Social Computing Research in India
Social Computing Research in IndiaSocial Computing Research in India
Social Computing Research in IndiaIIIT Hyderabad
 
Modeling Online User Interactions and their Offline effects on Socio-Technica...
Modeling Online User Interactions and their Offline effects on Socio-Technica...Modeling Online User Interactions and their Offline effects on Socio-Technica...
Modeling Online User Interactions and their Offline effects on Socio-Technica...IIIT Hyderabad
 
Privacy. Winter School on “Topics in Digital Trust”. IIT Bombay
Privacy. Winter School on “Topics in Digital Trust”. IIT BombayPrivacy. Winter School on “Topics in Digital Trust”. IIT Bombay
Privacy. Winter School on “Topics in Digital Trust”. IIT BombayIIIT Hyderabad
 
It is our choices, Harry, that show what we truly are, far more than our abil...
It is our choices, Harry, that show what we truly are, far more than our abil...It is our choices, Harry, that show what we truly are, far more than our abil...
It is our choices, Harry, that show what we truly are, far more than our abil...IIIT Hyderabad
 
It is our choices, Harry, that show what we truly are, far more than our abil...
It is our choices, Harry, that show what we truly are, far more than our abil...It is our choices, Harry, that show what we truly are, far more than our abil...
It is our choices, Harry, that show what we truly are, far more than our abil...IIIT Hyderabad
 
Leveraging Social Media for Financial Advice
Leveraging Social Media for Financial AdviceLeveraging Social Media for Financial Advice
Leveraging Social Media for Financial AdviceIIIT Hyderabad
 
Development of Stress Induction and Detection System to Study its Effect on B...
Development of Stress Induction and Detection System to Study its Effect on B...Development of Stress Induction and Detection System to Study its Effect on B...
Development of Stress Induction and Detection System to Study its Effect on B...IIIT Hyderabad
 
A Framework for Automatic Question Answering in Indian Languages
A Framework for Automatic Question Answering in Indian LanguagesA Framework for Automatic Question Answering in Indian Languages
A Framework for Automatic Question Answering in Indian LanguagesIIIT Hyderabad
 
A Framework For Automatic Question Answering in Indian Languages
A Framework For Automatic Question Answering in Indian LanguagesA Framework For Automatic Question Answering in Indian Languages
A Framework For Automatic Question Answering in Indian LanguagesIIIT Hyderabad
 
Exposing, Examining and Intervening Fake News
Exposing, Examining and Intervening Fake NewsExposing, Examining and Intervening Fake News
Exposing, Examining and Intervening Fake NewsIIIT Hyderabad
 
It's MY JOB: Identifying and Improving Content Quality for Online recruitmen...
 It's MY JOB: Identifying and Improving Content Quality for Online recruitmen... It's MY JOB: Identifying and Improving Content Quality for Online recruitmen...
It's MY JOB: Identifying and Improving Content Quality for Online recruitmen...IIIT Hyderabad
 
De-anonymizing, Preserving and Democratizing Data Privacy and Ownership
De-anonymizing, Preserving and Democratizing Data Privacy and OwnershipDe-anonymizing, Preserving and Democratizing Data Privacy and Ownership
De-anonymizing, Preserving and Democratizing Data Privacy and OwnershipIIIT Hyderabad
 
Justice Delayed is Justice Denied: Enabling Legal Artificial Intelligence via...
Justice Delayed is Justice Denied: Enabling Legal Artificial Intelligence via...Justice Delayed is Justice Denied: Enabling Legal Artificial Intelligence via...
Justice Delayed is Justice Denied: Enabling Legal Artificial Intelligence via...IIIT Hyderabad
 
NLP / Language Research at Precog
NLP / Language Research at PrecogNLP / Language Research at Precog
NLP / Language Research at PrecogIIIT Hyderabad
 
“It is our choices, Harry, that show what we truly are, far more than our abi...
“It is our choices, Harry, that show what we truly are, far more than our abi...“It is our choices, Harry, that show what we truly are, far more than our abi...
“It is our choices, Harry, that show what we truly are, far more than our abi...IIIT Hyderabad
 

Más de IIIT Hyderabad (20)

Beyond the Surface: A Computational Exploration of Linguistic Ambiguity
Beyond the Surface: A Computational Exploration of Linguistic AmbiguityBeyond the Surface: A Computational Exploration of Linguistic Ambiguity
Beyond the Surface: A Computational Exploration of Linguistic Ambiguity
 
Data Science for Social Good: #LegalNLP #AlgorithmicBias...
Data Science for Social Good:                      #LegalNLP #AlgorithmicBias...Data Science for Social Good:                      #LegalNLP #AlgorithmicBias...
Data Science for Social Good: #LegalNLP #AlgorithmicBias...
 
How to Write a (Good) Research Paper
How to Write a (Good) Research Paper How to Write a (Good) Research Paper
How to Write a (Good) Research Paper
 
Data Science for Social Good: #LegalNLP #AlgorithmicBias
Data Science for Social Good: #LegalNLP #AlgorithmicBiasData Science for Social Good: #LegalNLP #AlgorithmicBias
Data Science for Social Good: #LegalNLP #AlgorithmicBias
 
Social Computing Research in India
Social Computing Research in IndiaSocial Computing Research in India
Social Computing Research in India
 
Social Computing Research in India
Social Computing Research in IndiaSocial Computing Research in India
Social Computing Research in India
 
Modeling Online User Interactions and their Offline effects on Socio-Technica...
Modeling Online User Interactions and their Offline effects on Socio-Technica...Modeling Online User Interactions and their Offline effects on Socio-Technica...
Modeling Online User Interactions and their Offline effects on Socio-Technica...
 
Privacy. Winter School on “Topics in Digital Trust”. IIT Bombay
Privacy. Winter School on “Topics in Digital Trust”. IIT BombayPrivacy. Winter School on “Topics in Digital Trust”. IIT Bombay
Privacy. Winter School on “Topics in Digital Trust”. IIT Bombay
 
It is our choices, Harry, that show what we truly are, far more than our abil...
It is our choices, Harry, that show what we truly are, far more than our abil...It is our choices, Harry, that show what we truly are, far more than our abil...
It is our choices, Harry, that show what we truly are, far more than our abil...
 
It is our choices, Harry, that show what we truly are, far more than our abil...
It is our choices, Harry, that show what we truly are, far more than our abil...It is our choices, Harry, that show what we truly are, far more than our abil...
It is our choices, Harry, that show what we truly are, far more than our abil...
 
Leveraging Social Media for Financial Advice
Leveraging Social Media for Financial AdviceLeveraging Social Media for Financial Advice
Leveraging Social Media for Financial Advice
 
Development of Stress Induction and Detection System to Study its Effect on B...
Development of Stress Induction and Detection System to Study its Effect on B...Development of Stress Induction and Detection System to Study its Effect on B...
Development of Stress Induction and Detection System to Study its Effect on B...
 
A Framework for Automatic Question Answering in Indian Languages
A Framework for Automatic Question Answering in Indian LanguagesA Framework for Automatic Question Answering in Indian Languages
A Framework for Automatic Question Answering in Indian Languages
 
A Framework For Automatic Question Answering in Indian Languages
A Framework For Automatic Question Answering in Indian LanguagesA Framework For Automatic Question Answering in Indian Languages
A Framework For Automatic Question Answering in Indian Languages
 
Exposing, Examining and Intervening Fake News
Exposing, Examining and Intervening Fake NewsExposing, Examining and Intervening Fake News
Exposing, Examining and Intervening Fake News
 
It's MY JOB: Identifying and Improving Content Quality for Online recruitmen...
 It's MY JOB: Identifying and Improving Content Quality for Online recruitmen... It's MY JOB: Identifying and Improving Content Quality for Online recruitmen...
It's MY JOB: Identifying and Improving Content Quality for Online recruitmen...
 
De-anonymizing, Preserving and Democratizing Data Privacy and Ownership
De-anonymizing, Preserving and Democratizing Data Privacy and OwnershipDe-anonymizing, Preserving and Democratizing Data Privacy and Ownership
De-anonymizing, Preserving and Democratizing Data Privacy and Ownership
 
Justice Delayed is Justice Denied: Enabling Legal Artificial Intelligence via...
Justice Delayed is Justice Denied: Enabling Legal Artificial Intelligence via...Justice Delayed is Justice Denied: Enabling Legal Artificial Intelligence via...
Justice Delayed is Justice Denied: Enabling Legal Artificial Intelligence via...
 
NLP / Language Research at Precog
NLP / Language Research at PrecogNLP / Language Research at Precog
NLP / Language Research at Precog
 
“It is our choices, Harry, that show what we truly are, far more than our abi...
“It is our choices, Harry, that show what we truly are, far more than our abi...“It is our choices, Harry, that show what we truly are, far more than our abi...
“It is our choices, Harry, that show what we truly are, far more than our abi...
 

Último

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 

Último (20)

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 

@I seek 'fb.me': Identifying Users across Multiple Online Social Networks

  • 1. @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks Workshop  on  Web  of  Linked  En11es  (WoLE) Paridhi  Jain¶,  Ponnurangam  Kumaraguru¶,  Anupam  Joshi* ¶Indraprastha  Ins1tute  of  Informa1on  Technology  (IIIT-­‐Delhi) *University  of  Maryland,  Bal1more  County  (UMBC) 1
  • 2. Motivation Multiple OSNs Multiple Identities Difficult to manage? Difficult to find? 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 2 2
  • 3. Motivation Multiple OSNs Multiple Identities Social Aggregation site Difficult to manage? Difficult to find? 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 2 2
  • 4. Motivation Multiple OSNs Multiple Identities Social Aggregation site Difficult to manage? Difficult to find? 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks Friend  Finder? Malicious  user? Influen1al  user? User  of  interest? 2 2
  • 5. Motivation Multiple OSNs Multiple Identities Social Aggregation site Difficult to manage? Difficult to find? Friend  Finder? Malicious  user? Influen1al  user? User  of  interest? Identity Resolution Problem 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 2 2
  • 6. Identity Resolution • For a user I, given a user identity IA on a social network A, find user identity IB on social network B. {IA} Alice 13/05/13 {IB} ?? @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 3 3
  • 7. Identity Resolution = Identity Search + Identity Matching • Identity Search For a user I, given her identity IA on a social network A, and a search parameter S, find the set of identities IBj on social network B such that S(IA) ⋍ S(IB). {IA,S} • {IB1, ... IBj, ... , IBN} = Q Identity Matching Given a user identity IA on a social network A, a set of candidate identities Q on social network B, and a match function M, locate an identity pair (IA, IBj) such that M(IA, IBj) = max{M(IA, IB1), M(IA, IBN)} {IA, Q, M} 13/05/13 {IA, IBj} {IB} @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 4 4
  • 8. Research Gaps? – Till  now,  focus  on  bePer  iden1ty  matching  algorithms – Only  profile  aPributes  (private  and  public)  for  Iden1ty  Search – Limita1ons  of  Profile  Search  -­‐ – Restric1ve  search,  owing  to  non-­‐availability  of  common  aPributes  across   networks.  [Gender  on  Facebook,  but  not  on  TwiPer] – Search  with  Limited  aPributes  →  Large  candidate  set  size  →  Intensive   Iden1ty  Matching  computa1on – Users  may  choose  different  profile  aPributes  →  Miss  out  correct  iden1ty  in   the  candidate  set – LiPle  research  on  using  content  and  network  aPributes  to  search  for  candidate   iden11es – Extensive  use  of  both  private  and  public  aPributes.  Need  user  authoriza1on  for   iden1ty  search 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 5 5
  • 9. Research Gaps? – Till  now,  focus  on  bePer  iden1ty  matching  algorithms – Only  profile  aPributes  (private  and  public)  for  Iden1ty  Search – Limita1ons  of  Profile  Search  -­‐ – Restric1ve  search,  owing  to  non-­‐availability  of  common  aPributes  across   networks.  [Gender  on  Facebook,  but  not  on  TwiPer] – Search  with  Limited  aPributes  →  Large  candidate  set  size  →  Intensive   Iden1ty  Matching  computa1on – Users  may  choose  different  profile  aPributes  →  Miss  out  correct  iden1ty  in   the  candidate  set – LiPle  research  on  using  content  and  network  aPributes  to  search  for  candidate   iden11es – Extensive  use  of  both  private  and  public  aPributes.  Need  user  authoriza1on  for   iden1ty  search 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 6 6
  • 10. Research Gaps? – Till  now,  focus  on  bePer  iden1ty  matching  algorithms – Only  profile  aPributes  (private  and  public)  for  Iden1ty  Search – Limita1ons  of  Profile  Search  -­‐ – Restric1ve  search,  owing  to  non-­‐availability  of  common  aPributes  across   networks.  [Gender  on  Facebook,  but  not  on  TwiPer] – Search  with  Limited  aPributes  →  Large  candidate  set  size  →  Intensive   Iden1ty  Matching  computa1on – Users  may  choose  different  profile  aPributes  →  Miss  out  correct  iden1ty  in   the  candidate  set – LiPle  research  on  using  content  and  network  aPributes  to  search  for  candidate   iden11es – Extensive  use  of  both  private  and  public  aPributes.  Need  user  authoriza1on  for   iden1ty  search 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 7 7
  • 11. Proposal – Include  content  and  network  aPributes  as  search  parameters – Access  only  publicly  accessible  aPributes – Focus  on  two  popular  social  networks  -­‐  TwiPer  and  Facebook 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 8 8
  • 12. Contribution – Proposed  novel  iden1ty  search  methods  on  social  networks – Our  iden1ty  resolu1on  methods  return  correct  Facebook  iden1ty  for  39%   TwiPer  users  within  top-­‐2  ranks – We  observe  an  increase  in  accuracy  of  iden1ty  resolu1on  by  11.6%  owing  to   inclusion  of  content  and  network  iden1ty  search,  along  with  improvised  profile   search 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 9 9
  • 13. Methodology ? ? ? ? Search 13/05/13 Candidate Identities If self-identified / returned by more than one search method Yes No Syntactic and Image Manual Verification Match @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 10 10
  • 14. Identity Matching – Syntac1c  Matching – Jaro  Distance  comparison  between  username  and  name – Example:  {alice123,  jane_alice},  {Alice  Naura,  Alice  N.  Janice} – Image  Matching where  hIA  and  hIBj  are  the  RGB  histograms  of  the  profile  image  and  Ns  represent   histogram  size  of  IA 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 11 11
  • 15. Profile Search Self  -­‐  Iden1fica1on   13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 12 12
  • 16. Profile Search Self  -­‐  Iden1fica1on   13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 12 12
  • 17. Content Search 13 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 13
  • 18. Content Search 13 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 13
  • 19. Self-mention Search 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 14 14
  • 20. Self-mention Search 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 14 14
  • 21. Network Search 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 15 15
  • 22. Instance, 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 16 16
  • 23. Instance, Public  Friend  List   of  a  user  extracted   from  public  feeds 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 16 16
  • 24. Integrated System - 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 17 17
  • 25. Evaluation Dataset # of users Social Graph API 543 Method (543 users) % Accurate Profile (P) 205 37.7 Content (C + SM) 34 6.3 Network (N) 1 0.2 Finding Nemo 13/05/13 # of users 212 39 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 18 18
  • 26. Evaluation Dataset # of users Social Graph API 543 Method (543 users) % Accurate Profile (P) 205 37.7 Content (C + SM) 34 6.3 Network (N) 1 0.2 Finding Nemo 212 39 Search Algorithm # of users identified Accuracy P (without URL) 149 27.4% P (with URL) + C + N + SM 13/05/13 # of users 149+56+6+1 = 149+71 27.4% + 11.6% @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 18 18
  • 27. Mean Average Precision ↓ Matching algorithm Image (profile image) 0.83 Syntactic (username) 0.76 Syntactic (name) 13/05/13 MAP Score 0.80 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 19 19
  • 28. Demo hPp://www.youtube.com/watch?v=-­‐AFsCtKwO0c 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 20 20
  • 29. Take away Inclusion  of  content  and  network  a9ributes  for  iden1ty  search   not  only  improves  iden1ty  resolu1on  accuracy  but  returns   correct  Facebook  iden1ty  within  top-­‐2  ranks  for  majority  of  the   TwiPer  users. 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 21 21
  • 30. Current and Future Work – Extend  the  social  networks  to  search  for  a  given  iden1ty.   Example,  Google+,  Foursquare,  etc. – Extend  the  search  methods  to  include  social-­‐network  specific   features – Find  mul1ple  (fake)  iden11es  of  users  within  social  networks 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 22 22
  • 31. Questions? paridhij@iiitd.ac.in,  pk@iiitd.ac.in,  joshi@cs.umbc.edu precog.iiitd.edu.in Paper:  hPp://precog.iiitd.edu.in/publica1ons.html 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 23 23
  • 32. For  any  further  informa1on,  please  write  to   pk@iiitd.ac.in precog.iiitd.edu.in 24