This talk was given by Hien Luu (Senior Software Engineer at LinkedIn) and Siddharth Anand (Senior Staff Software Engineer at LinkedIn) at the Hadoop Summit (June 2013).
6. Other Company Facts
*
• Headquartered
in
Mountain
View,
Calif.,
with
offices
around
the
world!
• As
of
June
1,
2013,
LinkedIn
has
~3,700
full-‐Rme
employees
located
around
the
world
Source :
http://press.linkedin.com/about
7. Agenda
ü Company Overview
• Big Data @ LinkedIn
• The Segmentation & Targeting Problem
• Solution : LinkedIn Segmentation & Targeting Platform
• Q & A
11. Big Data Story : On-line Data
Oracle
or
Espresso
Data
Change
Events
Search
Index
Graph
Index
Read
Replicas
Updates
Standar
dizaRon
A user updates the company, title, & school on his profile. He also accepts a
connection
The write is made to an Oracle or Espresso Master and DataBus replicates it:
• the profile change is applied to the Standardization service
Ø E.g. the many forms of IBM were canonicalized for search-friendliness
• …. and to the Search Index
Ø Recruiters can find you immediately by new keywords
• the connection change is applied to the Graph Index service
Ø The user can now start receiving feed updates from his new connections
12. Big Data Story : On-line Data
Databus streams also update Hadoop!
Oracle
or
Espresso
Search
Index
Graph
Index
Read
Replica
Updates
Standar
dizaRon
Data
Change
Events