Big data is here, created and used by human. This talk identifies three key issues when we turn big data into productivity, and then illustrates a five-C process (i.e. capture, clean, connect, compute, and communicate) via a social web example.
More than Just Lines on a Map: Best Practices for U.S Bike Routes
Unlocking Big Data on the Social Web
1. Unlocking Big Data
on the Social Web
Li Ding
CAISS 2013 Shanghai Summit, 2013-07-23, Shanghai, China 1
Memect
Memory Connected
2. Memect
Memory Connected
Big data is right here,
created and used by human
CAISS 2013 Shanghai Summit, 2013-07-23, Shanghai, China 2
http://www.gooddata.com/images/uploads/big-data-image.jpg
3. Memect
Memory Connected
But less than 5% were used
Well-known Big Data Features
• Volume: e.g. Facebook, Google, Twitter -- distributed and locked
• Velocity: e.g. stock price -- process it now or lost it
• Variety: e.g. bank statements -- tedious preprocess
CAISS 2013 Shanghai Summit, 2013-07-23, Shanghai, China 3
I have big data but my data is locked get value out of the data
from Big Data to Productivity
• Veracity: filter low quality data and false statements
• Versatile: low-cost, flexible, open support for various (unexpected) mash-ups
• Value: tangible benefits for human users
4. Memect
Memory Connected
Big data, machine, me and the Society
CAISS 2013 Shanghai Summit, 2013-07-23, Shanghai, China 4
capture connectclean
computecommunicate
Item value
Product Power soap
GPS (38.111,77.034)
User John Doe
“power soap” is a cleaner,
not food (Wikipedia)
(38.111,77.034) is closed to
“Subway” and “Walmart”
(open street map)
John Doe is at Walmart
right now because “power
soap” is not food, while
subway only sell food
Contact Foursquare Walmart
host regarding to “power soap”
Post “at Walmart” on Facebook
Data Web
“power soap” has
four 1-star reviews
(Amazon)
The
Cloud
me
Social
Web
6. Memect
Memory Connected
Selected Projects
• Linked Open Government Data (US
government): serve society using a
variety of public records from
international governments, powered by
the cloud and the crowd
• Context-Aware Computing (mobile
industry): capture real-time context
data using smartphone, and
connect/compute the data to make the
smartphone smarter
• Linking Financial Data: using connected
financial data and social events
• Memory Connected. connect our brain
memory and the Web memory, enable
memory exchange network
CAISS 2013 Shanghai Summit, 2013-07-23, Shanghai, China 6