Big Data on the Web – What We Will Do
1. Jump into Action
Haklae Kim, PhD, April 2012
2. Today
This Presentation .....
Open Data and The Semantic Web
Introduction What We Will Do
Open Government Data & Linked Data
3. Let’s Start
Web in Transition
“a steady progression from a document-centric Web to one that is data-centric, including the mediation of semantics”
(Source: Mike, 2007)
5. Let’s Start
Big Data
“data that becomes large enough that it cannot be processed using conventional methods”
“Big Data is like Sex in High School–Lots of people are talking about it, but few are having it.”
-Eric Hansen, SiteSpect founder and CEO
7. Overview
Data on the Web
Data is information about things
Data is something machines can process
Data drives applications (e.g. web sites, mobile services)
Data is relations among things
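The last point, data as relations among things, can be sketched as subject-predicate-object triples. The facts and predicate names below are invented for illustration; this is a minimal sketch under those assumptions, not a real triple store.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: str


# Hypothetical facts, invented for illustration only.
TRIPLES = [
    Triple("Seoul", "is_capital_of", "South Korea"),
    Triple("Seoul", "type", "City"),
    Triple("South Korea", "type", "Country"),
]


def match(triples, subject=None, predicate=None, obj=None):
    """Return the triples that fit the pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if (subject is None or t.subject == subject)
        and (predicate is None or t.predicate == predicate)
        and (obj is None or t.obj == obj)
    ]


if __name__ == "__main__":
    # Everything known about one thing is the set of relations it takes part in.
    for t in match(TRIPLES, subject="Seoul"):
        print(t.subject, t.predicate, t.obj)
```

Because each fact is a relation between two named things, a machine can follow them from one thing to the next without knowing the application in advance.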
8. Definition
What is Open (Government) Data?
“Open”
material (data) is open if it can be freely used, reused and redistributed by anyone
“Government data”
data and information produced or commissioned by government or government controlled entities.
Source: Open Knowledge Foundation, 2010
9. • Transparency
• Participation
• Collaboration
“My administration is committed to creating an unprecedented level of openness in Government.” – Barack Obama
“Memorandum for the Heads of Executive Departments and Agencies – Transparency and Open Government” Jan 2009
15. Let’s Start
The Web as a Global Data Platform
.. a system of interlinked hypertext documents accessed via the Internet
16. All data including documents, services, people ...
DATA links
DATA
The Semantic Web is not about links between web pages.
17. Overview
Linked Data & The Semantic Web
“The Semantic Web isn't just about putting data on the web. It is about making links, so that a person or machine can explore the web of data. With linked data, when you have some of it, you can find other, related, data” - Tim Berners-Lee (TBL)
5 Stars Open linked data
★ Make your stuff available on the Web
★★ Make it available as structured data
★★★ Use open, standard formats (instead of Excel)
★★★★ Use an open data format: URLs, descriptions
★★★★★ Link your data to other people’s data
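The five-star scale above can be read as a cumulative checklist. The sketch below assumes that reading (a star only counts if every earlier one is met); the `Dataset` fields are hypothetical names chosen to mirror the five lines of the slide.

```python
from dataclasses import dataclass


@dataclass
class Dataset:
    on_web: bool = False            # 1 star: available on the Web
    structured: bool = False        # 2 stars: structured data
    open_format: bool = False       # 3 stars: open, standard format
    uses_urls: bool = False         # 4 stars: URLs identify things
    linked_to_others: bool = False  # 5 stars: links to other people's data


def star_rating(ds: Dataset) -> int:
    """Count stars in order, stopping at the first criterion that is not met."""
    criteria = [
        ds.on_web,
        ds.structured,
        ds.open_format,
        ds.uses_urls,
        ds.linked_to_others,
    ]
    stars = 0
    for met in criteria:
        if not met:
            break
        stars += 1
    return stars


if __name__ == "__main__":
    spreadsheet_on_website = Dataset(on_web=True, structured=True)
    print(star_rating(spreadsheet_on_website))
```

Under this reading, a structured file published on the Web in a non-open format rates two stars, however well it is linked.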
18. Overview
Growth of Interlinks
… Linked Data provides the means to reach the goal of the Semantic Web – “the emergence of a Web of Data”
[Figure: snapshots dated 2007-05-01, 2007-10-08, 2007-11-10, 2008-02-28, 2008-03-31, 2008-09-18, 2009-03-05, 2009-03-27, 2009-07-14, 2010-09-22]
19. Structured Wikipedia: DBpedia
Multimedia Content: BBC
Commercial Product: Best Buy
Government Data: UK Gov
October 2011: 295 interlinked datasets, approximately 31 billion triples
24. Question
What is the Semantic Web for?
Standards Search Inference Intelligence
25. Case Studies
Google’s Semantic Search
“People should be able to ask questions and we should understand their meaning, or they should be able to talk about things at a conceptual level. ... A lot of people will turn to things like the semantic Web as a possible answer to that.” - Google Vice President of Search Products & User Experience Marissa Mayer
Schema.org is an initiative launched on 2 June 2011 by Bing, Google and Yahoo! to "create and support a common set of schemas for structured data markup on web pages."
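As a rough illustration of structured data markup, the sketch below emits a schema.org `Person` description as JSON-LD. JSON-LD is only one of the syntaxes that can carry schema.org vocabulary, and the slide does not say which syntax is meant; the helper `person_markup` and the sample name and address are hypothetical.

```python
import json


def person_markup(name: str, email: str) -> str:
    """Wrap a schema.org Person description in a JSON-LD script tag."""
    doc = {
        "@context": "https://schema.org",
        "@type": "Person",   # schema.org type
        "name": name,        # schema.org property
        "email": email,      # schema.org property
    }
    body = json.dumps(doc, indent=2)
    return f'<script type="application/ld+json">\n{body}\n</script>'


if __name__ == "__main__":
    # Hypothetical person, invented for illustration.
    print(person_markup("Jane Doe", "jane@example.org"))
```

The resulting tag can sit in a page's HTML, where a crawler can read the typed fields instead of guessing them from free text.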
Freebase is an open, Creative Commons licensed repository of structured data of almost 22 million entities. An entity is a single person, place, or thing connected by a graph.
The Knowledge Graph is a collection of information sources that help discern a user’s specified intent with each individual query. The graph is actually an encyclopedia with structured information obtained from the web. (currently, 200 million entities)
Schema.org: http://schema.org/docs/full.html
26. Case Studies
Apple’s Siri
Ask Siri how Apple recorded the best quarter in history for a tech company, and her answer should be: "Me."
Siri (Speech Interpretation and Recognition Interface) is an intelligent personal assistant and knowledge navigator which works as an application for Apple's iOS.
Knowledge Navigator (1987): a concept described by former Apple Computer CEO John Sculley in his 1987 book, Odyssey.
A Brief History
- In December 2007 Siri, Inc. was formed by Dag Kittlaus (CEO), Adam Cheyer (VP Engineering), and Tom Gruber (CTO/VP Design).
- Siri Inc. sought funding and by November 2009 had secured a $15.5 million investment, which resulted in the creation of the first Siri application, which debuted on the iPhone 3GS in February 2010.
- Siri is acquired by Apple; the iPhone becomes the Virtual Personal Assistant
(Source: http://www.youtube.com/watch?v=QRH8eimU_20)
27. Case Studies
Active Ontology
A processing formalism where distinct processing elements are arranged according to ontology notions; an execution environment.
Basic concepts
* Ontology : A data structure
- Formal representation for domain knowledge
- Classes, attributes, relations
* Active Ontology : A processing environment
- Processing elements arranged according to ontology notions
- Communication channels
[Figure: processing elements (P) movie, genre, actor, rating; a rule set made of rules, each with a condition and an action]
(Baur et al., 2007)
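The idea of processing elements arranged like an ontology, each holding condition/action rules, can be sketched as follows. This is not the Active platform's API: the classes `Rule` and `ProcessingElement`, the keyword vocabularies, and the shared `facts` dictionary are assumptions made for illustration only.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

Facts = Dict[str, str]


@dataclass
class Rule:
    condition: Callable[[Facts], bool]
    action: Callable[[Facts], None]


@dataclass
class ProcessingElement:
    name: str
    rules: List[Rule] = field(default_factory=list)
    children: List["ProcessingElement"] = field(default_factory=list)

    def evaluate(self, facts: Facts) -> None:
        # Children run first so the parent can react to what they found.
        for child in self.children:
            child.evaluate(facts)
        for rule in self.rules:
            if rule.condition(facts):
                rule.action(facts)


def keyword_rule(slot: str, vocabulary: List[str]) -> Rule:
    """Build a rule that fills `slot` when a known word is in the utterance."""
    def find(facts: Facts):
        words = facts["utterance"].lower().split()
        return next((w for w in vocabulary if w in words), None)

    return Rule(
        condition=lambda facts: find(facts) is not None,
        action=lambda facts: facts.__setitem__(slot, find(facts)),
    )


# Hypothetical vocabularies; a real system would use richer matching.
genre = ProcessingElement("genre", [keyword_rule("genre", ["action", "comedy"])])
actor = ProcessingElement("actor", [keyword_rule("actor", ["chaplin"])])
movie = ProcessingElement(
    "movie",
    rules=[Rule(
        condition=lambda f: "genre" in f or "actor" in f,
        action=lambda f: f.__setitem__("intent", "find_movie"),
    )],
    children=[genre, actor],
)

facts: Facts = {"utterance": "find action movies in San Francisco"}
movie.evaluate(facts)
print(facts)
```

The tree shape mirrors the domain (a movie has a genre and an actor), and the shared dictionary stands in for the communication channels between elements.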
28. Examples
Active Framework
a platform for constructing service-oriented applications which can be accessed through multiple modalities in a natural, task-oriented manner that leverages context throughout the experience
[Figure: components Active Editor, Active Server, Active Console, Active Ontology, facts store, evaluation engine; example requests: “find action movies in San Francisco”, “nearby Chinese restaurants”]
(Baur et al., 2007)
29. What We Will Do
Interdisciplinary Collaboration
Difficult
Think
Hope is not a strategy and the “change” has been change for the worse, and not better.
30. References
- Charles Baur, Adam Cheyer, Didier Guzzoni, Active, a platform for building intelligent software
- Page 1: http://www.w3.org/DesignIssues/diagrams/websci/Marius%20Watz%20-%20Web%20Science%20artwork.png
- Page 4: http://www.go-gulf.com/60seconds.jpg
- Page 9: http://cloud.frontpagemag.com/wp-content/uploads/2012/03/obama11.jpg
- Page 27: http://www.patentlyapple.com/.a/6a0120a5580826970c0168e5ccdd81970c-800wi
- Page 29: http://programminggeeks.com/wp-content/uploads/2010/05/Programming-Geeks-Web-Science.jpg
- Page 29: http://3.bp.blogspot.com/-C0Kyck90Djo/T4KZTg3k1XI/AAAAAAAAAsE/RUp165S0FCQ/s1600/Commitment.jpeg
31. For more information
contact Haklae Kim via
haklae.kim@gmail.com
Twitter: haklaekim
Or read up on the
sonagi blog at:
http://blogweb.co.kr