SlideShare una empresa de Scribd logo
1 de 32
Descargar para leer sin conexión
The Data Era: Production,
Consumption, Challenges
         Miriam Fernández
          8th November, CAEPIA 2011

   Website: http://people.kmi.open.ac.uk/miriam/about/
                     Twitter: @miri_fs
    Slide_share: http://www.slideshare.net/miriamfs
What is … ?
How do humans
                 infer knowledge?

                           Alejandro
                           in Chicago!

Semantic interpretation


                                         A picture!

Syntactic interpretation
How do machines
                 infer knowledge?



Semantic interpretation


                              A picture!

Syntactic interpretation
The Challenge

• We need to find the way in which
  machines will interpret and extract
  knowledge for us!




                 =
The Challenge
The Data Era

• The 2011 Digital University Study:
  Extracting Value from Chaos (IDC)
  – We have entered the Zettabyte era (a
    trillion gigabytes or a billion terabytes)
  – The great of information growth appears
    to be exceeding Moore’s Law

 http://www.emc.com/collateral/demos/
 microsites/emc-digital-universe-
 2011/index.htm
Big Value from
                                  Data
• Big Data: The next frontier for
  innovation, competition and
  productivity (McKinsey)
  – $300 billion potential annual value to
    US health care
  – €250 billion potential annual value to
    Europe’s public sector administration
http://www.mckinsey.com/mgi/publications/big_data/pdfs/M
GI_big_data_full_report.pdf
IBM City Forward
The Smarter Cities Challenge is a competitive grant program
awarding $50 million worth of IBM expertise over the next three
years to 100 cities around the globe. Designed to address the
wide range of challenges facing cities today
Consumption
• We need to provide efficient ways to
  consume data in order to extract the
  value out of it, the knowledge
  – Syntactic approaches (visual analytics)
    • The data is collected, centralized and analysed
    • Visualizations for humans to extract knowledge
  – Semantic approaches
    • The information is distributed / interlinked
    • Semantic structures are added to the data so
      that machines can better understand it
Syntactic
                  approaches
• Some examples
 – Gap Minder
 – IBM many eyes
 – Google Public Data Explorer
 – Google correlate
 – Google N-Gram viewer
   • What is the most popular hair
     colour in the literature?
Google N-Gram
Viewer
Semantic
                        approaches
• The Semantic Web is an extension of
  the current web in which information
  is given well-defined meaning,
  better enabling computers and people
  to work in cooperation

 Tim Berners-Lee, James Hendler,
 Ora Lassila, The Semantic Web,
 Scientific American, May 2001
The SW vision

• Use semantic structures
  (ontologies) to represent data.
  Provide machines with the ability to
  interpret and extract knowledge



                   =
Adding Structure

• Two paths towards the SW vision
  – Metadata embedded in HTML
    • Microformats
    • RDFa
    • Microdata
  – Linked Data
    • Putting the data online in a standard, web
      enabled representation (RDF)
    • Make the data Web addressable (URIs)
Metadata in HTML
                            <div class="vcard">
 • An example                <div class="fn org">Knowledge Media Institute</div>
                             <div class="adr">
Knowledge Media Institute     <div class="street-address">Walton Hall</div>
Walton Hall                   <div>
Milton Keynes                   <span class="locality">Milton Keynes</span>,
MK7 6AA                         <span class="postal-code">MK7 6AA</span>
                              </div>
                              <div class="country-name">United Kingdom</div>
                             </div>
                            </div>
Metadata in HTML

• Schema.org




                 Semantically enhanced Information Retrieval:
                 an ontology-based approach
                 http://people.kmi.open.ac.uk/miriam/about/
Metadata in HTML

• The Open Graph protocol
2007
                                                Linked Data

                  2008


                                         2009                 2010




     Linking Open Data cloud diagram,
by Richard Cyganiak and Anja Jentzsch.
        http://lod-cloud.net/
Linked Data

• An example
     http://data.semanticweb.org/person/miriam-fernandez/rdf
     <ns1:Person rdf:about="http://data.semanticweb.org/person/miriam-
     fernandez">
     <rdf:type   rdf:resource="http://xmlns.com/foaf/0.1/Person"/>




          @prefix dbpedia <http://dbpedia.org/resource/>.
          @prefix dbterm   <http://dbpedia.org/property/>.
          dbpedia:Amsterdam
            dbterm:officialName “Amsterdam” ;
            dbterm:longd “4” ;
            dbterm:longm “53” ;
            dbterm:longs “32” ;…
Open Government

• Data.gov
• Data.gov.uk
• Many others…




                   Research Funding Explorer
BBC

 • Programs
 • Music
 • Artist
 • World Cup
Who won it? ;)
Open University

               DBPedia                                             RAE
                                                       Data from
                           OpenLearn
                                                       Research
                            Content          ORO        Outputs
Exposed as linked
data, our data                  Archive of
                                                Library’s
                                 Course
Currently: OUeach
interlink withgeonames                          Catalogue
                 public          Material
                                                Of Digital
data sit in the external
other and different                              Content
                                                                   data.gov.uk
systemsbecome to
world: – hard part               A/V Material
of the “global data
discover, obtain,
                                   Podcasts
                                   iTunesU
space” on the Web
integrate by users.BBC
                                                               DBLP
Data.open.ac.uk

        data.open.ac.
        uk
The Value

• Recognized as a critical step forward
  for the HE sector in the UK
  – Favor transparency and reuse of data,
    both externally and internally
  – Reduces cost of dealing with our own
    public data
  – Enable both new kinds of applications,
    and to make the ones that are already
    feasible more cost effective
The Value

• Linking educational material across
  universities http://smartproducts1.kmi.open.ac.uk/
                    web-linkeduniversities/index.htm
The Value

• Exploring research communities
The Value

• And many others….
Conclusions

• We have reached the Data Era
  – Production: currently more than a
    Zettabyte of information in the digital world
    and increasing really fast
  – Consumption: syntactic and semantic
    approaches have emerged to extract the
    value (the knowledge) out of the data
  – Challenges: Provide machines with the
    capabilities to extract the knowledge for us!
Conclusions

• Many more challenges ahead…
 – Different formats (text vs. multimedia)
 – Different dynamics (time / location)
 – Different provenance
 – Different topics (heterogeneous)
 – Distributed, Massive, stream
 – Various quality
 –…
THX!
• Any ideas to make me rich? ☺


                                      =
 • Slide_share: http://www.slideshare.net/miriamfs
 • Website: http://people.kmi.open.ac.uk/miriam/about/
 • Twitter: @miri_fs
  Thanks to Fouad Zablith and Mathieu d'Aquin ☺ for sharing with me some of their slides and
  for their valuable comments on this presentation

Más contenido relacionado

La actualidad más candente

Aligning library services with emerging research data needs
Aligning library services with emerging research data needsAligning library services with emerging research data needs
Aligning library services with emerging research data needsAndrew Sallans
 
Research data management: a tale of two paradigms:
Research data management: a tale of two paradigms: Research data management: a tale of two paradigms:
Research data management: a tale of two paradigms: Martin Donnelly
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management EcosystemJohn Kunze
 
Demo: Profiling & Exploration of Linked Open Data
Demo: Profiling & Exploration of Linked Open DataDemo: Profiling & Exploration of Linked Open Data
Demo: Profiling & Exploration of Linked Open DataStefan Dietze
 
Learning Analytics & Linked Data – Opportunities, Challenges, Examples
Learning Analytics & Linked Data – Opportunities, Challenges, ExamplesLearning Analytics & Linked Data – Opportunities, Challenges, Examples
Learning Analytics & Linked Data – Opportunities, Challenges, ExamplesStefan Dietze
 
Open Data & Education Seminar, ITMO, St Petersburg, March 2014
Open Data & Education Seminar, ITMO, St Petersburg, March 2014Open Data & Education Seminar, ITMO, St Petersburg, March 2014
Open Data & Education Seminar, ITMO, St Petersburg, March 2014Stefan Dietze
 
Research Data Management: a gentle introduction for admin staff
Research Data Management: a gentle introduction for admin staffResearch Data Management: a gentle introduction for admin staff
Research Data Management: a gentle introduction for admin staffMartin Donnelly
 
LinkedUp at Mozilla Festival Science Fair
LinkedUp at Mozilla Festival Science FairLinkedUp at Mozilla Festival Science Fair
LinkedUp at Mozilla Festival Science FairMarieke Guy
 
Digital Cultural Heritage and Open Education
Digital Cultural Heritage and Open EducationDigital Cultural Heritage and Open Education
Digital Cultural Heritage and Open EducationLorna Campbell
 
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...Stefan Dietze
 
web 2.0, library systems and the library system
web 2.0, library systems and the library systemweb 2.0, library systems and the library system
web 2.0, library systems and the library systemlisld
 
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Jisc
 
Why should semantic technologies pay more attention to privacy... and vice-ve...
Why should semantic technologies pay more attention to privacy... and vice-ve...Why should semantic technologies pay more attention to privacy... and vice-ve...
Why should semantic technologies pay more attention to privacy... and vice-ve...Mathieu d'Aquin
 
Data, Science, Society - Claudio Gutierrez, University of Chile
Data, Science, Society - Claudio Gutierrez, University of ChileData, Science, Society - Claudio Gutierrez, University of Chile
Data, Science, Society - Claudio Gutierrez, University of ChileLEARN Project
 
Lessons Learnt from LinkedUp
Lessons Learnt from LinkedUpLessons Learnt from LinkedUp
Lessons Learnt from LinkedUpMarieke Guy
 
Semantic Web, Linked Data and Education: A Perfect Fit?
Semantic Web, Linked Data and Education: A Perfect Fit?Semantic Web, Linked Data and Education: A Perfect Fit?
Semantic Web, Linked Data and Education: A Perfect Fit?Mathieu d'Aquin
 
Adopting technology session
Adopting technology sessionAdopting technology session
Adopting technology sessionMike Frohlich
 

La actualidad más candente (20)

Aligning library services with emerging research data needs
Aligning library services with emerging research data needsAligning library services with emerging research data needs
Aligning library services with emerging research data needs
 
Research data management: a tale of two paradigms:
Research data management: a tale of two paradigms: Research data management: a tale of two paradigms:
Research data management: a tale of two paradigms:
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management Ecosystem
 
Demo: Profiling & Exploration of Linked Open Data
Demo: Profiling & Exploration of Linked Open DataDemo: Profiling & Exploration of Linked Open Data
Demo: Profiling & Exploration of Linked Open Data
 
Learning Analytics & Linked Data – Opportunities, Challenges, Examples
Learning Analytics & Linked Data – Opportunities, Challenges, ExamplesLearning Analytics & Linked Data – Opportunities, Challenges, Examples
Learning Analytics & Linked Data – Opportunities, Challenges, Examples
 
Open Data & Education Seminar, ITMO, St Petersburg, March 2014
Open Data & Education Seminar, ITMO, St Petersburg, March 2014Open Data & Education Seminar, ITMO, St Petersburg, March 2014
Open Data & Education Seminar, ITMO, St Petersburg, March 2014
 
Research Data Management: a gentle introduction for admin staff
Research Data Management: a gentle introduction for admin staffResearch Data Management: a gentle introduction for admin staff
Research Data Management: a gentle introduction for admin staff
 
LinkedUp at Mozilla Festival Science Fair
LinkedUp at Mozilla Festival Science FairLinkedUp at Mozilla Festival Science Fair
LinkedUp at Mozilla Festival Science Fair
 
Digital Cultural Heritage and Open Education
Digital Cultural Heritage and Open EducationDigital Cultural Heritage and Open Education
Digital Cultural Heritage and Open Education
 
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
 
web 2.0, library systems and the library system
web 2.0, library systems and the library systemweb 2.0, library systems and the library system
web 2.0, library systems and the library system
 
EDINA National Datacentre Activity Update to GWG
EDINA National Datacentre Activity Update to GWGEDINA National Datacentre Activity Update to GWG
EDINA National Datacentre Activity Update to GWG
 
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
 
Why should semantic technologies pay more attention to privacy... and vice-ve...
Why should semantic technologies pay more attention to privacy... and vice-ve...Why should semantic technologies pay more attention to privacy... and vice-ve...
Why should semantic technologies pay more attention to privacy... and vice-ve...
 
Semantic Web-Linked Data and Libraries
Semantic Web-Linked Data and LibrariesSemantic Web-Linked Data and Libraries
Semantic Web-Linked Data and Libraries
 
Data, Science, Society - Claudio Gutierrez, University of Chile
Data, Science, Society - Claudio Gutierrez, University of ChileData, Science, Society - Claudio Gutierrez, University of Chile
Data, Science, Society - Claudio Gutierrez, University of Chile
 
Lessons Learnt from LinkedUp
Lessons Learnt from LinkedUpLessons Learnt from LinkedUp
Lessons Learnt from LinkedUp
 
Semantic Web, Linked Data and Education: A Perfect Fit?
Semantic Web, Linked Data and Education: A Perfect Fit?Semantic Web, Linked Data and Education: A Perfect Fit?
Semantic Web, Linked Data and Education: A Perfect Fit?
 
Aggregation as tactic sm new
Aggregation as tactic sm newAggregation as tactic sm new
Aggregation as tactic sm new
 
Adopting technology session
Adopting technology sessionAdopting technology session
Adopting technology session
 

Destacado

Destacado (8)

Dealers Program
Dealers ProgramDealers Program
Dealers Program
 
SocInfo2014 CityLabs Workshop
SocInfo2014 CityLabs WorkshopSocInfo2014 CityLabs Workshop
SocInfo2014 CityLabs Workshop
 
ESWC 2014 Tutorial part 2
ESWC 2014 Tutorial part 2ESWC 2014 Tutorial part 2
ESWC 2014 Tutorial part 2
 
Toyota slides candi's revision
Toyota slides  candi's revisionToyota slides  candi's revision
Toyota slides candi's revision
 
ESWC 2014 Tutorial part 3
ESWC 2014 Tutorial part 3ESWC 2014 Tutorial part 3
ESWC 2014 Tutorial part 3
 
ESWC 2014 Tutorial Part 4
ESWC 2014 Tutorial Part 4ESWC 2014 Tutorial Part 4
ESWC 2014 Tutorial Part 4
 
ESWC 2014 Tutorial part 1
ESWC 2014 Tutorial part 1ESWC 2014 Tutorial part 1
ESWC 2014 Tutorial part 1
 
DealersProgram
DealersProgramDealersProgram
DealersProgram
 

Similar a CAEPIA 2011

ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012Lee Dirks
 
Big and Small Web Data
Big and Small Web DataBig and Small Web Data
Big and Small Web DataMarieke Guy
 
New challenges for digital scholarship and curation in the era of ubiquitous ...
New challenges for digital scholarship and curation in the era of ubiquitous ...New challenges for digital scholarship and curation in the era of ubiquitous ...
New challenges for digital scholarship and curation in the era of ubiquitous ...Derek Keats
 
Culture Hack panel SXSW 2013
Culture Hack panel SXSW 2013Culture Hack panel SXSW 2013
Culture Hack panel SXSW 2013Antoine Isaac
 
Linked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsLinked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsJon Voss
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...BigData_Europe
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so farEnrico Daga
 
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosOCLC
 
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...Peter Löwe
 
From DARPA to Shakespeare: All the Data we Can Handle
From DARPA to Shakespeare: All the Data we Can Handle From DARPA to Shakespeare: All the Data we Can Handle
From DARPA to Shakespeare: All the Data we Can Handle Kimberly Hoffman
 
Getting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open AccessGetting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open AccessAbby Clobridge
 
Why manage research data?
Why manage research data?Why manage research data?
Why manage research data?Graham Pryor
 
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...Sirris
 
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECAProject
 
Introduction to APIs and Linked Data
Introduction to APIs and Linked DataIntroduction to APIs and Linked Data
Introduction to APIs and Linked DataAdrian Stevenson
 
Change Management for Libraries
Change Management for LibrariesChange Management for Libraries
Change Management for LibrariesThomas King
 
The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?Anna Fensel
 

Similar a CAEPIA 2011 (20)

ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
 
Big and Small Web Data
Big and Small Web DataBig and Small Web Data
Big and Small Web Data
 
New challenges for digital scholarship and curation in the era of ubiquitous ...
New challenges for digital scholarship and curation in the era of ubiquitous ...New challenges for digital scholarship and curation in the era of ubiquitous ...
New challenges for digital scholarship and curation in the era of ubiquitous ...
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Seminario Sobre Datasets Consorcio Madrono
Seminario Sobre Datasets Consorcio Madrono Seminario Sobre Datasets Consorcio Madrono
Seminario Sobre Datasets Consorcio Madrono
 
Culture Hack panel SXSW 2013
Culture Hack panel SXSW 2013Culture Hack panel SXSW 2013
Culture Hack panel SXSW 2013
 
Linked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsLinked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & Museums
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
 
Here Comes Everything
Here Comes EverythingHere Comes Everything
Here Comes Everything
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so far
 
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
 
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
 
From DARPA to Shakespeare: All the Data we Can Handle
From DARPA to Shakespeare: All the Data we Can Handle From DARPA to Shakespeare: All the Data we Can Handle
From DARPA to Shakespeare: All the Data we Can Handle
 
Getting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open AccessGetting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open Access
 
Why manage research data?
Why manage research data?Why manage research data?
Why manage research data?
 
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
 
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
 
Introduction to APIs and Linked Data
Introduction to APIs and Linked DataIntroduction to APIs and Linked Data
Introduction to APIs and Linked Data
 
Change Management for Libraries
Change Management for LibrariesChange Management for Libraries
Change Management for Libraries
 
The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?
 

Más de Miriam Fernandez

Biases in Social Media Research (NoBias EU project)
Biases in Social Media Research (NoBias EU project)Biases in Social Media Research (NoBias EU project)
Biases in Social Media Research (NoBias EU project)Miriam Fernandez
 
Research seminar Queen Mary University of London (CogSci)
Research seminar Queen Mary University of London (CogSci)Research seminar Queen Mary University of London (CogSci)
Research seminar Queen Mary University of London (CogSci)Miriam Fernandez
 
Vision track october_2020_fernandez_v5
Vision track october_2020_fernandez_v5Vision track october_2020_fernandez_v5
Vision track october_2020_fernandez_v5Miriam Fernandez
 
On the Application of Social Data Science to Address Societal Challenges
On the Application of Social Data Science to Address Societal ChallengesOn the Application of Social Data Science to Address Societal Challenges
On the Application of Social Data Science to Address Societal ChallengesMiriam Fernandez
 
Online radicalisation: work, challenges and future directions
Online radicalisation: work, challenges and future directionsOnline radicalisation: work, challenges and future directions
Online radicalisation: work, challenges and future directionsMiriam Fernandez
 
Mining Social Media Data For Policing
Mining Social Media Data For PolicingMining Social Media Data For Policing
Mining Social Media Data For PolicingMiriam Fernandez
 
Introduction to Mining Social Media Data
Introduction to Mining Social Media DataIntroduction to Mining Social Media Data
Introduction to Mining Social Media DataMiriam Fernandez
 
Online Misinformation: Challenges and Future Directions
Online Misinformation: Challenges and Future DirectionsOnline Misinformation: Challenges and Future Directions
Online Misinformation: Challenges and Future DirectionsMiriam Fernandez
 
Slides 28-feb-2018-v2.pptx
Slides 28-feb-2018-v2.pptxSlides 28-feb-2018-v2.pptx
Slides 28-feb-2018-v2.pptxMiriam Fernandez
 
Artificial Intelligence for Policing
Artificial Intelligence for PolicingArtificial Intelligence for Policing
Artificial Intelligence for PolicingMiriam Fernandez
 
OUSocial OUSocMed conference
OUSocial OUSocMed conference OUSocial OUSocMed conference
OUSocial OUSocMed conference Miriam Fernandez
 
On the use of social media for evidence-based policing
On the use of social media for evidence-based policingOn the use of social media for evidence-based policing
On the use of social media for evidence-based policingMiriam Fernandez
 
ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...
ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...
ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...Miriam Fernandez
 
ESWC 2014 Tutorial Handson 1: Collect Data from Facebook
ESWC 2014 Tutorial Handson 1: Collect Data from FacebookESWC 2014 Tutorial Handson 1: Collect Data from Facebook
ESWC 2014 Tutorial Handson 1: Collect Data from FacebookMiriam Fernandez
 
Wm unit1.6-slides-semantic web-final
Wm unit1.6-slides-semantic web-finalWm unit1.6-slides-semantic web-final
Wm unit1.6-slides-semantic web-finalMiriam Fernandez
 
Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...
Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...
Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...Miriam Fernandez
 

Más de Miriam Fernandez (16)

Biases in Social Media Research (NoBias EU project)
Biases in Social Media Research (NoBias EU project)Biases in Social Media Research (NoBias EU project)
Biases in Social Media Research (NoBias EU project)
 
Research seminar Queen Mary University of London (CogSci)
Research seminar Queen Mary University of London (CogSci)Research seminar Queen Mary University of London (CogSci)
Research seminar Queen Mary University of London (CogSci)
 
Vision track october_2020_fernandez_v5
Vision track october_2020_fernandez_v5Vision track october_2020_fernandez_v5
Vision track october_2020_fernandez_v5
 
On the Application of Social Data Science to Address Societal Challenges
On the Application of Social Data Science to Address Societal ChallengesOn the Application of Social Data Science to Address Societal Challenges
On the Application of Social Data Science to Address Societal Challenges
 
Online radicalisation: work, challenges and future directions
Online radicalisation: work, challenges and future directionsOnline radicalisation: work, challenges and future directions
Online radicalisation: work, challenges and future directions
 
Mining Social Media Data For Policing
Mining Social Media Data For PolicingMining Social Media Data For Policing
Mining Social Media Data For Policing
 
Introduction to Mining Social Media Data
Introduction to Mining Social Media DataIntroduction to Mining Social Media Data
Introduction to Mining Social Media Data
 
Online Misinformation: Challenges and Future Directions
Online Misinformation: Challenges and Future DirectionsOnline Misinformation: Challenges and Future Directions
Online Misinformation: Challenges and Future Directions
 
Slides 28-feb-2018-v2.pptx
Slides 28-feb-2018-v2.pptxSlides 28-feb-2018-v2.pptx
Slides 28-feb-2018-v2.pptx
 
Artificial Intelligence for Policing
Artificial Intelligence for PolicingArtificial Intelligence for Policing
Artificial Intelligence for Policing
 
OUSocial OUSocMed conference
OUSocial OUSocMed conference OUSocial OUSocMed conference
OUSocial OUSocMed conference
 
On the use of social media for evidence-based policing
On the use of social media for evidence-based policingOn the use of social media for evidence-based policing
On the use of social media for evidence-based policing
 
ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...
ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...
ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...
 
ESWC 2014 Tutorial Handson 1: Collect Data from Facebook
ESWC 2014 Tutorial Handson 1: Collect Data from FacebookESWC 2014 Tutorial Handson 1: Collect Data from Facebook
ESWC 2014 Tutorial Handson 1: Collect Data from Facebook
 
Wm unit1.6-slides-semantic web-final
Wm unit1.6-slides-semantic web-finalWm unit1.6-slides-semantic web-final
Wm unit1.6-slides-semantic web-final
 
Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...
Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...
Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...
 

Último

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 

Último (20)

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 

CAEPIA 2011

  • 1. The Data Era: Production, Consumption, Challenges Miriam Fernández 8th November, CAEPIA 2011 Website: http://people.kmi.open.ac.uk/miriam/about/ Twitter: @miri_fs Slide_share: http://www.slideshare.net/miriamfs
  • 3. How do humans infer knowledge? Alejandro in Chicago! Semantic interpretation A picture! Syntactic interpretation
  • 4. How do machines infer knowledge? Semantic interpretation A picture! Syntactic interpretation
  • 5. The Challenge • We need to find the way in which machines will interpret and extract knowledge for us! =
  • 7. The Data Era • The 2011 Digital University Study: Extracting Value from Chaos (IDC) – We have entered the Zettabyte era (a trillion gigabytes or a billion terabytes) – The great of information growth appears to be exceeding Moore’s Law http://www.emc.com/collateral/demos/ microsites/emc-digital-universe- 2011/index.htm
  • 8. Big Value from Data • Big Data: The next frontier for innovation, competition and productivity (McKinsey) – $300 billion potential annual value to US health care – €250 billion potential annual value to Europe’s public sector administration http://www.mckinsey.com/mgi/publications/big_data/pdfs/M GI_big_data_full_report.pdf
  • 9. IBM City Forward The Smarter Cities Challenge is a competitive grant program awarding $50 million worth of IBM expertise over the next three years to 100 cities around the globe. Designed to address the wide range of challenges facing cities today
  • 10. Consumption • We need to provide efficient ways to consume data in order to extract the value out of it, the knowledge – Syntactic approaches (visual analytics) • The data is collected, centralized and analysed • Visualizations for humans to extract knowledge – Semantic approaches • The information is distributed / interlinked • Semantic structures are added to the data so that machines can better understand it
  • 11. Syntactic approaches • Some examples – Gap Minder – IBM many eyes – Google Public Data Explorer – Google correlate – Google N-Gram viewer • What is the most popular hair colour in the literature?
  • 13. Semantic approaches • The Semantic Web is an extension of the current web in which information is given well-defined meaning, better enabling computers and people to work in cooperation Tim Berners-Lee, James Hendler, Ora Lassila, The Semantic Web, Scientific American, May 2001
  • 14. The SW vision • Use semantic structures (ontologies) to represent data. Provide machines with the ability to interpret and extract knowledge =
  • 15. Adding Structure • Two paths towards the SW vision – Metadata embedded in HTML • Microformats • RDFa • Microdata – Linked Data • Putting the data online in a standard, web enabled representation (RDF) • Make the data Web addressable (URIs)
  • 16. Metadata in HTML <div class="vcard"> • An example <div class="fn org">Knowledge Media Institute</div> <div class="adr"> Knowledge Media Institute <div class="street-address">Walton Hall</div> Walton Hall <div> Milton Keynes <span class="locality">Milton Keynes</span>, MK7 6AA <span class="postal-code">MK7 6AA</span> </div> <div class="country-name">United Kingdom</div> </div> </div>
  • 17. Metadata in HTML • Schema.org Semantically enhanced Information Retrieval: an ontology-based approach http://people.kmi.open.ac.uk/miriam/about/
  • 18. Metadata in HTML • The Open Graph protocol
  • 19. 2007 Linked Data 2008 2009 2010 Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
  • 20. Linked Data • An example http://data.semanticweb.org/person/miriam-fernandez/rdf <ns1:Person rdf:about="http://data.semanticweb.org/person/miriam- fernandez"> <rdf:type rdf:resource="http://xmlns.com/foaf/0.1/Person"/> @prefix dbpedia <http://dbpedia.org/resource/>. @prefix dbterm <http://dbpedia.org/property/>. dbpedia:Amsterdam dbterm:officialName “Amsterdam” ; dbterm:longd “4” ; dbterm:longm “53” ; dbterm:longs “32” ;…
  • 21. Open Government • Data.gov • Data.gov.uk • Many others… Research Funding Explorer
  • 22. BBC • Programs • Music • Artist • World Cup Who won it? ;)
  • 23. Open University DBPedia RAE Data from OpenLearn Research Content ORO Outputs Exposed as linked data, our data Archive of Library’s Course Currently: OUeach interlink withgeonames Catalogue public Material Of Digital data sit in the external other and different Content data.gov.uk systemsbecome to world: – hard part A/V Material of the “global data discover, obtain, Podcasts iTunesU space” on the Web integrate by users.BBC DBLP
  • 24. Data.open.ac.uk data.open.ac. uk
  • 25. The Value • Recognized as a critical step forward for the HE sector in the UK – Favor transparency and reuse of data, both externally and internally – Reduces cost of dealing with our own public data – Enable both new kinds of applications, and to make the ones that are already feasible more cost effective
  • 26.
  • 27. The Value • Linking educational material across universities http://smartproducts1.kmi.open.ac.uk/ web-linkeduniversities/index.htm
  • 28. The Value • Exploring research communities
  • 29. The Value • And many others….
  • 30. Conclusions • We have reached the Data Era – Production: currently more than a Zettabyte of information in the digital world and increasing really fast – Consumption: syntactic and semantic approaches have emerged to extract the value (the knowledge) out of the data – Challenges: Provide machines with the capabilities to extract the knowledge for us!
  • 31. Conclusions • Many more challenges ahead… – Different formats (text vs. multimedia) – Different dynamics (time / location) – Different provenance – Different topics (heterogeneous) – Distributed, Massive, stream – Various quality –…
  • 32. THX! • Any ideas to make me rich? ☺ = • Slide_share: http://www.slideshare.net/miriamfs • Website: http://people.kmi.open.ac.uk/miriam/about/ • Twitter: @miri_fs Thanks to Fouad Zablith and Mathieu d'Aquin ☺ for sharing with me some of their slides and for their valuable comments on this presentation