The Web of Data and its Five Stars

Richard Cyganiak, DERI, NUI Galway
@cygri

6 June 2012

Realising and Exploiting the EU data cloud
European Data Forum, Copenhagen, Denmark
Generating insight from data

• Today, data is abundant
• New middlemen find new ways of getting data to the end user
• Supply of and demand for data are higher than ever
• The analyst's problem is no longer a lack of relevant data, but:
  • Understanding data
  • Assessing applicability
  • Getting it into the right form for use
• Similar problems inside and outside of the firewall
From the Web to the Web of Data
Tim Berners-Lee’s 5-star plan for an open web of data

★ Make data available on the Web under an open license
★★ Make it available as structured data
★★★ Use a non-proprietary format
★★★★ Use URIs to identify things
★★★★★ Link your data to other people’s data to provide context
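One way to read the five stars is as a cumulative checklist, where each star counts only if all earlier ones are met. The sketch below assumes that reading. The `Dataset` class and its field names are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Dataset:
    # Hypothetical checklist fields, one per star, in order.
    on_web_open_license: bool
    structured: bool
    non_proprietary_format: bool
    uses_uris: bool
    links_to_other_data: bool


def star_rating(ds: Dataset) -> int:
    """Count stars cumulatively: stop at the first criterion that is not met."""
    criteria = [
        ds.on_web_open_license,
        ds.structured,
        ds.non_proprietary_format,
        ds.uses_uris,
        ds.links_to_other_data,
    ]
    stars = 0
    for met in criteria:
        if not met:
            break
        stars += 1
    return stars


# An openly licensed spreadsheet in a proprietary format stops at two stars,
# even if it happened to contain URIs.
spreadsheet = Dataset(True, True, False, True, False)
rating = star_rating(spreadsheet)
```

Under this reading, a publisher can improve one dataset one step at a time rather than aiming for five stars at once.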
The 0th star

• Data catalog with good metadata
• Make your data findable
Data on the Web, Open License ★
Open Data
Government data catalogs
Open vs. Closed

Data used to be closed by default.
In the future, it will be open by default.
Is open data just for governments?
Good reasons against opening data

• Privacy
• Competitive advantage
• Producing data and charging for it as a business model
• Can't get a license from upstream
Business models

Scott Brinker, http://www.chiefmartec.com/2010/01/7-business-models-for-linked-data.html
Data licenses

http://opendefinition.org/licenses/
Structured Data ★★
Enabling re-use

• Delivering data to end users in different forms
• Combining data with other data
• 3rd party analysis of data
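To make "combining data with other data" concrete, here is a minimal sketch that joins two small CSV tables on a shared key using only the Python standard library. The region codes, column names and numbers are invented for illustration and do not come from any real dataset.

```python
import csv
import io

# Two invented structured datasets that share a "region" column.
population_csv = "region,population\nR1,1000\nR2,2500\n"
spending_csv = "region,spending\nR1,50000\nR2,100000\n"


def read_rows(text: str) -> list:
    """Parse CSV text into a list of dicts keyed by column name."""
    return list(csv.DictReader(io.StringIO(text)))


def join_on(key: str, left: list, right: list) -> list:
    """Inner join: keep left rows that have a match in right, merging columns."""
    index = {row[key]: row for row in right}
    return [{**row, **index[row[key]]} for row in left if row[key] in index]


combined = join_on("region", read_rows(population_csv), read_rows(spending_csv))

# Third-party analysis becomes a one-liner once the data is structured.
spending_per_head = {
    row["region"]: int(row["spending"]) / int(row["population"])
    for row in combined
}
```

The same join against a PDF or a chart image would first require manual re-keying or screenscraping, which is the cost that the second star removes.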
Formats in government data

• Good for re-use: MS Excel, CSV, XML, JSON, Microdata
• Not so good for re-use: Pure websites, MS Word
• Bad for re-use: PDF
• Really bad for re-use: Only charts/maps without numbers
Symptom: Screenscraping
Non-Proprietary Formats ★★★
Specialist formats

• Specialist tools often have specialist formats
  • Few people have the tools
  • Expensive
  • Difficult to re-use
  • (Geospatial tools, statistics packages, etc.)
Non-proprietary formats, open standards

• CSV (dead simple)
• XML
• JSON
• RDF (good for 4+5 stars)
• OGC web services
• OAI-ORE web services
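Open formats need no specialist tooling to move between. As a rough sketch, the function below converts CSV text to JSON with nothing but the Python standard library. The function name and the sample record are hypothetical.

```python
import csv
import io
import json


def csv_to_json(csv_text: str) -> str:
    """Convert CSV text (with a header row) to a JSON array of objects."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    return json.dumps(rows, indent=2)


# Invented sample record, used only to show the round trip.
sample_csv = "id,name\n1,Alpha\n2,Beta\n"
sample_json = csv_to_json(sample_csv)
```

Note that CSV carries no type information, so every value arrives as a string. Deciding which columns are numbers or dates is left to the consumer.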
Use URIs as Identifiers ★★★★
http://www.bbc.co.uk/music/artists/79239441-bfd5-4981-a70c-55c3f15c1287
http://data.ordnancesurvey.co.uk/id/postcodeunit/HA99HD
http://opencorporates.com/companies/us_vt/F013910
Turning local identifiers into URIs–Why?

• Make them globally unique
• Clarify authority
• Make them resolvable
• Make them linkable

http://data.ordnancesurvey.co.uk/id/7000000000017765
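A minimal sketch of how a publisher might mint such URIs from existing local identifiers. The `mint_uri` helper, the `data.example.org` base and the whitespace-stripping rule are assumptions made for illustration. They are not a description of how Ordnance Survey or any other publisher builds its URIs.

```python
from urllib.parse import quote


def mint_uri(base: str, local_id: str) -> str:
    """Turn a local identifier into an HTTP URI under a base the publisher controls."""
    # Normalise: drop all whitespace so "HA9 9HD" and "HA99HD" yield the same URI.
    normalised = "".join(local_id.split())
    if not normalised:
        raise ValueError("local identifier is empty")
    # Percent-encode anything that is not safe inside a single path segment.
    return base.rstrip("/") + "/" + quote(normalised, safe="")


# Hypothetical base URI; the domain name is what clarifies authority.
postcode_uri = mint_uri("http://data.example.org/id/postcodeunit/", "HA9 9HD")
```

The local identifier stays recognisable in the last path segment, while the HTTP scheme and the domain make it globally unique and resolvable.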
The schema level

By using URIs, connections that existed only in people's minds can be put explicitly into the data model.
Include Links to Other Data ★★★★★
Hyperlinks are the soul of the Web.
The Web of Data is no different.
Data links

[Diagram labels: Central Contractor Registration (CCR), Geonames]
Linked Data Principles

1. Use URIs to name things (not only documents, but also people, locations, concepts, etc.)
2. To enable agents (human users and machine agents alike) to look up those names, use HTTP URIs
3. When someone looks up a URI, provide useful information (structured data in RDF, SPARQL).
4. Include links to other URIs allowing agents to discover more things

http://www.w3.org/DesignIssues/LinkedData.html
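The principles can be sketched from the client side. The example below prepares, without sending, an HTTP lookup that asks for RDF (principles 2 and 3) and serialises one outgoing link as an N-Triples line (principle 4). The helper names and the `example.org` / `example.net` URIs are hypothetical. The use of `owl:sameAs` as the link predicate is one common choice, not something the slides prescribe.

```python
from urllib.request import Request

# Real OWL property URI, often used to say two URIs name the same thing.
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def build_lookup_request(uri: str) -> Request:
    """Prepare (but do not send) an HTTP GET that prefers RDF over HTML."""
    if not uri.startswith(("http://", "https://")):
        raise ValueError("principle 2 asks for HTTP URIs: " + uri)
    # Content negotiation: Turtle first, RDF/XML as a fallback.
    accept = "text/turtle, application/rdf+xml;q=0.9"
    return Request(uri, headers={"Accept": accept})


def link_triple(subject: str, predicate: str, obj: str) -> str:
    """Serialise one link between two URIs as an N-Triples line."""
    return f"<{subject}> <{predicate}> <{obj}> ."


# Hypothetical URIs in two different datasets.
local_place = "http://data.example.org/id/place/42"
remote_place = "http://other.example.net/id/place/abc"

request = build_lookup_request(local_place)
link = link_triple(local_place, OWL_SAME_AS, remote_place)
```

Passing `request` to `urllib.request.urlopen` would perform the actual lookup. Whether a given server honours the `Accept` header depends on how that server is configured.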
Summary

• In the future, data will be open by default, unless there is a good reason not to
• Emergence of a web of data
• “Five-star plan” for getting there, dataset by dataset
  • 2 stars: re-usable data!
  • 3 stars: open standards!
  • 4+5 stars: connect the silos!
Thank You!

richard@cyganiak.de
@cygri
