Sharing Data on The Web
A Semantic Web Tutorial

James Leigh
@jamesrdf
james@3roundstones.com
February 2013

Towards Linked Open Data  ★
●   On the Web with an open license

    “It's great to have the data accessible on the Web under an
    open license, however, the data is locked-up in a document.
    Other than writing a custom scraper, it's hard to get the data
    out of the document.”

Towards Linked Open Data  ★★
●   On the Web with an open license
●   Machine-readable data

    “Splendid! The data is accessible on the Web in a structured
    way (that is, machine-readable), however, the data is still
    locked-up in a document. To get the data out of the document
    you depend on proprietary software.”

Towards Linked Open Data  ★★★
●   On the Web with an open license
●   Machine-readable data
●   Non-proprietary format

    “Excellent! The data is not only available via the Web but now
    everyone can use the data easily. On the other hand, it's still
    data on the Web and not data in the Web.”

Towards Linked Open Data  ★★★★
●   On the Web with an open license
●   Machine-readable data
●   Non-proprietary format
●   RDF standards

    “Wonderful! Now it's data in the Web. The (most important) data
    items have a URI and can be shared on the Web. A native way to
    represent the data is using RDF, however other formats such as
    Atom can be converted/mapped, if required.”

Towards Linked Open Data  ★★★★★
●   On the Web with an open license
●   Machine-readable data
●   Non-proprietary format
●   RDF standards
●   Linked RDF

    “Brilliant! Now it's data in the Web, linked to other data.
    Both the consumer and the publisher benefit from the network
    effect.”

Path to Success
●   Start easy
    –   Well curated
    –   Small dataset
●   Reach out to developers
●   Get others involved early
●   Ensure internal benefit
●   Maximize productivity
●   Integrate related datasets

Choosing a License
●   Allow modification?
●   Require attribution?
●   Permit commercial uses?
●   Must others share alike?
●   What jurisdiction?

Address Data Quality
●   Completeness
●   Conformity
●   Consistency
●   Accuracy
●   Integrity
●   Eliminate duplication
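
Two of these checks, completeness and duplicate elimination, are easy to automate before publishing. The sketch below is only an illustration: the record layout, the required field names, and the sample rows are assumptions, not part of the tutorial.

```python
# Hypothetical required fields; adjust to the dataset being published.
REQUIRED_FIELDS = ("id", "name", "province")

def missing_fields(record):
    """Return the required fields that are absent or empty in a record."""
    return [field for field in REQUIRED_FIELDS if not record.get(field)]

def deduplicate(records):
    """Keep the first record seen for each id, preserving input order."""
    seen = set()
    unique = []
    for record in records:
        key = record.get("id")
        if key not in seen:
            seen.add(key)
            unique.append(record)
    return unique

# Made-up sample rows: one duplicate and one incomplete record.
records = [
    {"id": "1", "name": "Alice", "province": "ON"},
    {"id": "2", "name": "Bob", "province": ""},
    {"id": "1", "name": "Alice", "province": "ON"},
]

unique_records = deduplicate(records)
incomplete = {
    record["id"]: missing_fields(record)
    for record in unique_records
    if missing_fields(record)
}
print(len(unique_records), incomplete)
```

Running checks like these as part of the publishing step keeps quality problems from reaching consumers.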

Put It On The Web  ★
●   Upload it and share
●   Document what is available
●   Document how to use it
    –   Solve a consumer need
●   Encourage feedback
    –   Continuous improvement

Make It Machine-readable  ★★
●   Simplify data structure
●   Don't include unstable identifiers
●   Automate publishing

Use a Non-proprietary Format  ★★★
●   Standardized formats
    –   CSV instead of Excel
    –   XML instead of MDB
●   Benefits:
    –   Accessibility
    –   Interoperability
    –   Reduce risk of
        ●   Leaking confidential info
        ●   Software viruses
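
As a rough illustration of "CSV instead of Excel", tabular data can be exported with Python's standard `csv` module, so no proprietary software is needed to write or read it. The column names and rows here are made up for the example.

```python
import csv
import io

# Hypothetical rows to publish.
rows = [
    {"code": "ON", "name": "Ontario"},
    {"code": "QC", "name": "Quebec"},
]

def to_csv(rows, fieldnames):
    """Serialize a list of dicts as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()

csv_text = to_csv(rows, ["code", "name"])
print(csv_text)

# Any consumer can read the data back with the same standard module.
round_trip = list(csv.DictReader(io.StringIO(csv_text)))
```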

API Service  ★★★
●   Data access layer
●   Atom Publishing Protocol
●   Google Data Protocol
●   Open Data Protocol
●   Facebook Graph API
●   Linked Data

The Web  ★★★
●   Universal unidirectional links using URLs
    –   Graph of nodes identified by URL
●   Cooperation without coordination
●   It's simply nodes and links

The Semantic Web  ★★★
●   A universal scalable interconnection bus
●   Share and reuse data across
    –   Community boundaries
    –   Application boundaries
    –   Enterprise boundaries

(re)Model the Data  ★★★
●   Identify existing model
●   Reuse common vocabularies
●   Model data relationships
    –   Put aside particular needs of any application
●   Diagram relationships
●   Test assumptions with subject matter experts

Universal Identifiers  ★★★
●   Others can reference things
●   The Web uses URLs for downloadable things
●   Uses URIs for real-world things and abstract concepts
●   Two references with the same URI refer to the same thing
●   Quick, easy and scalable look-up

Publish RDF  ★★★★
●   RDF connects better than any other model
●   Cooperation without coordination
●   Data can be combined safely
●   Mix of vocabularies
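
One way to picture "data can be combined safely" is to treat an RDF graph as a set of (subject, predicate, object) triples: when every node is named by a URI, merging two graphs is just a set union (blank nodes need extra care and are left out here). The sketch below uses plain Python tuples rather than an RDF library, and the `example.org` URIs are assumptions. The FOAF and Dublin Core Terms namespaces are real and show a mix of vocabularies in one graph.

```python
FOAF = "http://xmlns.com/foaf/0.1/"
DCTERMS = "http://purl.org/dc/terms/"

# Two independently published graphs (hypothetical example.org URIs).
people = {
    ("http://example.org/id/james", FOAF + "name", "James Leigh"),
}
documents = {
    ("http://example.org/doc/tutorial", DCTERMS + "title", "Sharing Data on The Web"),
    ("http://example.org/doc/tutorial", DCTERMS + "creator", "http://example.org/id/james"),
}

# Merging needs no coordination between publishers: shared URIs line up.
merged = people | documents

def objects(graph, subject, predicate):
    """Return all objects of triples matching the subject and predicate."""
    return {o for s, p, o in graph if s == subject and p == predicate}

creators = objects(merged, "http://example.org/doc/tutorial", DCTERMS + "creator")
creator_names = {
    name for creator in creators for name in objects(merged, creator, FOAF + "name")
}
print(creator_names)
```

The query for the creator's name only works on the merged graph, since neither source graph holds both statements.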

VoID Description  ★★★★
●   De facto standard for describing Linked Data
●   Used in linked data catalogues
●   Include provenance and attributions
●   Associate a license for consumers
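
A VoID description is itself a small piece of RDF. The sketch below builds one as Turtle text with plain string formatting. The `void:Dataset`, `void:dataDump`, `void:triples`, `dcterms:title`, `dcterms:creator` and `dcterms:license` terms come from the VoID and Dublin Core vocabularies, while the dataset URIs and the triple count are placeholders. The helper assumes the title contains no quote characters; a real publisher would likely use an RDF library to handle escaping.

```python
def void_description(dataset_uri, title, creator_uri, license_uri, dump_uri, triples):
    """Return a minimal VoID description of a dataset as Turtle text."""
    lines = [
        "@prefix void: <http://rdfs.org/ns/void#> .",
        "@prefix dcterms: <http://purl.org/dc/terms/> .",
        "",
        f"<{dataset_uri}> a void:Dataset ;",
        f'    dcterms:title "{title}" ;',
        f"    dcterms:creator <{creator_uri}> ;",
        f"    dcterms:license <{license_uri}> ;",
        f"    void:dataDump <{dump_uri}> ;",
        f"    void:triples {triples} .",
    ]
    return "\n".join(lines) + "\n"

# All example.org values below are hypothetical.
turtle = void_description(
    dataset_uri="http://example.org/dataset/provinces",
    title="Example provinces dataset",
    creator_uri="http://example.org/id/publisher",
    license_uri="http://creativecommons.org/licenses/by/3.0/",
    dump_uri="http://example.org/dumps/provinces.ttl",
    triples=1200,
)
print(turtle)
```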

RDF Schema  ★★★★
●   Understand structure of the graph
    –   Self-describing
●   Define terms and relationships
●   Classes of things
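
To make "classes of things" concrete, the sketch below applies one RDFS rule by hand: if a resource has a type, it also has every superclass of that type reachable through `rdfs:subClassOf`. The `ex:` names are hypothetical, prefixed names stand in for full URIs, and a real RDFS reasoner covers more rules than this.

```python
RDF_TYPE = "rdf:type"
SUBCLASS_OF = "rdfs:subClassOf"

# Hypothetical schema and data, written as (subject, predicate, object) triples.
schema = {
    ("ex:City", SUBCLASS_OF, "ex:Place"),
    ("ex:Place", SUBCLASS_OF, "ex:Thing"),
}
data = {("ex:ottawa", RDF_TYPE, "ex:City")}

def superclasses(cls, schema):
    """Return every class reachable from cls via rdfs:subClassOf."""
    found = set()
    stack = [cls]
    while stack:
        current = stack.pop()
        for s, p, o in schema:
            if s == current and p == SUBCLASS_OF and o not in found:
                found.add(o)
                stack.append(o)
    return found

def infer_types(data, schema):
    """Return data plus the rdf:type triples implied by the class hierarchy."""
    inferred = set(data)
    for s, p, o in data:
        if p == RDF_TYPE:
            inferred |= {(s, RDF_TYPE, parent) for parent in superclasses(o, schema)}
    return inferred

all_types = infer_types(data, schema)
print(sorted(all_types))
```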

Simple Knowledge Organization System (SKOS)  ★★★★
●   Controlled vocabularies
●   Taxonomies
●   Thesauri

Controlled Vocabularies  ★★★★
●   Strongly agree                ●   Mr.      ●   AB
●   Agree                         ●   Ms.      ●   BC
●   Somewhat agree                ●   Mrs.     ●   MB
●   Neither agree nor disagree    ●   Miss     ●   NB
●   Somewhat disagree             ●   Dr.      ●   NL
●   Disagree                                   ●   NS
●   Strongly disagree                          ●   ON
                                               ●   PE
                                               ●   QC
                                               ●   SK
                                               ●   NT
                                               ●   NU
                                               ●   YT
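
A controlled vocabulary like the two-letter codes above can be published as SKOS concepts, each with its own URI, a `skos:notation` (the code) and a `skos:prefLabel` (the display name). The sketch below models three concepts as plain dictionaries and resolves free-text input to a concept URI. The `example.org` scheme URI is an assumption, and only a few codes are included to keep it short.

```python
# Hypothetical concept scheme base URI.
SCHEME = "http://example.org/vocab/province/"

CONCEPTS = {
    SCHEME + "BC": {"skos:notation": "BC", "skos:prefLabel": "British Columbia"},
    SCHEME + "ON": {"skos:notation": "ON", "skos:prefLabel": "Ontario"},
    SCHEME + "QC": {"skos:notation": "QC", "skos:prefLabel": "Quebec"},
}

def concept_for(code):
    """Return the concept URI for a code, or None if it is not in the vocabulary."""
    normalized = code.strip().upper()
    for uri, properties in CONCEPTS.items():
        if properties["skos:notation"] == normalized:
            return uri
    return None

print(concept_for(" on "))
print(concept_for("XX"))
```

Values that fail the look-up can be rejected or flagged before the data is published.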

Taxonomies  ★★★★

Thesauri  ★★★★

Web Ontology Language  ★★★★
●   Property restrictions
●   Equivalences across ontologies
●   Constraints
●   Automatic reasoners
●   Annotations
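
As a small illustration of "equivalences across ontologies", the sketch below treats `owl:equivalentProperty` as a symmetric, transitive relation and uses it to query data written with either property. The equivalence between `foaf:name` and `schema:name` is asserted here purely for the example, the `ex:` resources are made up, and a real OWL reasoner does far more than this.

```python
EQUIVALENT = "owl:equivalentProperty"

# Assumed mapping for illustration only.
ontology = {("foaf:name", EQUIVALENT, "schema:name")}

# Hypothetical data using two different vocabularies.
data = {
    ("ex:james", "foaf:name", "James Leigh"),
    ("ex:org", "schema:name", "Example Org"),
}

def equivalents(prop, ontology):
    """Return prop plus every property declared equivalent to it."""
    group = {prop}
    changed = True
    while changed:
        changed = False
        for s, p, o in ontology:
            # Grow the group when exactly one side of a mapping is already in it.
            if p == EQUIVALENT and (s in group) != (o in group):
                group |= {s, o}
                changed = True
    return group

def values(data, prop, ontology):
    """Return (subject, value) pairs for prop or any equivalent property."""
    props = equivalents(prop, ontology)
    return {(s, o) for s, p, o in data if p in props}

names = values(data, "foaf:name", ontology)
print(sorted(names))
```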

Linked RDF  ★★★★★
●   Link your data to others
●   Adds more related data
●   Add value to the data
    –   By joining it to other data
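
The "joining" step can be sketched with an `owl:sameAs` link from a local resource to a resource in someone else's dataset: following the link lets statements from the other dataset enrich the local one. Both graphs and all prefixed names below are hypothetical, and in practice the remote statements would be fetched from the Web rather than held in a local variable.

```python
SAME_AS = "owl:sameAs"

# Local dataset with one outbound link (hypothetical names).
local = {
    ("ex:ottawa", "ex:office", "ex:headquarters"),
    ("ex:ottawa", SAME_AS, "other:Ottawa"),
}

# Statements published by another party about their own resource.
remote = {("other:Ottawa", "other:country", "other:Canada")}

def enrich(local, remote):
    """Copy remote statements onto local subjects linked by owl:sameAs."""
    links = {(s, o) for s, p, o in local if p == SAME_AS}
    added = {
        (subject, predicate, value)
        for subject, target in links
        for s, predicate, value in remote
        if s == target
    }
    return local | added

enriched = enrich(local, remote)
print(sorted(enriched))
```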

Generating Data Websites  ★★★★★
●   Give ordinary users direct access to the data
●   Templates to generate HTML
    –   Generate static HTML offline
    –   Produce HTML on-demand
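
A minimal sketch of template-driven HTML, using only Python's standard library: the same `render` function could be called once per resource to write static files offline, or per request to produce pages on demand. The template markup and the sample properties are assumptions for illustration.

```python
from html import escape
from string import Template

# Hypothetical page and list-item templates.
PAGE = Template(
    "<html><head><title>$title</title></head>"
    "<body><h1>$title</h1><ul>\n$items</ul></body></html>"
)
ITEM = Template("<li>$label: $value</li>\n")

def render(title, properties):
    """Render one resource's properties as an HTML page, escaping all values."""
    items = "".join(
        ITEM.substitute(label=escape(str(label)), value=escape(str(value)))
        for label, value in properties.items()
    )
    return PAGE.substitute(title=escape(title), items=items)

page = render("Ottawa", {"province": "ON", "note": "<b>capital</b>"})
print(page)
```

Escaping every value keeps markup in the data from leaking into the page.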

Social Responsibility
●   Responsibility to maintain published data
●   Publish frequency of data updates
●   Have a persistence strategy
●   Ensure data is as accurate as possible
●   Respond to reports of problematic data

Comments or Questions?
