SlideShare una empresa de Scribd logo
1 de 44
Descargar para leer sin conexión
The Joy of Data
A cookbook for publishing
 Linked Data on the Web
     Bernadette Hyland, CEO
        3 Round Stones, Inc
    bhyland@3roundstones.com
A pragmatic
            approach to
publishing & consuming
           Linked Data
Agenda
• Setting the scene

• Ingredients ... we use a cooking analogy

• Open standards & best practices

• Data modeling without context

• Social contract as a publisher

• Next steps
Setting the scene ...
   where should we
             focus?
We’ll review
• Converting data into RDF

• The social contract publishers
  make

• The importance of announcing

• Where to turn for guidance
Why should we care?
• We     pretend our organizations are hierarchical -- they aren’t

• Information      is power.

  • Combining       information from different sources is very
       powerful.

• The    US data warehouse market in 2010 was $10B

• In   2012 expected to grow to $13.5B
World changing phenomenon

     Linked Data approach, we can begin to address the
• Using
 non-hierarchical nature of our organizations

• We   can combine information sources

• The W3C  has defined standards that enable interoperability
 and allow us to freely move data
We are sowing the
 seeds for nothing
         short of a
        revolution
What does it take?
• The ingredients list ...

• Thinking differently about your
  data

• Modeling for re-use

• Summary of process in 7 steps
“The change from atoms to bits is irrevocable
             and unstoppable”
                            Being Digital by Nicolas Negroponte
We use URIs to describe both bits & atoms ...

 Information resources are things that
 computers understand, e.g., Web pages, images,
 CSS files, etc.

 Non-information resources are atoms, e.g.,
 people, places, events, things, concepts, etc.
• A different way of thinking about
  data

• The Open World Assumption

• Lots of URIs

• To be citizen of the world (not
  everyone speaks English)

• To publish useful information &
  announce it!
Peeling the
  onion ....
Machine readable
and Human Readable (or edible)
Publish machine & human
      readable content
• Machine readable format
• Human-readable descriptions of your data set
• Increase visibility with search engines
 • Include RDFa or other microformats
 • Publish a voID description of your RDF dataset
100%

                                                                 House email
            90%


                                                              SEO
            80%
                                                                           Paid search
                          Banners,
            70%           buttons
                                             Text-link ads
Usage >>>




                                                     Affiliate Marketing
            60%                                            Behavioral
                                          Contextual        targeting
                                           targeting
                        Rented email
                            lists
            50%                        Rich media/
                                          video


            40%
                        Pop-ups/
                       pop-unders
            30%
                  0%        10%        20%           30%         40%           50%       60%
                       Marketers Reporting “Great” Return on Investment
Model without
      context
There is a Process



Identify   Model   Name    Describe   Convert   Publish




                          Maintain
Preparation
1. Leverage what exists
• Request a copy of the logical and physical model of the
   database(s)
• Obtain data extracts (i.e., databases and/or spreadsheets)
   or create data in a way that can be replicated.
Modeling the data
2. Model data without context to allow for
   reuse and easier merging of data sets

 • Traditional
            DBAs organize data for specified
  Web services or applications.

 • With LD, application logic does not drive the
  data schema, concepts, etc.
Modeling the data
3. Look for real world objects of interest (e.g., people, places,
   things, locations, etc.) and model them.
• Investigate how others are already modeling similar or
    related data.
• Look for duplication and normalize the data
• Use common sense to decide whether or not to make link
Modeling the data ...
4. Connect data from different sources and authoritative
  vocabularies (see list of popular vocabularies below).
• Use URIs as names for your objects
Modeling the data ...

• Put aside immediate needs of any application
• Don’t think about how an application will use your data
• Do think about time and how the data will change over
  time.
Convert, Publish & Maintain

5. Write a script or process to convert the data set
   repeatedly

6. Publish to the Web and announce it! (more details shortly)

7. Maintenance strategy (more details in the social contract at
   the end)
Take the plunge ... Be forgiving

 •   Simplistic data models can still be useful

 •   Better to make progress with something rather than do
     nothing because we cannot be comprehensive and
     complete
Take an iterative approach
1. Review of modeling decisions

2. Review vocabularies chosen and developed

3. Modify/update data conversion scripts

4. Do a maintenance walk-through with real use cases

5. Show how to explore data with SPARQL and
   visualizations

6. Discuss a persistent identifier strategy (think PURLs)
shared innovation™




29
Describe your
         data
Data stewards should....

• Make data accessible via the Web’s standard
  access mechanism, specifically http URIs
• Represent data in a common format,
  such as RDF/XML, Notation-3 (N3), Turtle, N-
  Triples, RDFa, and RDF/JSON
• Provide self describing data
Linked Data Formats
• RDF/XML - RDF for XML pipelines

• Turtle - Human-readable RDF

• XHTML with GRDDL transformation

• XHTML with embedded RDFa

• RDF Schema - Describing structure
In a tart, smoothie or
 margarita ... berries
   can be combined in
       different ways
Merging data
Guidelines for merging

• URIs name the resources we are describing
• Two people using the same URI are describing the same
  thing
• The same URI in two datasets means the same thing
• Graphs from several different sources can be merged;
• Resources with the same URI are considered identical;
• No limitations on which graphs can be merged.
Announcing the
      finished
      product!
•Inform the LOD
 developer community
 (linkeddata.org, W3 lists)
•Announce to search
 engines (RDFa hints, register
 to make accessible)
•Publish human readable
 descriptions
•Encourage interlinking
•Publish schema as voID
•Include SPARQL
 endpoint
ACCEPTABLE ROI FOR IT

         4%   17%
   13%


 16%


                    6 months
              49%   12 months
                    18 months
                    24 months
                    More than 24 months
The Social Contract ...
                      The not so fine print


• LOD is a social contract to provide the public with information
• Follow best practices for modeling
• Carefully consider your URI strategy
• Ensure that your LOD remains available where you say it will be
• Publish voID description
• For a government agency ... a data policy is “a must”
  • specify data quality and retention, treatment of data thru
    secondary sources, restrictions for use, frequency of updates,
    public participation, and applicability of this data policy
We’ve created
someting quite
     beautiful
Reading




    http://linkeddatabook.com/editions/1.0/

http://3roundstones.com/linking-enterprise-data/
This work is Copyright © 2011 3 Round Stones Inc.
It is licensed under the Creative Commons Attribution 3.0 Unported License
Full details at: http://creativecommons.org/licenses/by/3.0/

You are free:

            to Share — to copy, distribute and transmit the work



            to Remix — to adapt the work



Under the following conditions:
            Attribution. You must attribute the work in the manner specified by the
            author or licensor (but not in any way that suggests that they endorse
            you or your use of the work).
•   For any reuse or distribution, you must make clear to others the license terms of this work.
•   Any of the above conditions can be waived if you get permission from the copyright holder.
•   Nothing in this license impairs or restricts the author's moral rights.
•   Some Content in the work may be licensed under different terms, this is noted separately.
Bernadette Hyland SemTech 2011 West - Linked Data Cookbook

Más contenido relacionado

Similar a Bernadette Hyland SemTech 2011 West - Linked Data Cookbook

A Data Citation Roadmap for Scholarly Data Repositories
A Data Citation Roadmap for Scholarly Data RepositoriesA Data Citation Roadmap for Scholarly Data Repositories
A Data Citation Roadmap for Scholarly Data RepositoriesLIBER Europe
 
Metadata Management In A Social Media World, Spsbos, 2 2010
Metadata Management In A Social Media World, Spsbos, 2 2010Metadata Management In A Social Media World, Spsbos, 2 2010
Metadata Management In A Social Media World, Spsbos, 2 2010Christian Buckley
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareIMC Technologies
 
Linked open data project
Linked open data projectLinked open data project
Linked open data projectFaathima Fayaza
 
Linked data and the future of libraries
Linked data and the future of librariesLinked data and the future of libraries
Linked data and the future of librariesRegan Harper
 
Towards Semantic APIs for Research Data Services (Invited Talk)
Towards Semantic APIs for Research Data Services (Invited Talk)Towards Semantic APIs for Research Data Services (Invited Talk)
Towards Semantic APIs for Research Data Services (Invited Talk)Anna Fensel
 
Data Citation Implementation Guidelines By Tim Clark
Data Citation Implementation Guidelines By Tim ClarkData Citation Implementation Guidelines By Tim Clark
Data Citation Implementation Guidelines By Tim Clarkdatascienceiqss
 
SHARE Notification Service, December 2014
SHARE Notification Service, December 2014SHARE Notification Service, December 2014
SHARE Notification Service, December 2014SHARE
 
COAR: All About the SHared Access Research Ecosystem (SHARE)
COAR: All About the SHared Access Research Ecosystem (SHARE)COAR: All About the SHared Access Research Ecosystem (SHARE)
COAR: All About the SHared Access Research Ecosystem (SHARE)CASRAI
 
Fox-Keynote-Now and Now of Data Publishing-nfdp13
Fox-Keynote-Now and Now of Data Publishing-nfdp13Fox-Keynote-Now and Now of Data Publishing-nfdp13
Fox-Keynote-Now and Now of Data Publishing-nfdp13DataDryad
 
Delivering a Linked Data warehouse and realising the power of graphs
Delivering a Linked Data warehouse and realising the power of graphsDelivering a Linked Data warehouse and realising the power of graphs
Delivering a Linked Data warehouse and realising the power of graphsBen Gardner
 
Back to the Future: The Reinvention of the Library Catalog, Yesterday, Today,...
Back to the Future: The Reinvention of the Library Catalog, Yesterday, Today,...Back to the Future: The Reinvention of the Library Catalog, Yesterday, Today,...
Back to the Future: The Reinvention of the Library Catalog, Yesterday, Today,...zepheiraorg
 
Metadata, Open Access and More: Crossref presentation
Metadata, Open Access and More: Crossref presentationMetadata, Open Access and More: Crossref presentation
Metadata, Open Access and More: Crossref presentationCrossref
 

Similar a Bernadette Hyland SemTech 2011 West - Linked Data Cookbook (20)

A Data Citation Roadmap for Scholarly Data Repositories
A Data Citation Roadmap for Scholarly Data RepositoriesA Data Citation Roadmap for Scholarly Data Repositories
A Data Citation Roadmap for Scholarly Data Repositories
 
Linked data 20171106
Linked data 20171106Linked data 20171106
Linked data 20171106
 
Metadata Management In A Social Media World, Spsbos, 2 2010
Metadata Management In A Social Media World, Spsbos, 2 2010Metadata Management In A Social Media World, Spsbos, 2 2010
Metadata Management In A Social Media World, Spsbos, 2 2010
 
Alamw15 VIVO
Alamw15 VIVOAlamw15 VIVO
Alamw15 VIVO
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the Software
 
Linked open data project
Linked open data projectLinked open data project
Linked open data project
 
Linked data and the future of libraries
Linked data and the future of librariesLinked data and the future of libraries
Linked data and the future of libraries
 
Towards Semantic APIs for Research Data Services (Invited Talk)
Towards Semantic APIs for Research Data Services (Invited Talk)Towards Semantic APIs for Research Data Services (Invited Talk)
Towards Semantic APIs for Research Data Services (Invited Talk)
 
Data Citation Implementation Guidelines By Tim Clark
Data Citation Implementation Guidelines By Tim ClarkData Citation Implementation Guidelines By Tim Clark
Data Citation Implementation Guidelines By Tim Clark
 
SHARE Notification Service, December 2014
SHARE Notification Service, December 2014SHARE Notification Service, December 2014
SHARE Notification Service, December 2014
 
Tec2010 Buckley Share
Tec2010 Buckley ShareTec2010 Buckley Share
Tec2010 Buckley Share
 
NISO Webinar: Library Linked Data: From Vision to Reality
NISO Webinar: Library Linked Data: From Vision to RealityNISO Webinar: Library Linked Data: From Vision to Reality
NISO Webinar: Library Linked Data: From Vision to Reality
 
Linked (Open) Data
Linked (Open) DataLinked (Open) Data
Linked (Open) Data
 
COAR: All About the SHared Access Research Ecosystem (SHARE)
COAR: All About the SHared Access Research Ecosystem (SHARE)COAR: All About the SHared Access Research Ecosystem (SHARE)
COAR: All About the SHared Access Research Ecosystem (SHARE)
 
Fox-Keynote-Now and Now of Data Publishing-nfdp13
Fox-Keynote-Now and Now of Data Publishing-nfdp13Fox-Keynote-Now and Now of Data Publishing-nfdp13
Fox-Keynote-Now and Now of Data Publishing-nfdp13
 
Delivering a Linked Data warehouse and realising the power of graphs
Delivering a Linked Data warehouse and realising the power of graphsDelivering a Linked Data warehouse and realising the power of graphs
Delivering a Linked Data warehouse and realising the power of graphs
 
1 d.1
1 d.11 d.1
1 d.1
 
Back to the Future: The Reinvention of the Library Catalog, Yesterday, Today,...
Back to the Future: The Reinvention of the Library Catalog, Yesterday, Today,...Back to the Future: The Reinvention of the Library Catalog, Yesterday, Today,...
Back to the Future: The Reinvention of the Library Catalog, Yesterday, Today,...
 
Metadata, Open Access and More: Crossref presentation
Metadata, Open Access and More: Crossref presentationMetadata, Open Access and More: Crossref presentation
Metadata, Open Access and More: Crossref presentation
 
The Power of Data
The Power of DataThe Power of Data
The Power of Data
 

Más de Bernadette Hyland-Wood

ChangeMakeHer Talk on STEM Careers in Australia & beyond
ChangeMakeHer Talk on STEM Careers in Australia & beyondChangeMakeHer Talk on STEM Careers in Australia & beyond
ChangeMakeHer Talk on STEM Careers in Australia & beyondBernadette Hyland-Wood
 
Women in IT - Empowering a Healthier Future
Women in IT - Empowering a Healthier FutureWomen in IT - Empowering a Healthier Future
Women in IT - Empowering a Healthier FutureBernadette Hyland-Wood
 
Why Consider Software Engineering as a Career
Why Consider Software Engineering as a CareerWhy Consider Software Engineering as a Career
Why Consider Software Engineering as a CareerBernadette Hyland-Wood
 
Diversity & Inclusion in the Workplace - CTO School Brisbane AU
Diversity & Inclusion in the Workplace - CTO School Brisbane AUDiversity & Inclusion in the Workplace - CTO School Brisbane AU
Diversity & Inclusion in the Workplace - CTO School Brisbane AUBernadette Hyland-Wood
 
Being Prepared for Life & a Career in the 21st Century
Being Prepared for Life & a Career in the 21st CenturyBeing Prepared for Life & a Career in the 21st Century
Being Prepared for Life & a Career in the 21st CenturyBernadette Hyland-Wood
 
Linking Open Government Data at Scale
Linking Open Government Data at Scale Linking Open Government Data at Scale
Linking Open Government Data at Scale Bernadette Hyland-Wood
 
3 Round Stones Briefing to U.S. EPA's Chief Data Scientist on Open Data
3 Round Stones Briefing to U.S. EPA's Chief Data Scientist on Open Data3 Round Stones Briefing to U.S. EPA's Chief Data Scientist on Open Data
3 Round Stones Briefing to U.S. EPA's Chief Data Scientist on Open DataBernadette Hyland-Wood
 
Brief on Linked Data at U.S. EPA to Chief Data Scientist
Brief on Linked Data at U.S. EPA to Chief Data ScientistBrief on Linked Data at U.S. EPA to Chief Data Scientist
Brief on Linked Data at U.S. EPA to Chief Data ScientistBernadette Hyland-Wood
 
2015 ESRI Health and Human Services Presentation on GeoHealth.us
2015 ESRI Health and Human Services Presentation on GeoHealth.us2015 ESRI Health and Human Services Presentation on GeoHealth.us
2015 ESRI Health and Human Services Presentation on GeoHealth.usBernadette Hyland-Wood
 
Bernadette Hyland speaks at Startup Queensland Visiting Entrepreneurs Program...
Bernadette Hyland speaks at Startup Queensland Visiting Entrepreneurs Program...Bernadette Hyland speaks at Startup Queensland Visiting Entrepreneurs Program...
Bernadette Hyland speaks at Startup Queensland Visiting Entrepreneurs Program...Bernadette Hyland-Wood
 
Government Linked Data Projects in the Wild
Government Linked Data Projects in the WildGovernment Linked Data Projects in the Wild
Government Linked Data Projects in the WildBernadette Hyland-Wood
 
Linked Data Cookbook for Government Agencies, SemTech East, Washington DC 1-D...
Linked Data Cookbook for Government Agencies, SemTech East, Washington DC 1-D...Linked Data Cookbook for Government Agencies, SemTech East, Washington DC 1-D...
Linked Data Cookbook for Government Agencies, SemTech East, Washington DC 1-D...Bernadette Hyland-Wood
 
20111114 b hyland government data and publishers
20111114   b hyland government data and publishers20111114   b hyland government data and publishers
20111114 b hyland government data and publishersBernadette Hyland-Wood
 
CENDI Presentation on What's going on with Government Linked Data
CENDI Presentation on What's going on with Government Linked DataCENDI Presentation on What's going on with Government Linked Data
CENDI Presentation on What's going on with Government Linked DataBernadette Hyland-Wood
 
20111120 warsaw learning curve by b hyland notes
20111120 warsaw   learning curve by b hyland notes20111120 warsaw   learning curve by b hyland notes
20111120 warsaw learning curve by b hyland notesBernadette Hyland-Wood
 
Rapid Web Application Development for Linked Data
Rapid Web Application Development for Linked DataRapid Web Application Development for Linked Data
Rapid Web Application Development for Linked DataBernadette Hyland-Wood
 
Rapid Semantic Web Application Development
Rapid Semantic Web Application DevelopmentRapid Semantic Web Application Development
Rapid Semantic Web Application DevelopmentBernadette Hyland-Wood
 
Rapid semantic web app dev using Callimachus
Rapid semantic web app dev using CallimachusRapid semantic web app dev using Callimachus
Rapid semantic web app dev using CallimachusBernadette Hyland-Wood
 
Brief for W3C Government Linked Data Working Group 29-June 2011
Brief for W3C Government Linked Data Working Group 29-June 2011Brief for W3C Government Linked Data Working Group 29-June 2011
Brief for W3C Government Linked Data Working Group 29-June 2011Bernadette Hyland-Wood
 

Más de Bernadette Hyland-Wood (20)

ChangeMakeHer Talk on STEM Careers in Australia & beyond
ChangeMakeHer Talk on STEM Careers in Australia & beyondChangeMakeHer Talk on STEM Careers in Australia & beyond
ChangeMakeHer Talk on STEM Careers in Australia & beyond
 
Women in IT - Empowering a Healthier Future
Women in IT - Empowering a Healthier FutureWomen in IT - Empowering a Healthier Future
Women in IT - Empowering a Healthier Future
 
Why Consider Software Engineering as a Career
Why Consider Software Engineering as a CareerWhy Consider Software Engineering as a Career
Why Consider Software Engineering as a Career
 
Diversity & Inclusion in the Workplace - CTO School Brisbane AU
Diversity & Inclusion in the Workplace - CTO School Brisbane AUDiversity & Inclusion in the Workplace - CTO School Brisbane AU
Diversity & Inclusion in the Workplace - CTO School Brisbane AU
 
Being Prepared for Life & a Career in the 21st Century
Being Prepared for Life & a Career in the 21st CenturyBeing Prepared for Life & a Career in the 21st Century
Being Prepared for Life & a Career in the 21st Century
 
Linking Open Government Data at Scale
Linking Open Government Data at Scale Linking Open Government Data at Scale
Linking Open Government Data at Scale
 
3 Round Stones Briefing to U.S. EPA's Chief Data Scientist on Open Data
3 Round Stones Briefing to U.S. EPA's Chief Data Scientist on Open Data3 Round Stones Briefing to U.S. EPA's Chief Data Scientist on Open Data
3 Round Stones Briefing to U.S. EPA's Chief Data Scientist on Open Data
 
Brief on Linked Data at U.S. EPA to Chief Data Scientist
Brief on Linked Data at U.S. EPA to Chief Data ScientistBrief on Linked Data at U.S. EPA to Chief Data Scientist
Brief on Linked Data at U.S. EPA to Chief Data Scientist
 
2015 ESRI Health and Human Services Presentation on GeoHealth.us
2015 ESRI Health and Human Services Presentation on GeoHealth.us2015 ESRI Health and Human Services Presentation on GeoHealth.us
2015 ESRI Health and Human Services Presentation on GeoHealth.us
 
Bernadette Hyland speaks at Startup Queensland Visiting Entrepreneurs Program...
Bernadette Hyland speaks at Startup Queensland Visiting Entrepreneurs Program...Bernadette Hyland speaks at Startup Queensland Visiting Entrepreneurs Program...
Bernadette Hyland speaks at Startup Queensland Visiting Entrepreneurs Program...
 
Government Linked Data Projects in the Wild
Government Linked Data Projects in the WildGovernment Linked Data Projects in the Wild
Government Linked Data Projects in the Wild
 
Linked Data Cookbook for Government Agencies, SemTech East, Washington DC 1-D...
Linked Data Cookbook for Government Agencies, SemTech East, Washington DC 1-D...Linked Data Cookbook for Government Agencies, SemTech East, Washington DC 1-D...
Linked Data Cookbook for Government Agencies, SemTech East, Washington DC 1-D...
 
20111114 b hyland government data and publishers
20111114   b hyland government data and publishers20111114   b hyland government data and publishers
20111114 b hyland government data and publishers
 
CENDI Presentation on What's going on with Government Linked Data
CENDI Presentation on What's going on with Government Linked DataCENDI Presentation on What's going on with Government Linked Data
CENDI Presentation on What's going on with Government Linked Data
 
20111101 b hyland-w3-c-tpac-egov
20111101 b hyland-w3-c-tpac-egov20111101 b hyland-w3-c-tpac-egov
20111101 b hyland-w3-c-tpac-egov
 
20111120 warsaw learning curve by b hyland notes
20111120 warsaw   learning curve by b hyland notes20111120 warsaw   learning curve by b hyland notes
20111120 warsaw learning curve by b hyland notes
 
Rapid Web Application Development for Linked Data
Rapid Web Application Development for Linked DataRapid Web Application Development for Linked Data
Rapid Web Application Development for Linked Data
 
Rapid Semantic Web Application Development
Rapid Semantic Web Application DevelopmentRapid Semantic Web Application Development
Rapid Semantic Web Application Development
 
Rapid semantic web app dev using Callimachus
Rapid semantic web app dev using CallimachusRapid semantic web app dev using Callimachus
Rapid semantic web app dev using Callimachus
 
Brief for W3C Government Linked Data Working Group 29-June 2011
Brief for W3C Government Linked Data Working Group 29-June 2011Brief for W3C Government Linked Data Working Group 29-June 2011
Brief for W3C Government Linked Data Working Group 29-June 2011
 

Último

Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observabilityitnewsafrica
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 

Último (20)

Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 

Bernadette Hyland SemTech 2011 West - Linked Data Cookbook

  • 1. The Joy of Data A cookbook for publishing Linked Data on the Web Bernadette Hyland, CEO 3 Round Stones, Inc bhyland@3roundstones.com
  • 2. A pragmatic approach to publishing & consuming Linked Data
  • 3. Agenda • Setting the scene • Ingredients ... we use a cooking analogy • Open standards & best practices • Data modeling without context • Social contract as a publisher • Next steps
  • 4. Setting the scene ... where should we focus?
  • 5. We’ll review • Converting data into RDF • The social contract publishers make • The importance of announcing • Where to turn for guidance
  • 6. Why should we care? • We pretend our organizations are hierarchical -- they aren’t • Information is power. • Combining information from different sources is very powerful. • The US data warehouse market in 2010 was $10B • In 2012 expected to grow to $13.5B
  • 7. World changing phenomenon Linked Data approach, we can begin to address the • Using non-hierarchical nature of our organizations • We can combine information sources • The W3C has defined standards that enable interoperability and allow us to freely move data
  • 8. We are sowing the seeds for nothing short of a revolution
  • 9. What does it take? • The ingredients list ... • Thinking differently about your data • Modeling for re-use • Summary of process in 7 steps
  • 10.
  • 11. “The change from atoms to bits is irrevocable and unstoppable” Being Digital by Nicolas Negroponte
  • 12. We use URIs to describe both bits & atoms ... Information resources are things that computers understand, e.g., Web pages, images, CSS files, etc. Non-information resources are atoms, e.g., people, places, events, things, concepts, etc.
  • 13. • A different way of thinking about data • The Open World Assumption • Lots of URIs • To be citizen of the world (not everyone speaks English) • To publish useful information & announce it!
  • 14. Peeling the onion ....
  • 16. and Human Readable (or edible)
  • 17. Publish machine & human readable content • Machine readable format • Human-readable descriptions of your data set • Increase visibility with search engines • Include RDFa or other microformats • Publish a voID description of your RDF dataset
  • 18. 100% House email 90% SEO 80% Paid search Banners, 70% buttons Text-link ads Usage >>> Affiliate Marketing 60% Behavioral Contextual targeting targeting Rented email lists 50% Rich media/ video 40% Pop-ups/ pop-unders 30% 0% 10% 20% 30% 40% 50% 60% Marketers Reporting “Great” Return on Investment
  • 19. Model without context
  • 20. There is a Process Identify Model Name Describe Convert Publish Maintain
  • 21. Preparation 1. Leverage what exists • Request a copy of the logical and physical model of the database(s) • Obtain data extracts (i.e., databases and/or spreadsheets) or create data in a way that can be replicated.
  • 22. Modeling the data 2. Model data without context to allow for reuse and easier merging of data sets • Traditional DBAs organize data for specified Web services or applications. • With LD, application logic does not drive the data schema, concepts, etc.
  • 23. Modeling the data 3. Look for real world objects of interest (e.g., people, places, things, locations, etc.) and model them. • Investigate how others are already modeling similar or related data. • Look for duplication and normalize the data • Use common sense to decide whether or not to make link
  • 24. Modeling the data ... 4. Connect data from different sources and authoritative vocabularies (see list of popular vocabularies below). • Use URIs as names for your objects
  • 25. Modeling the data ... • Put aside immediate needs of any application • Don’t think about how an application will use your data • Do think about time and how the data will change over time.
  • 26. Convert, Publish & Maintain 5. Write a script or process to convert the data set repeatedly 6. Publish to the Web and announce it! (more details shortly) 7. Maintenance strategy (more details in the social contract at the end)
  • 27. Take the plunge ... Be forgiving • Simplistic data models can still be useful • Better to make progress with something rather than do nothing because we cannot be comprehensive and complete
  • 28. Take an iterative approach 1. Review of modeling decisions 2. Review vocabularies chosen and developed 3. Modify/update data conversion scripts 4. Do a maintenance walk-through with real use cases 5. Show how to explore data with SPARQL and visualizations 6. Discuss a persistent identifier strategy (think PURLs)
  • 31. Data stewards should.... • Make data accessible via the Web’s standard access mechanism, specifically http URIs • Represent data in a common format, such as RDF/XML, Notation-3 (N3), Turtle, N- Triples, RDFa, and RDF/JSON • Provide self describing data
  • 32. Linked Data Formats • RDF/XML - RDF for XML pipelines • Turtle - Human-readable RDF • XHTML with GRDDL transformation • XHTML with embedded RDFa • RDF Schema - Describing structure
  • 33.
  • 34. In a tart, smoothie or margarita ... berries can be combined in different ways
  • 36. Guidelines for merging • URIs name the resources we are describing • Two people using the same URI are describing the same thing • The same URI in two datasets means the same thing • Graphs from several different sources can be merged; • Resources with the same URI are considered identical; • No limitations on which graphs can be merged.
  • 37. Announcing the finished product!
  • 38. •Inform the LOD developer community (linkeddata.org, W3 lists) •Announce to search engines (RDFa hints, register to make accessible) •Publish human readable descriptions •Encourage interlinking •Publish schema as voID •Include SPARQL endpoint
  • 39. ACCEPTABLE ROI FOR IT 4% 17% 13% 16% 6 months 49% 12 months 18 months 24 months More than 24 months
  • 40. The Social Contract ... The not so fine print • LOD is a social contract to provide the public with information • Follow best practices for modeling • Carefully consider your URI strategy • Ensure that your LOD remains available where you say it will be • Publish voID description • For a government agency ... a data policy is “a must” • specify data quality and retention, treatment of data thru secondary sources, restrictions for use, frequency of updates, public participation, and applicability of this data policy
  • 42. Reading http://linkeddatabook.com/editions/1.0/ http://3roundstones.com/linking-enterprise-data/
  • 43. This work is Copyright © 2011 3 Round Stones Inc. It is licensed under the Creative Commons Attribution 3.0 Unported License Full details at: http://creativecommons.org/licenses/by/3.0/ You are free: to Share — to copy, distribute and transmit the work to Remix — to adapt the work Under the following conditions: Attribution. You must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work). • For any reuse or distribution, you must make clear to others the license terms of this work. • Any of the above conditions can be waived if you get permission from the copyright holder. • Nothing in this license impairs or restricts the author's moral rights. • Some Content in the work may be licensed under different terms, this is noted separately.