SlideShare una empresa de Scribd logo
1 de 44
Semantic web and linked data
     for data set publication
         Dave Reynolds, Epimorphics Ltd
                                @der42
Outline
   Background on linked data
   Roles in data set publishing
   Case study: Environment Agency
   Lessons
Linked data background
Linked data ...

    publishing data on the web ...

   ... to enable integration, linking and reuse
       across silos
Linked data
Apply the principles to the web to publication of data
The linked data web:
     is a global network of things
     each identified by a URI
     fetching a URI gives a set of statements   in RDF
     things connected by typed links
     open, anyone can say anything about anything else


Linked data is “data you can click on”
Example schools information
         http://education.data.gov.uk/id/school/401874
Example schools information
                 http://education.data.gov.uk/id/school/401874   a        School


                  label                              phase
                                  district                           “Secondary”
“Cardiff High School”

                                 “Cardiff”
Example schools information
                  http://education.data.gov.uk/id/school/401874               a        school:School



                                                          phase
                       label
                                      district                                school:PhaseOfEducation_Secondary
“Cardiff High School”

             http://statistics.data.gov.uk/id/local-authority-district/00PT   label          “Cardiff”
Example schools information
                  http://education.data.gov.uk/id/school/401874               rdf:type      school:School


                  rdfs:label                       school:phase

                                    school:district                                school:PhaseOfEducation_Secondary
“Cardiff High School”

             http://statistics.data.gov.uk/id/local-authority-district/00PT        rdfs:label     “Cardiff”
Example schools information
                  http://education.data.gov.uk/id/school/401874               rdf:type      school:School


                  rdfs:label                       school:phase

                                    school:district                                school:PhaseOfEducation_Secondary
“Cardiff High School”

             http://statistics.data.gov.uk/id/local-authority-district/00PT        label          “Cardiff”



             http://data.ordnancesurvey.co.uk/id/7000000000025484


     admingeo:ward
                                                       spatial:extent

                      admingeo:parish
                                                              GML: 310499.4 184176.6 310476.5 ...
Example schools information
                  http://education.data.gov.uk/id/school/401874               rdf:type      school:School


                  rdfs:label                       school:phase

                                    school:district                                school:PhaseOfEducation_Secondary
“Cardiff High School”

             http://statistics.data.gov.uk/id/local-authority-district/00PT        label          “Cardiff”

                                    owl:sameAs

             http://data.ordnancesurvey.co.uk/id/7000000000025484


     admingeo:ward
                                                       spatial:extent

                      admingeo:parish
                                                              GML: 310499.4 184176.6 310476.5 ...
Role in data set publication
   well suited to describing things
       schools, companies, animal species, music tracks, tv programmes ...

   what about datasets?
       environmental measurements, experimental results, statistical analyses ...
Approach 1 : Data catalogues
   treat the dataset as a single resource, identify with a URI
   provide metadata as linked data
       descriptive
       categorical
       technical and structural


Benefits?
       separate of metadata from resource & repository
       easy aggregation of metadata into catalogues
       schema-less enables use-specific annotations and links
       use of sharable category schemes and reference data
=> support for discovery
Approach 2 : Fine grain publication
   publish the data set itself as linked data
       entities, terms, individual records in data identified by URIs
       data set structure and ontologies linked from data
       still include dataset metadata


Benefits?
       all benefits of approach 1 to support discovery
       self-describing
       data slices addressable (trace back, provenance, annotation)
       integration across sets - reuse of terms for dimensions, units, values
       fine grained access
=> integration, comparison, context, data as a service
bathing water quality

                                              what we do...

                            start of season

                                  15th May                                  Press interest




                                               bathing season
what information                                                20-22 samples in 22weeks
is relevant to the public
about beaches
                                30th Sept
                            annual report
                    what       November
                    we do

                               December
how linkable data helps
             Tenby
             Tourist Information Centre
             Unit 2 , The Gateway Complex
             Tenby. Wales , SA70 7LT
             Tel: 01834 842 402
             Fax: 01834 845 439
             Email: tenby.tic@pembrokeshire.gov.uk




                                                     Photo by Skellig2008 (flickr)
Publishing the Bathing Water Quality data set

                             Bathing           Sampling          Zones Of              Assessment
  Vocabularies
                             Waters             Points           Influence                 s


                                                                          e.g. http://location.data.gov.uk/def/ef/SampingPoint



       URI Set
                             Bathing           Sampling               Zone Of
Reference Data               Waters             Points               Influence

                                                                e.g. http://location.data.gov.uk/so/ef/SamplingPoint/bwsp.eaew



                                            Assessme            http://environment.data.gov.uk/data/bathing-water-quality
 Observation
                                               nt
   Datasets
                              void:subset              void:subset

                                                       In-season
                                Annual
                                                         Weekly
            .../compliance     Complianc                               .../in-season
                                                       Assessme
                                  e
                                                           nt
Data cube vocabulary
   collaborative development
    sponsored by data.gov.uk
   simple, flexible vocabulary
   mirrors core information models from:
        SDMX (Statistical Data and Metadata eXchange)
        DDI (Data Documentation Initiative)
   extension to SCOVO vocabulary




image: dullhunk @ flickr
Data cube model
    A set of observations
     indexed by dimensions
     describing measures
     interpreted according to attributes
(e.g. region)
 dimension




                                measure(s)    attributes


                              • population   unit of measure = count
                                = 32,567     status = preliminary
                                             ...



                dimension
                (e.g. time)
Data cube vocabulary
1. Top level
   DataSet                        qb:DataStructureDefinition
                                                                         qb:component

       provenance and metadata                                         qb:sliceKey

       structure                   qb:structure

                                   qb:DataSet                      qb:SliceKey
                                                      qb:slice
                                                                 qb:sliceStructure
                                    qb:dataset
                                                      qb:Slice

                                                                    qb:subSlice



                                                       qb:observation

                                  qb:Observation
                                   dimension values
                                   measure value(s)
                                   attribute values
Data cube vocabulary
1. Top level
   DataSet                               qb:DataStructureDefinition
                                                                                qb:component

       provenance and metadata                                                qb:sliceKey

       structure                          qb:structure

   Observation                           qb:DataSet                      qb:SliceKey

       measured values, at dimensions                       qb:slice
                                                                        qb:sliceStructure
                                           qb:dataset
        with attributes                                      qb:Slice

       direct link to DataSet                                             qb:subSlice



                                                              qb:observation

                                         qb:Observation
                                          dimension values
                                          measure value(s)
                                          attribute values
Data cube vocabulary
1. Top level
   DataSet                               qb:DataStructureDefinition
                                                                                qb:component

       provenance and metadata                                                qb:sliceKey

       structure                          qb:structure

   Observation                           qb:DataSet                      qb:SliceKey

       measured values, at dimensions                       qb:slice
                                                                        qb:sliceStructure
                                           qb:dataset
        with attributes                                      qb:Slice

       direct link to DataSet                                             qb:subSlice


   Slice                                                     qb:observation

                                         qb:Observation
       optional grouping by fixing
        dimensions                        dimension values
                                          measure value(s)
                                          attribute values
       guide to presentation
       allows for abbreviated data
Data cube vocabulary
2. Data Structure Definition
   explicit definition of cube
                                       qb:DataSet
    structure, inline in the data                            qb:structure


   enables                         qb:DataStructureDefinition
       validation                                               qb:component

       visualization
       discovery                   qb:ComponentSpecification

       abbreviation                                      qb:componentRequired
                                                          qb:componentAttachment
                                                          qb:order

                                           qb:dimension

                                           qb:measure

                                           qb:attribute
Bathing Water Quality cubes
   measures
       total coliform count, entero virus count, ...
       sample classification
   dimensions
       sampling point
       sampling week
       sampling year
   attributes
       abnormal weather
Everything has a URI
                      Selected Lists and
                       Individual Bathing Waters
                      Lists and Individual
                       Assessments
                          In-Season or Annual
                           Compliance
                      Vocabulary Terms
                      Datasets (and subsets)
                      Presented as:
                          HTML, (for people)
                          JSON, XML, RDF and CSV
                           (for programs)
Data Platform and Applications


  Web of Linked Data




                       http://environment.data.gov.uk/lab/bwq-os.html
Outcomes
   bathing water quality information available
       as both data set and set of web APIs
       updated weekly (in season)
   third party applications to use and combine the data
   seed a web of environmental and location data
       reference identifiers can be reused for related information
       URI patterns designed to be compatible with INSPIRE
Wrapping up




image: erika g. @ flickr.com
Lessons
   importance of reference identifiers
   developer accessibility
       linked data API
   publish once, consume many ways
   importance of maintenance and QoS expectation
   reusable patterns:
       reusable vocabularies - Data Cube, org ...
       URI patterns
       provenance – OMPV and specializations
   incremental approach
Acknowledgements
   Alex Coley (Environment Agency)
       for slides 17, 18, and for sponsoring the bathing water quality
        data publication
   Stuart Williams
       developer of the bathing water application and slides 19,27,28
   John Sheridan (The National Archive)
       for sponsoring the development of data cube
   Richard Cyganiak, Jeni Tennison
       co-developers of the data cube vocabulary
fin.
                                         fin.




image: Christian Haugen @ flickr.com
Spare
Linked data principles
   Use URIs as names for things
   Use HTTP URIs so that people can look up those names
   When someone looks up a URI, provide useful
    information, using the standards (RDF*, SPARQL)
   Include links to other URIs, so that they can discover
    more things



             Pattern of application of semantic
                         web stack
Linked open data cloud: 2007




Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
Linked open data cloud: 2009




Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
Linked open data cloud: 2010




Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
Accessing all this data
   link following
       HTTP GET, follow links, aggregate relevant statements
   query
       SPARQL
SPARQL
   core idea is pattern matching
       graph patterns with variables
       any subgraph which matches yields row of bindings
                  ont:districtAdministrative          rdfs:label
        ?school                                []                    “Cardiff”

   syntax based on Turtle syntax for RDF
   web API endpoints
   lots of power
       filters                    sub-queries           federated query
       optionals                  property chains       update
       named graphs               aggregation           construct
Accessing all this data
   link following
       HTTP GET, follow links, aggregate relevant statements
   query
       SPARQL
   linked data API
       RESTful API onto linked data resources
       simple query, usable without RDF stack, web dev friendly
       easy to layer visualizations and UIs on top
   third parties
       search engines and aggregators e.g. Sindice, sameAs.org
Semantic web layer cake
Data.gov.uk
visualizations on top of linked data
Data.gov.uk – linked datasets and APIs

Más contenido relacionado

Similar a Using linked data for dataset publication

Learning Analytics & Linked Data – Opportunities, Challenges, Examples
Learning Analytics & Linked Data – Opportunities, Challenges, ExamplesLearning Analytics & Linked Data – Opportunities, Challenges, Examples
Learning Analytics & Linked Data – Opportunities, Challenges, ExamplesStefan Dietze
 
Putting Intelligence in Open Data - With examples in education
Putting Intelligence in Open Data - With examples in educationPutting Intelligence in Open Data - With examples in education
Putting Intelligence in Open Data - With examples in educationMathieu d'Aquin
 
Linked Data Hypercubes - Semtech London
Linked Data Hypercubes - Semtech LondonLinked Data Hypercubes - Semtech London
Linked Data Hypercubes - Semtech LondonDave Reynolds
 
Environmental Linked Data - Semtech Biz London
Environmental Linked Data - Semtech Biz LondonEnvironmental Linked Data - Semtech Biz London
Environmental Linked Data - Semtech Biz LondonAlex Coley
 
Introduction to linked data and the semantic web
Introduction to linked data and the semantic webIntroduction to linked data and the semantic web
Introduction to linked data and the semantic webDave Reynolds
 
Linked Data Hypercubes
Linked Data HypercubesLinked Data Hypercubes
Linked Data HypercubesDave Reynolds
 
KnowEscape workshop, OKCon 2013
KnowEscape workshop, OKCon 2013KnowEscape workshop, OKCon 2013
KnowEscape workshop, OKCon 2013Stefan Dietze
 
Linked Data for Federation of OER Data & Repositories
Linked Data for Federation of OER Data & RepositoriesLinked Data for Federation of OER Data & Repositories
Linked Data for Federation of OER Data & RepositoriesStefan Dietze
 
Modeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVModeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVEUDAT
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so farEnrico Daga
 
Linked data introduction w exempel
Linked data introduction w exempelLinked data introduction w exempel
Linked data introduction w exempelKerstin Forsberg
 
Data Warehousing and Mining Data from Library and University Systems for Asse...
Data Warehousing and Mining Data from Library and University Systems for Asse...Data Warehousing and Mining Data from Library and University Systems for Asse...
Data Warehousing and Mining Data from Library and University Systems for Asse...Ray Schwartz
 
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...Laurent Lefort
 
Information Intermediaries
Information IntermediariesInformation Intermediaries
Information IntermediariesDave Reynolds
 
Boundless Opportunity
Boundless OpportunityBoundless Opportunity
Boundless OpportunityRachel Frick
 
Using linked data and the semantic web - "powered by INSPIRE" conference pres...
Using linked data and the semantic web - "powered by INSPIRE" conference pres...Using linked data and the semantic web - "powered by INSPIRE" conference pres...
Using linked data and the semantic web - "powered by INSPIRE" conference pres...Alex Coley
 
Creating Visualizations with Linked Open Data
Creating Visualizations with Linked Open DataCreating Visualizations with Linked Open Data
Creating Visualizations with Linked Open DataAlvaro Graves
 
Linked Data at the Open University: From Technical Challenges to Organization...
Linked Data at the Open University: From Technical Challenges to Organization...Linked Data at the Open University: From Technical Challenges to Organization...
Linked Data at the Open University: From Technical Challenges to Organization...Mathieu d'Aquin
 
WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons LearnedWWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons LearnedStefan Dietze
 

Similar a Using linked data for dataset publication (20)

Learning Analytics & Linked Data – Opportunities, Challenges, Examples
Learning Analytics & Linked Data – Opportunities, Challenges, ExamplesLearning Analytics & Linked Data – Opportunities, Challenges, Examples
Learning Analytics & Linked Data – Opportunities, Challenges, Examples
 
Putting Intelligence in Open Data - With examples in education
Putting Intelligence in Open Data - With examples in educationPutting Intelligence in Open Data - With examples in education
Putting Intelligence in Open Data - With examples in education
 
Linked Data Hypercubes - Semtech London
Linked Data Hypercubes - Semtech LondonLinked Data Hypercubes - Semtech London
Linked Data Hypercubes - Semtech London
 
Environmental Linked Data - Semtech Biz London
Environmental Linked Data - Semtech Biz LondonEnvironmental Linked Data - Semtech Biz London
Environmental Linked Data - Semtech Biz London
 
Introduction to linked data and the semantic web
Introduction to linked data and the semantic webIntroduction to linked data and the semantic web
Introduction to linked data and the semantic web
 
Linked Data Hypercubes
Linked Data HypercubesLinked Data Hypercubes
Linked Data Hypercubes
 
KnowEscape workshop, OKCon 2013
KnowEscape workshop, OKCon 2013KnowEscape workshop, OKCon 2013
KnowEscape workshop, OKCon 2013
 
Linked Data for Federation of OER Data & Repositories
Linked Data for Federation of OER Data & RepositoriesLinked Data for Federation of OER Data & Repositories
Linked Data for Federation of OER Data & Repositories
 
Modeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVModeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROV
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so far
 
Icm sem tech_master
Icm sem tech_masterIcm sem tech_master
Icm sem tech_master
 
Linked data introduction w exempel
Linked data introduction w exempelLinked data introduction w exempel
Linked data introduction w exempel
 
Data Warehousing and Mining Data from Library and University Systems for Asse...
Data Warehousing and Mining Data from Library and University Systems for Asse...Data Warehousing and Mining Data from Library and University Systems for Asse...
Data Warehousing and Mining Data from Library and University Systems for Asse...
 
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
 
Information Intermediaries
Information IntermediariesInformation Intermediaries
Information Intermediaries
 
Boundless Opportunity
Boundless OpportunityBoundless Opportunity
Boundless Opportunity
 
Using linked data and the semantic web - "powered by INSPIRE" conference pres...
Using linked data and the semantic web - "powered by INSPIRE" conference pres...Using linked data and the semantic web - "powered by INSPIRE" conference pres...
Using linked data and the semantic web - "powered by INSPIRE" conference pres...
 
Creating Visualizations with Linked Open Data
Creating Visualizations with Linked Open DataCreating Visualizations with Linked Open Data
Creating Visualizations with Linked Open Data
 
Linked Data at the Open University: From Technical Challenges to Organization...
Linked Data at the Open University: From Technical Challenges to Organization...Linked Data at the Open University: From Technical Challenges to Organization...
Linked Data at the Open University: From Technical Challenges to Organization...
 
WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons LearnedWWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
 

Último

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 

Último (20)

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 

Using linked data for dataset publication

  • 1. Semantic web and linked data for data set publication Dave Reynolds, Epimorphics Ltd @der42
  • 2. Outline  Background on linked data  Roles in data set publishing  Case study: Environment Agency  Lessons
  • 4. Linked data ... publishing data on the web ... ... to enable integration, linking and reuse across silos
  • 5. Linked data Apply the principles to the web to publication of data The linked data web:  is a global network of things  each identified by a URI  fetching a URI gives a set of statements in RDF  things connected by typed links  open, anyone can say anything about anything else Linked data is “data you can click on”
  • 6. Example schools information http://education.data.gov.uk/id/school/401874
  • 7. Example schools information http://education.data.gov.uk/id/school/401874 a School label phase district “Secondary” “Cardiff High School” “Cardiff”
  • 8. Example schools information http://education.data.gov.uk/id/school/401874 a school:School phase label district school:PhaseOfEducation_Secondary “Cardiff High School” http://statistics.data.gov.uk/id/local-authority-district/00PT label “Cardiff”
  • 9. Example schools information http://education.data.gov.uk/id/school/401874 rdf:type school:School rdfs:label school:phase school:district school:PhaseOfEducation_Secondary “Cardiff High School” http://statistics.data.gov.uk/id/local-authority-district/00PT rdfs:label “Cardiff”
  • 10. Example schools information http://education.data.gov.uk/id/school/401874 rdf:type school:School rdfs:label school:phase school:district school:PhaseOfEducation_Secondary “Cardiff High School” http://statistics.data.gov.uk/id/local-authority-district/00PT label “Cardiff” http://data.ordnancesurvey.co.uk/id/7000000000025484 admingeo:ward spatial:extent admingeo:parish GML: 310499.4 184176.6 310476.5 ...
  • 11. Example schools information http://education.data.gov.uk/id/school/401874 rdf:type school:School rdfs:label school:phase school:district school:PhaseOfEducation_Secondary “Cardiff High School” http://statistics.data.gov.uk/id/local-authority-district/00PT label “Cardiff” owl:sameAs http://data.ordnancesurvey.co.uk/id/7000000000025484 admingeo:ward spatial:extent admingeo:parish GML: 310499.4 184176.6 310476.5 ...
  • 12.
  • 13. Role in data set publication  well suited to describing things  schools, companies, animal species, music tracks, tv programmes ...  what about datasets?  environmental measurements, experimental results, statistical analyses ...
  • 14. Approach 1 : Data catalogues  treat the dataset as a single resource, identify with a URI  provide metadata as linked data  descriptive  categorical  technical and structural Benefits?  separate of metadata from resource & repository  easy aggregation of metadata into catalogues  schema-less enables use-specific annotations and links  use of sharable category schemes and reference data => support for discovery
  • 15. Approach 2 : Fine grain publication  publish the data set itself as linked data  entities, terms, individual records in data identified by URIs  data set structure and ontologies linked from data  still include dataset metadata Benefits?  all benefits of approach 1 to support discovery  self-describing  data slices addressable (trace back, provenance, annotation)  integration across sets - reuse of terms for dimensions, units, values  fine grained access => integration, comparison, context, data as a service
  • 16.
  • 17. bathing water quality what we do... start of season 15th May Press interest bathing season what information 20-22 samples in 22weeks is relevant to the public about beaches 30th Sept annual report what November we do December
  • 18. how linkable data helps Tenby Tourist Information Centre Unit 2 , The Gateway Complex Tenby. Wales , SA70 7LT Tel: 01834 842 402 Fax: 01834 845 439 Email: tenby.tic@pembrokeshire.gov.uk Photo by Skellig2008 (flickr)
  • 19. Publishing the Bathing Water Quality data set Bathing Sampling Zones Of Assessment Vocabularies Waters Points Influence s e.g. http://location.data.gov.uk/def/ef/SampingPoint URI Set Bathing Sampling Zone Of Reference Data Waters Points Influence e.g. http://location.data.gov.uk/so/ef/SamplingPoint/bwsp.eaew Assessme http://environment.data.gov.uk/data/bathing-water-quality Observation nt Datasets void:subset void:subset In-season Annual Weekly .../compliance Complianc .../in-season Assessme e nt
  • 20. Data cube vocabulary  collaborative development sponsored by data.gov.uk  simple, flexible vocabulary  mirrors core information models from:  SDMX (Statistical Data and Metadata eXchange)  DDI (Data Documentation Initiative)  extension to SCOVO vocabulary image: dullhunk @ flickr
  • 21. Data cube model A set of observations  indexed by dimensions  describing measures  interpreted according to attributes (e.g. region) dimension measure(s) attributes • population unit of measure = count = 32,567 status = preliminary ... dimension (e.g. time)
  • 22. Data cube vocabulary 1. Top level  DataSet qb:DataStructureDefinition qb:component  provenance and metadata qb:sliceKey  structure qb:structure qb:DataSet qb:SliceKey qb:slice qb:sliceStructure qb:dataset qb:Slice qb:subSlice qb:observation qb:Observation dimension values measure value(s) attribute values
  • 23. Data cube vocabulary 1. Top level  DataSet qb:DataStructureDefinition qb:component  provenance and metadata qb:sliceKey  structure qb:structure  Observation qb:DataSet qb:SliceKey  measured values, at dimensions qb:slice qb:sliceStructure qb:dataset with attributes qb:Slice  direct link to DataSet qb:subSlice qb:observation qb:Observation dimension values measure value(s) attribute values
  • 24. Data cube vocabulary 1. Top level  DataSet qb:DataStructureDefinition qb:component  provenance and metadata qb:sliceKey  structure qb:structure  Observation qb:DataSet qb:SliceKey  measured values, at dimensions qb:slice qb:sliceStructure qb:dataset with attributes qb:Slice  direct link to DataSet qb:subSlice  Slice qb:observation qb:Observation  optional grouping by fixing dimensions dimension values measure value(s) attribute values  guide to presentation  allows for abbreviated data
  • 25. Data cube vocabulary 2. Data Structure Definition  explicit definition of cube qb:DataSet structure, inline in the data qb:structure  enables qb:DataStructureDefinition  validation qb:component  visualization  discovery qb:ComponentSpecification  abbreviation qb:componentRequired qb:componentAttachment qb:order qb:dimension qb:measure qb:attribute
  • 26. Bathing Water Quality cubes  measures  total coliform count, entero virus count, ...  sample classification  dimensions  sampling point  sampling week  sampling year  attributes  abnormal weather
  • 27. Everything has a URI  Selected Lists and Individual Bathing Waters  Lists and Individual Assessments  In-Season or Annual Compliance  Vocabulary Terms  Datasets (and subsets)  Presented as:  HTML, (for people)  JSON, XML, RDF and CSV (for programs)
  • 28. Data Platform and Applications Web of Linked Data http://environment.data.gov.uk/lab/bwq-os.html
  • 29. Outcomes  bathing water quality information available  as both data set and set of web APIs  updated weekly (in season)  third party applications to use and combine the data  seed a web of environmental and location data  reference identifiers can be reused for related information  URI patterns designed to be compatible with INSPIRE
  • 30. Wrapping up image: erika g. @ flickr.com
  • 31. Lessons  importance of reference identifiers  developer accessibility  linked data API  publish once, consume many ways  importance of maintenance and QoS expectation  reusable patterns:  reusable vocabularies - Data Cube, org ...  URI patterns  provenance – OMPV and specializations  incremental approach
  • 32. Acknowledgements  Alex Coley (Environment Agency)  for slides 17, 18, and for sponsoring the bathing water quality data publication  Stuart Williams  developer of the bathing water application and slides 19,27,28  John Sheridan (The National Archive)  for sponsoring the development of data cube  Richard Cyganiak, Jeni Tennison  co-developers of the data cube vocabulary
  • 33. fin. fin. image: Christian Haugen @ flickr.com
  • 34. Spare
  • 35. Linked data principles  Use URIs as names for things  Use HTTP URIs so that people can look up those names  When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL)  Include links to other URIs, so that they can discover more things Pattern of application of semantic web stack
  • 36. Linked open data cloud: 2007 Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
  • 37. Linked open data cloud: 2009 Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
  • 38. Linked open data cloud: 2010 Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
  • 39. Accessing all this data  link following  HTTP GET, follow links, aggregate relevant statements  query  SPARQL
  • 40. SPARQL  core idea is pattern matching  graph patterns with variables  any subgraph which matches yields row of bindings ont:districtAdministrative rdfs:label ?school [] “Cardiff”  syntax based on Turtle syntax for RDF  web API endpoints  lots of power  filters  sub-queries  federated query  optionals  property chains  update  named graphs  aggregation  construct
  • 41. Accessing all this data  link following  HTTP GET, follow links, aggregate relevant statements  query  SPARQL  linked data API  RESTful API onto linked data resources  simple query, usable without RDF stack, web dev friendly  easy to layer visualizations and UIs on top  third parties  search engines and aggregators e.g. Sindice, sameAs.org
  • 44. Data.gov.uk – linked datasets and APIs

Notas del editor

  1. Context about bathing water quality
  2. context