SlideShare una empresa de Scribd logo
1 de 41
Descargar para leer sin conexión
Joshua Shinavier



 The state of the art in
     Linked Data


Advanced Semantic Web, Spring 2009
         Literature Survey
Outline
•   Linked Data

•   Linking Open Data

•   describing linked datasets

•   growing the data web

•   keeping Linked Data connected

•   indexing and searching

•   applications

•   navigation

•   state of the data web



                             2
Linked Data overview

•   resource -- an item of interest

•   URI -- global identifier for a resource

•   representation -- data corresponding to the state
    of a resource

•   information resource -- a “document” containing
    information

•   non-information resource -- anything else

•   associated description -- representation describing
    a Semantic Web resource




                             3
The Linking Open Data initiative
•   “bootstrap” the data web with large, interconnected data sets
    to reach a critical mass of semantics

•   strict adherence to W3C standards

    •   identification and transportation (URI, HTTP) of resource
        descriptions

    •   interpretation (RDF, RDFS, OWL) of resource descriptions

•   LOD grows as data providers:

    •   publish structured data on the Web

    •   set RDF links between entities in different data sources

•   transition of the web from a distributed document repository
    into a universal, ubiquitous database [Erling 09]

                                 4
The LOD cloud




      5
LOD data sets




      6
Link sets in LOD




        7
Describing linked datasets

•   voiD (Vocabulary of Interlinked Datasets)
    [Alexander, Cyganiak, Hausenblas, Zhao 09]

    •   describes data sets the link sets between them

•   DING (Dataset RankING) [Toupikov, Umbrich,
    Delbru, Hausenblas, Tummarello 09]

    •   ranking of linked datasets using formal
        descriptions

•   modeling of the Linked Data domain [Halpin,
    Presutti 09]




                            8
Keeping Linked Data connected

•   network-shaped Entity Name System to enable
    systematic reuse of URIs [Bouquet, Stoermer,
    Cordioli, Tummarello 08]

    •   similar to DNS for interlinking hypertext

•   n2Mate framework [Peterson, Cregan, Atkinson,
    Brisbin 08]

    •   use social networking principles to facilitate
        vocabulary and instance reuse

•   graph-based disambiguation of Semantic Web
    entities with idMesh [Cudré-Mauroux, Haghani,
    Jost, Aberer, de Meer 09]



                              9
Managing co-reference
•   many conflated resources in DBpedia [Jaffri,
    Glaser, Millard 08]

    •   representative of LOD as a whole

•   Co-Reference Resolution Service [Glaser, Jaffri,
    Millard 09]

    •   when co-reference is context-specific,
        owl:sameAs is inappropriate

    •   stores co-reference information as a first-class
        entity

•   ontology-level alignment should precede data-level
    alignment [Nikolov, Uren, Motta 09]



                             10
Growing the data web

•   how to get data out there?

•   challenges of the read-write Semantic Web

    •   user awareness of social context of data (e.g.
        licensing, privacy)

    •   view update problem

    •   is the wiki model applicable?

•   incentives for posting data on the SW

•   validating existing Linked Data with Vapour
    [Berrueta, Fernandez, Frade 08]



                              11
Examples of LOD data sets


•   DBpedia [Auer, Bizer, Kobilarov, Lehmann,
    Cyganiak, Ives 07]

    •   extracts structured information from Wikipedia

    •   linking hub for the LOD cloud

•   RDF Book Mashup [Bizer, Cyganiak, Gauss 07]

    •   product metadata from Amazon.com




                            12
Music and movies as Linked Data
•   Linked Movie Database [Hassanzadeh, Consens 09]

    •   combines data from IMDb, Freebase, OMDB,
        DBPedia, RottenTomatoes.com, Stanford Movie
        Database

•   interlinked music datasets [Raimond, Sutton,
    Sandler 08]

    •   combines data from Jamendo on DBTune, BBC
        John Peel sessions, SBSimilarity, Musicbrainz,
        DBpedia, Geonames

    •   links artists, albums, tracks, personal music
        collections

    •   generated links based similarity of resources,
        similarity of neighbors

                             13
Other sources of data


•   the hypertext Web itself [Li, Zhao 08]

    •   extraction of semantic links from hypertext links and
        hierarchical relationships among Web documents

•   RDF representation of HTML DOM from using SparqPlug
    [Coetzee, Heath, Motta 08]

•   multimedia metadata

    •   interlinking multimedia fragments [Hausenblas, Troncy,
        Bürger, Raimond 09]




                                14
Other sources of data (cont.)

•   XML Business Reporting Language (XBRL) [Garcia, Gil
    09]

    •   mapping data to RDF and schemas to OWL
        facilitates interoperability

•   large thesauri [Neubert 09]

    •   as interlinking hubs for professional communities

•   enterprise data, e.g. technical documentation [Servant
    08]

•   MARC21 bibliographic records [Styles, Ayers, Shabir
    08]



                             15
Mapping tools


•   D2R Server for customizable mappings from
    relational databases to ontologies [Bizer, Cyganiak
    06]

•   browser-based tools for defining RDB-to-RDF
    mappings [Zhou, Xu, Chen, Idehen 08]

•   Triplify [Auer, Dietzold, Lehmann, Hellmann,
    Aumueller 09]

•   from generic data silos to Linked Data using
    OpenLink Data Spaces [Idehen, Erling 08]




                           16
Aggregated resources


•   Open Archives Initiative Protocol for Metadata
    Harvesting (OAI-PMH)

    •   can be made Web-accessible with OAI2LOD
        Server [Haslhofer, Schandl 08]

•   Open Archives Initiative - Object Reuse and
    Exchange (OAI-ORE) [Van de Sompel, Lagoze,
    Nelson, Warner, Sanderson, Johnston 09]

    •   adheres to Web principles




                            17
User-driven Linked Data


•   existing Linked Data datasets are more
    appropriate for machine than human
    consumption

•   template-generated interlinks are of limited quality

•   data from existing silos quickly becomes out of
    date

•   need human involvement to grow the data web
    organically




                           18
User-driven Linked Data (cont.)
•   direct modification using SPARQL/Update

    •   e.g. in Tabulator [Berners-Lee, Hollenbach, Lu, Presbrey,
        Prud’hommeaux, Schraefel 08]

•   User Contributed Interlinking [Halb, Raimond, Hausenblas]

•   semantic wikis

•   Loomp [Roesch, Heese 09]

    •   semantic annotation of content using a text editor
        interface




                                19
User-driven Linked Data (cont.)
•   public data from existing social networks

    •   wrappers for Web 2.0 services [Passant 08]

    •   unifying personal identity across various
        networks [Rowe 09]

•   Semantically Interlinked Online Communities
    (SIOC)

    •   integrating social media sites (forums, blogs,
        wikis, etc. with the data web [Bojars, Passant,
        Cyganiak, Breslin 08]

•   Meaning of a Tag (MOAT) ontology gives meaning
    to tags on Web 2.0 [Passant, Laublet 08]



                             20
Usability and licensing

•   usability (for humans) of Linked Data [Halb,
    Raimond, Hausenblas 08]

    •   current LOD datasets are primarily for machine
        consumption

    •   low semantic strength of current LOD link sets

•   provenance information for Linked Data [Hartig
    09]

•   Open Data Commons license [Miller, Styles, Heath
    08]




                            21
Indexing and searching
•   W3C’s TAP semantic search [Guha, McCool 01]

•   Swoogle [Ding, Finin, Joshi, Pan, Cost, Peng, Reddivari,
    Doshi, Sachs 04]

    •   adapts PageRank concept to ontologies

•   SWSE [Hogan, Harth, Umbrich, Decker 07]

    •   MultiCrawler [Harth, Umbrich, Decker 06]

•   RDF Gateway search

•   Watson document-based search

•   Falcons [Cheng, Ge, Wu, Qu 08]

    •   textual search using class hierarchies for query restriction

•   Sindice Semantic Web index [Tummarello, Delbru, Oren 07]
                                22
Link discovery


•   Silk link discovery framework [Volz, Bizer, Gaedke,
    Kobilarov 09]

    •   find relationships between entities within
        different data sources

    •   generation of owl:sameAs links

•   value of Web of Data depends on the amount and
    quality of links between data sources




                             23
Navigation
•   like early Web, it’s easy to get “Lost in Hyperspace”

•   Tabulator generic Linked Data browser [Berners-
    Lee, Chen, Chilton, Connolly, Dhanaraj,
    Hollenbach, Lerer, Sheets 06]

    •   encourage deployment of Linked Data

    •   test, refine and promote Linked Data standards

•   faceted views over large-scale linked data with
    Virtuoso Cluster Edition [Erling 09]

•   Explorator RDF browser [Araujo, Schwabe 09]

    •   exploratory search using direct manipulation



                            24
Navigation (cont.)
•   DBPedia Mobile map view and faceted Linked
    Data browser [Becker, Bizer 08]

    •   explore the geospatial Semantic Web

    •   uses current GPS position as a starting point

    •   potential for Linked Data publishing




                            25
Navigation (cont.)
•   Fenfire generic Linked Data browser [Hastrup,
    Cyganiak, Bojars 08]

    •   uses graph views rather than tables or outlines

    •   shows graph data as directly as possible

    •   related to Fentwine [Fallenstein, Lukka 04]




                            26
Navigation (cont.)


•   Humboldt [Kobilarov,
    Dickinson 08]

    •   exploratory browsing

    •   faceted views

    •   “resource at a time”

    •   uses a “pivot” operation
        to refocus the view




                                   27
Navigation (cont.)
•   zLinks plugin [Bergman, Giasson 08]

    •   WordPress plugin with supporting server

    •   relates hypertext links with contextually
        relevant Linked Data

    •   WOWY (WordNet, OpenCyc, Wikipedia, YAGO)

        •   distinguish between types of resources

        •   disambiguate alternate senses




                              28
Navigation (cont.)
•   mapping of Linked Data to a file system model
    [Schandl 09]

    •   enables use of this data within desktop
        applications




                            29
Other applications
•   how to use the data that is out there?

    •   emerging applications which exploit Linked
        Data [Hausenblas 09]

•   integrating data sources related to drug and
    clinical trials [Jentzsch, Andersson, Hassanzadeh,
    Stephens, Bizer 09]

•   mashups

    •   MashQL [Jarrar, Dikaiakos 09]

        •   Internet is a database, mashup is a query
            over that database

•   benefit of specialized, independent Linked Data
    services acting together [Bojars, Passant, Giasson,
    Breslin 07]
                              30
The gray area
•   U-P2P framework for peer-to-peer linked data [Davoust,
    Esfandiari 09]

    •   data replication provides a measure of popularity

•   Linked Data with Named Graphs

    •   e.g. interlinks with embedded provenance information
        [Zhao, Klyne, Shotton 08]

•   Ripple scripting language [Shinavier 07]

    •   embeds Turing-complete programs in the Web of Data




                                31
State of the data web
•   where are we with the Linked Data graph?

    •   size

    •   number and type of links

    •   usefulness to end users

    •   network characteristics

•   single-point-of-access (e.g. DBpedia, GeoNames)
    vs. distributed datasets (e.g. FOAF-o-sphere,
    SIOC-land)

•   syntactic and semantic analysis of the LOD
    dataset [Hausenblas, Halb, Raimond, Heath 08]



                            32
Statistics of the data web

•   today’s Linked Data is very different than the first-
    generation data web [Halpin 09]

    •   LOD data accounts for the vast majority of data

    •   power-law distributions are emerging

    •   data web is not growing organically

    •   Web standards are generally adhered to

•   is Linked Data useful to ordinary users?

    •   sampling of Linked Data using Live.com query
        logs and FALCON-S semantic search engine


                            33
Query popularity follows a power law




 •   ...




                 34
URI frequency... not so much




•   ...




                  35
Data publishing lacks a “long tail”




•   ...




                 36
A few dominant ontologies are emerging




          # of URIs by vocabulary
                     37
(DBpedia bias)




# of URIs by domain name
           38
Graph analysis for the data web

•   common network analysis techniques can be used
    to investigate interoperability and structural
    patterns of the LOD cloud [Rodriguez 09]

•   results based on March 2009 statistics of the LOD
    data set graph:

    •   LOD graph is not strongly connected

    •   diameter of 8 is large given relatively small size
        of the cloud

    •   data sets have nearly identical incoming and
        outgoing link patterns (⇒ majority of reciprocal
        owl:sameAs links)



                              39
Ranking and clustering of LOD data sets




                   40
•       Original slide show:

    •    http://tw.rpi.edu/proj/portal.wiki/images/f/f0/
         LinkedData.pdf

•       References:

    •    http://tw.rpi.edu/proj/portal.wiki/images/e/e0/
         LinkedDataSurvey.pdf

•       BibTeX:

    •    http://tw.rpi.edu/proj/portal.wiki/images/3/37/
         LinkedDataSurvey.bbl




                                     41

Más contenido relacionado

La actualidad más candente

Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic WebNuxeo
 
Persistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous MappingPersistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous MappingHerbert Van de Sompel
 
Linked Data as an enabling framework for resource discovery across libraries,...
Linked Data as an enabling framework for resource discovery across libraries,...Linked Data as an enabling framework for resource discovery across libraries,...
Linked Data as an enabling framework for resource discovery across libraries,...Andy Powell
 
Grid Computing July 2009
Grid Computing July 2009Grid Computing July 2009
Grid Computing July 2009Ian Foster
 
Illuminating DSpace's Linked Data Support
Illuminating DSpace's Linked Data SupportIlluminating DSpace's Linked Data Support
Illuminating DSpace's Linked Data SupportPascal-Nicolas Becker
 
Contextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesContextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesRichard Wallis
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked DataEUCLID project
 
Introduction to Linked Data Platform (LDP)
Introduction to Linked Data Platform (LDP)Introduction to Linked Data Platform (LDP)
Introduction to Linked Data Platform (LDP)Hector Correa
 
semantic markup using schema.org
semantic markup using schema.orgsemantic markup using schema.org
semantic markup using schema.orgJoshua Shinavier
 
Microtask Crowdsourcing Applications for Linked Data
Microtask Crowdsourcing Applications for Linked DataMicrotask Crowdsourcing Applications for Linked Data
Microtask Crowdsourcing Applications for Linked DataEUCLID project
 
Introduction to APIs and Linked Data
Introduction to APIs and Linked DataIntroduction to APIs and Linked Data
Introduction to APIs and Linked DataAdrian Stevenson
 
SWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebSWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebPascal-Nicolas Becker
 
Resource Oriented Architectures: The Future of Data API?
Resource Oriented Architectures: The Future of Data API?Resource Oriented Architectures: The Future of Data API?
Resource Oriented Architectures: The Future of Data API?Victor Olex
 
The Web of Data is Our Opportunity
The Web of Data is Our OpportunityThe Web of Data is Our Opportunity
The Web of Data is Our OpportunityRichard Wallis
 
(PROJEKTURA) Big Data Open Data story for TGG
(PROJEKTURA) Big Data Open Data story for TGG(PROJEKTURA) Big Data Open Data story for TGG
(PROJEKTURA) Big Data Open Data story for TGGRatko Mutavdzic
 
Interoperability for web based scholarship
Interoperability for web based scholarshipInteroperability for web based scholarship
Interoperability for web based scholarshipHerbert Van de Sompel
 
Learning W3C Linked Data Platform with examples
Learning W3C Linked Data Platform with examplesLearning W3C Linked Data Platform with examples
Learning W3C Linked Data Platform with examplesNandana Mihindukulasooriya
 

La actualidad más candente (20)

Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
 
Persistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous MappingPersistent Identifiers and the Web: The Need for an Unambiguous Mapping
Persistent Identifiers and the Web: The Need for an Unambiguous Mapping
 
Linked Data as an enabling framework for resource discovery across libraries,...
Linked Data as an enabling framework for resource discovery across libraries,...Linked Data as an enabling framework for resource discovery across libraries,...
Linked Data as an enabling framework for resource discovery across libraries,...
 
Grid Computing July 2009
Grid Computing July 2009Grid Computing July 2009
Grid Computing July 2009
 
Illuminating DSpace's Linked Data Support
Illuminating DSpace's Linked Data SupportIlluminating DSpace's Linked Data Support
Illuminating DSpace's Linked Data Support
 
Contextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesContextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of Entities
 
ResourceSync Tutorial
ResourceSync TutorialResourceSync Tutorial
ResourceSync Tutorial
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked Data
 
Introduction to Linked Data Platform (LDP)
Introduction to Linked Data Platform (LDP)Introduction to Linked Data Platform (LDP)
Introduction to Linked Data Platform (LDP)
 
semantic markup using schema.org
semantic markup using schema.orgsemantic markup using schema.org
semantic markup using schema.org
 
Microtask Crowdsourcing Applications for Linked Data
Microtask Crowdsourcing Applications for Linked DataMicrotask Crowdsourcing Applications for Linked Data
Microtask Crowdsourcing Applications for Linked Data
 
Introduction to APIs and Linked Data
Introduction to APIs and Linked DataIntroduction to APIs and Linked Data
Introduction to APIs and Linked Data
 
Introduction to W3C Linked Data Platform
Introduction to W3C Linked Data PlatformIntroduction to W3C Linked Data Platform
Introduction to W3C Linked Data Platform
 
SWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebSWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic Web
 
Resource Oriented Architectures: The Future of Data API?
Resource Oriented Architectures: The Future of Data API?Resource Oriented Architectures: The Future of Data API?
Resource Oriented Architectures: The Future of Data API?
 
The Web of Data is Our Opportunity
The Web of Data is Our OpportunityThe Web of Data is Our Opportunity
The Web of Data is Our Opportunity
 
(PROJEKTURA) Big Data Open Data story for TGG
(PROJEKTURA) Big Data Open Data story for TGG(PROJEKTURA) Big Data Open Data story for TGG
(PROJEKTURA) Big Data Open Data story for TGG
 
Interoperability for web based scholarship
Interoperability for web based scholarshipInteroperability for web based scholarship
Interoperability for web based scholarship
 
Learning W3C Linked Data Platform with examples
Learning W3C Linked Data Platform with examplesLearning W3C Linked Data Platform with examples
Learning W3C Linked Data Platform with examples
 
Swoogle
SwoogleSwoogle
Swoogle
 

Similar a The State of the Linked Data Art in 2009

Cloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentCloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentPeter Haase
 
Session 1.4 a distributed network of heritage information
Session 1.4   a distributed network of heritage informationSession 1.4   a distributed network of heritage information
Session 1.4 a distributed network of heritage informationsemanticsconference
 
A distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics AmsterdamA distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics AmsterdamEnno Meijers
 
Linked open data project
Linked open data projectLinked open data project
Linked open data projectFaathima Fayaza
 
Question answering in linked data
Question answering in linked dataQuestion answering in linked data
Question answering in linked dataReza Ramezani
 
Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked dataLaura Po
 
A distributed network of digital heritage information - Unesco/NDL India
A distributed network of digital heritage information - Unesco/NDL IndiaA distributed network of digital heritage information - Unesco/NDL India
A distributed network of digital heritage information - Unesco/NDL IndiaEnno Meijers
 
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...cmitch41
 
Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Anja Jentzsch
 
Linked data and the future of libraries
Linked data and the future of librariesLinked data and the future of libraries
Linked data and the future of librariesRegan Harper
 
DBpedia Mappings Wiki, SMWCon Fall 2013, Berlin
DBpedia Mappings Wiki, SMWCon Fall 2013, BerlinDBpedia Mappings Wiki, SMWCon Fall 2013, Berlin
DBpedia Mappings Wiki, SMWCon Fall 2013, BerlinAnja Jentzsch
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareIMC Technologies
 
EuropeanaTech 2018: A distributed network of digital heritage information
EuropeanaTech 2018: A distributed network of digital heritage informationEuropeanaTech 2018: A distributed network of digital heritage information
EuropeanaTech 2018: A distributed network of digital heritage informationEnno Meijers
 
Linked Open Data in Romania
Linked Open Data in RomaniaLinked Open Data in Romania
Linked Open Data in RomaniaVlad Posea
 

Similar a The State of the Linked Data Art in 2009 (20)

Linked (Open) Data
Linked (Open) DataLinked (Open) Data
Linked (Open) Data
 
Linked Data
Linked DataLinked Data
Linked Data
 
Finding Data Sets
Finding Data SetsFinding Data Sets
Finding Data Sets
 
Linked Data Basics
Linked Data BasicsLinked Data Basics
Linked Data Basics
 
Cloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentCloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application Development
 
Session 1.4 a distributed network of heritage information
Session 1.4   a distributed network of heritage informationSession 1.4   a distributed network of heritage information
Session 1.4 a distributed network of heritage information
 
A distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics AmsterdamA distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics Amsterdam
 
Linked open data project
Linked open data projectLinked open data project
Linked open data project
 
Question answering in linked data
Question answering in linked dataQuestion answering in linked data
Question answering in linked data
 
Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked data
 
A distributed network of digital heritage information - Unesco/NDL India
A distributed network of digital heritage information - Unesco/NDL IndiaA distributed network of digital heritage information - Unesco/NDL India
A distributed network of digital heritage information - Unesco/NDL India
 
Linked data 20171106
Linked data 20171106Linked data 20171106
Linked data 20171106
 
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
 
Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)
 
Linked data and the future of libraries
Linked data and the future of librariesLinked data and the future of libraries
Linked data and the future of libraries
 
DBpedia Mappings Wiki, SMWCon Fall 2013, Berlin
DBpedia Mappings Wiki, SMWCon Fall 2013, BerlinDBpedia Mappings Wiki, SMWCon Fall 2013, Berlin
DBpedia Mappings Wiki, SMWCon Fall 2013, Berlin
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the Software
 
EuropeanaTech 2018: A distributed network of digital heritage information
EuropeanaTech 2018: A distributed network of digital heritage informationEuropeanaTech 2018: A distributed network of digital heritage information
EuropeanaTech 2018: A distributed network of digital heritage information
 
Linked Open Data in Romania
Linked Open Data in RomaniaLinked Open Data in Romania
Linked Open Data in Romania
 
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
 

Más de Joshua Shinavier

Transpilers Gone Wild: Introducing Hydra
Transpilers Gone Wild: Introducing HydraTranspilers Gone Wild: Introducing Hydra
Transpilers Gone Wild: Introducing HydraJoshua Shinavier
 
An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...
An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...
An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...Joshua Shinavier
 
In Search of the Universal Data Model (ISWC 2019 Minute Madness)
In Search of the Universal Data Model (ISWC 2019 Minute Madness)In Search of the Universal Data Model (ISWC 2019 Minute Madness)
In Search of the Universal Data Model (ISWC 2019 Minute Madness)Joshua Shinavier
 
In Search of the Universal Data Model (Connected Data London 2019)
In Search of the Universal Data Model (Connected Data London 2019)In Search of the Universal Data Model (Connected Data London 2019)
In Search of the Universal Data Model (Connected Data London 2019)Joshua Shinavier
 
Algebraic Property Graphs (GQL Community Update, oct. 9, 2019)
Algebraic Property Graphs (GQL Community Update, oct. 9, 2019)Algebraic Property Graphs (GQL Community Update, oct. 9, 2019)
Algebraic Property Graphs (GQL Community Update, oct. 9, 2019)Joshua Shinavier
 
Building an Enterprise Knowledge Graph @Uber: Lessons from Reality
Building an Enterprise Knowledge Graph @Uber: Lessons from RealityBuilding an Enterprise Knowledge Graph @Uber: Lessons from Reality
Building an Enterprise Knowledge Graph @Uber: Lessons from RealityJoshua Shinavier
 
A Graph is a Graph is a Graph: Equivalence, Transformation, and Composition o...
A Graph is a Graph is a Graph: Equivalence, Transformation, and Composition o...A Graph is a Graph is a Graph: Equivalence, Transformation, and Composition o...
A Graph is a Graph is a Graph: Equivalence, Transformation, and Composition o...Joshua Shinavier
 
Evolution of the Graph Schema
Evolution of the Graph SchemaEvolution of the Graph Schema
Evolution of the Graph SchemaJoshua Shinavier
 
TinkerPop: a story of graphs, DBs, and graph DBs
TinkerPop: a story of graphs, DBs, and graph DBsTinkerPop: a story of graphs, DBs, and graph DBs
TinkerPop: a story of graphs, DBs, and graph DBsJoshua Shinavier
 
Real-time Semantic Web with Twitter Annotations
Real-time Semantic Web with Twitter AnnotationsReal-time Semantic Web with Twitter Annotations
Real-time Semantic Web with Twitter AnnotationsJoshua Shinavier
 
Real-time #SemanticWeb in 140 chars
Real-time #SemanticWeb in 140 charsReal-time #SemanticWeb in 140 chars
Real-time #SemanticWeb in 140 charsJoshua Shinavier
 

Más de Joshua Shinavier (14)

Anything-to-Graph
Anything-to-GraphAnything-to-Graph
Anything-to-Graph
 
Transpilers Gone Wild: Introducing Hydra
Transpilers Gone Wild: Introducing HydraTranspilers Gone Wild: Introducing Hydra
Transpilers Gone Wild: Introducing Hydra
 
TinkerPop 2020
TinkerPop 2020TinkerPop 2020
TinkerPop 2020
 
An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...
An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...
An Algebraic Data Model for Graphs and Hypergraphs (Category Theory meetup, N...
 
In Search of the Universal Data Model (ISWC 2019 Minute Madness)
In Search of the Universal Data Model (ISWC 2019 Minute Madness)In Search of the Universal Data Model (ISWC 2019 Minute Madness)
In Search of the Universal Data Model (ISWC 2019 Minute Madness)
 
In Search of the Universal Data Model (Connected Data London 2019)
In Search of the Universal Data Model (Connected Data London 2019)In Search of the Universal Data Model (Connected Data London 2019)
In Search of the Universal Data Model (Connected Data London 2019)
 
Algebraic Property Graphs (GQL Community Update, oct. 9, 2019)
Algebraic Property Graphs (GQL Community Update, oct. 9, 2019)Algebraic Property Graphs (GQL Community Update, oct. 9, 2019)
Algebraic Property Graphs (GQL Community Update, oct. 9, 2019)
 
Building an Enterprise Knowledge Graph @Uber: Lessons from Reality
Building an Enterprise Knowledge Graph @Uber: Lessons from RealityBuilding an Enterprise Knowledge Graph @Uber: Lessons from Reality
Building an Enterprise Knowledge Graph @Uber: Lessons from Reality
 
A Graph is a Graph is a Graph: Equivalence, Transformation, and Composition o...
A Graph is a Graph is a Graph: Equivalence, Transformation, and Composition o...A Graph is a Graph is a Graph: Equivalence, Transformation, and Composition o...
A Graph is a Graph is a Graph: Equivalence, Transformation, and Composition o...
 
Evolution of the Graph Schema
Evolution of the Graph SchemaEvolution of the Graph Schema
Evolution of the Graph Schema
 
TinkerPop: a story of graphs, DBs, and graph DBs
TinkerPop: a story of graphs, DBs, and graph DBsTinkerPop: a story of graphs, DBs, and graph DBs
TinkerPop: a story of graphs, DBs, and graph DBs
 
Semantics and Sensors
Semantics and SensorsSemantics and Sensors
Semantics and Sensors
 
Real-time Semantic Web with Twitter Annotations
Real-time Semantic Web with Twitter AnnotationsReal-time Semantic Web with Twitter Annotations
Real-time Semantic Web with Twitter Annotations
 
Real-time #SemanticWeb in 140 chars
Real-time #SemanticWeb in 140 charsReal-time #SemanticWeb in 140 chars
Real-time #SemanticWeb in 140 chars
 

Último

Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 

Último (20)

Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 

The State of the Linked Data Art in 2009

  • 1. Joshua Shinavier The state of the art in Linked Data Advanced Semantic Web, Spring 2009 Literature Survey
  • 2. Outline • Linked Data • Linking Open Data • describing linked datasets • growing the data web • keeping Linked Data connected • indexing and searching • applications • navigation • state of the data web 2
  • 3. Linked Data overview • resource -- an item of interest • URI -- global identifier for a resource • representation -- data corresponding to the state of a resource • information resource -- a “document” containing information • non-information resource -- anything else • associated description -- representation describing a Semantic Web resource 3
  • 4. The Linking Open Data initiative • “bootstrap” the data web with large, interconnected data sets to reach a critical mass of semantics • strict adherence to W3C standards • identification and transportation (URI, HTTP) of resource descriptions • interpretation (RDF, RDFS, OWL) of resource descriptions • LOD grows as data providers: • publish structured data on the Web • set RDF links between entities in different data sources • transition of the web from a distributed document repository into a universal, ubiquitous database [Erling 09] 4
  • 7. Link sets in LOD 7
  • 8. Describing linked datasets • voiD (Vocabulary of Interlinked Datasets) [Alexander, Cyganiak, Hausenblas, Zhao 09] • describes data sets the link sets between them • DING (Dataset RankING) [Toupikov, Umbrich, Delbru, Hausenblas, Tummarello 09] • ranking of linked datasets using formal descriptions • modeling of the Linked Data domain [Halpin, Presutti 09] 8
  • 9. Keeping Linked Data connected • network-shaped Entity Name System to enable systematic reuse of URIs [Bouquet, Stoermer, Cordioli, Tummarello 08] • similar to DNS for interlinking hypertext • n2Mate framework [Peterson, Cregan, Atkinson, Brisbin 08] • use social networking principles to facilitate vocabulary and instance reuse • graph-based disambiguation of Semantic Web entities with idMesh [Cudré-Mauroux, Haghani, Jost, Aberer, de Meer 09] 9
  • 10. Managing co-reference • many conflated resources in DBpedia [Jaffri, Glaser, Millard 08] • representative of LOD as a whole • Co-Reference Resolution Service [Glaser, Jaffri, Millard 09] • when co-reference is context-specific, owl:sameAs is inappropriate • stores co-reference information as a first-class entity • ontology-level alignment should precede data-level alignment [Nikolov, Uren, Motta 09] 10
  • 11. Growing the data web • how to get data out there? • challenges of the read-write Semantic Web • user awareness of social context of data (e.g. licensing, privacy) • view update problem • is the wiki model applicable? • incentives for posting data on the SW • validating existing Linked Data with Vapour [Berrueta, Fernandez, Frade 08] 11
  • 12. Examples of LOD data sets • DBpedia [Auer, Bizer, Kobilarov, Lehmann, Cyganiak, Ives 07] • extracts structured information from Wikipedia • linking hub for the LOD cloud • RDF Book Mashup [Bizer, Cyganiak, Gauss 07] • product metadata from Amazon.com 12
  • 13. Music and movies as Linked Data • Linked Movie Database [Hassanzadeh, Consens 09] • combines data from IMDb, Freebase, OMDB, DBPedia, RottenTomatoes.com, Stanford Movie Database • interlinked music datasets [Raimond, Sutton, Sandler 08] • combines data from Jamendo on DBTune, BBC John Peel sessions, SBSimilarity, Musicbrainz, DBpedia, Geonames • links artists, albums, tracks, personal music collections • generated links based similarity of resources, similarity of neighbors 13
  • 14. Other sources of data • the hypertext Web itself [Li, Zhao 08] • extraction of semantic links from hypertext links and hierarchical relationships among Web documents • RDF representation of HTML DOM from using SparqPlug [Coetzee, Heath, Motta 08] • multimedia metadata • interlinking multimedia fragments [Hausenblas, Troncy, Bürger, Raimond 09] 14
  • 15. Other sources of data (cont.) • XML Business Reporting Language (XBRL) [Garcia, Gil 09] • mapping data to RDF and schemas to OWL facilitates interoperability • large thesauri [Neubert 09] • as interlinking hubs for professional communities • enterprise data, e.g. technical documentation [Servant 08] • MARC21 bibliographic records [Styles, Ayers, Shabir 08] 15
  • 16. Mapping tools • D2R Server for customizable mappings from relational databases to ontologies [Bizer, Cyganiak 06] • browser-based tools for defining RDB-to-RDF mappings [Zhou, Xu, Chen, Idehen 08] • Triplify [Auer, Dietzold, Lehmann, Hellmann, Aumueller 09] • from generic data silos to Linked Data using OpenLink Data Spaces [Idehen, Erling 08] 16
  • 17. Aggregated resources • Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) • can be made Web-accessible with OAI2LOD Server [Haslhofer, Schandl 08] • Open Archives Initiative - Object Reuse and Exchange (OAI-ORE) [Van de Sompel, Lagoze, Nelson, Warner, Sanderson, Johnston 09] • adheres to Web principles 17
  • 18. User-driven Linked Data • existing Linked Data datasets are more appropriate for machine than human consumption • template-generated interlinks are of limited quality • data from existing silos quickly becomes out of date • need human involvement to grow the data web organically 18
  • 19. User-driven Linked Data (cont.) • direct modification using SPARQL/Update • e.g. in Tabulator [Berners-Lee, Hollenbach, Lu, Presbrey, Prud’hommeaux, Schraefel 08] • User Contributed Interlinking [Halb, Raimond, Hausenblas] • semantic wikis • Loomp [Roesch, Heese 09] • semantic annotation of content using a text editor interface 19
  • 20. User-driven Linked Data (cont.) • public data from existing social networks • wrappers for Web 2.0 services [Passant 08] • unifying personal identity across various networks [Rowe 09] • Semantically Interlinked Online Communities (SIOC) • integrating social media sites (forums, blogs, wikis, etc. with the data web [Bojars, Passant, Cyganiak, Breslin 08] • Meaning of a Tag (MOAT) ontology gives meaning to tags on Web 2.0 [Passant, Laublet 08] 20
  • 21. Usability and licensing • usability (for humans) of Linked Data [Halb, Raimond, Hausenblas 08] • current LOD datasets are primarily for machine consumption • low semantic strength of current LOD link sets • provenance information for Linked Data [Hartig 09] • Open Data Commons license [Miller, Styles, Heath 08] 21
  • 22. Indexing and searching • W3C’s TAP semantic search [Guha, McCool 01] • Swoogle [Ding, Finin, Joshi, Pan, Cost, Peng, Reddivari, Doshi, Sachs 04] • adapts PageRank concept to ontologies • SWSE [Hogan, Harth, Umbrich, Decker 07] • MultiCrawler [Harth, Umbrich, Decker 06] • RDF Gateway search • Watson document-based search • Falcons [Cheng, Ge, Wu, Qu 08] • textual search using class hierarchies for query restriction • Sindice Semantic Web index [Tummarello, Delbru, Oren 07] 22
  • 23. Link discovery • Silk link discovery framework [Volz, Bizer, Gaedke, Kobilarov 09] • find relationships between entities within different data sources • generation of owl:sameAs links • value of Web of Data depends on the amount and quality of links between data sources 23
  • 24. Navigation • like early Web, it’s easy to get “Lost in Hyperspace” • Tabulator generic Linked Data browser [Berners- Lee, Chen, Chilton, Connolly, Dhanaraj, Hollenbach, Lerer, Sheets 06] • encourage deployment of Linked Data • test, refine and promote Linked Data standards • faceted views over large-scale linked data with Virtuoso Cluster Edition [Erling 09] • Explorator RDF browser [Araujo, Schwabe 09] • exploratory search using direct manipulation 24
  • 25. Navigation (cont.) • DBPedia Mobile map view and faceted Linked Data browser [Becker, Bizer 08] • explore the geospatial Semantic Web • uses current GPS position as a starting point • potential for Linked Data publishing 25
  • 26. Navigation (cont.) • Fenfire generic Linked Data browser [Hastrup, Cyganiak, Bojars 08] • uses graph views rather than tables or outlines • shows graph data as directly as possible • related to Fentwine [Fallenstein, Lukka 04] 26
  • 27. Navigation (cont.) • Humboldt [Kobilarov, Dickinson 08] • exploratory browsing • faceted views • “resource at a time” • uses a “pivot” operation to refocus the view 27
  • 28. Navigation (cont.) • zLinks plugin [Bergman, Giasson 08] • WordPress plugin with supporting server • relates hypertext links with contextually relevant Linked Data • WOWY (WordNet, OpenCyc, Wikipedia, YAGO) • distinguish between types of resources • disambiguate alternate senses 28
  • 29. Navigation (cont.) • mapping of Linked Data to a file system model [Schandl 09] • enables use of this data within desktop applications 29
  • 30. Other applications • how to use the data that is out there? • emerging applications which exploit Linked Data [Hausenblas 09] • integrating data sources related to drug and clinical trials [Jentzsch, Andersson, Hassanzadeh, Stephens, Bizer 09] • mashups • MashQL [Jarrar, Dikaiakos 09] • Internet is a database, mashup is a query over that database • benefit of specialized, independent Linked Data services acting together [Bojars, Passant, Giasson, Breslin 07] 30
  • 31. The gray area • U-P2P framework for peer-to-peer linked data [Davoust, Esfandiari 09] • data replication provides a measure of popularity • Linked Data with Named Graphs • e.g. interlinks with embedded provenance information [Zhao, Klyne, Shotton 08] • Ripple scripting language [Shinavier 07] • embeds Turing-complete programs in the Web of Data 31
  • 32. State of the data web • where are we with the Linked Data graph? • size • number and type of links • usefulness to end users • network characteristics • single-point-of-access (e.g. DBpedia, GeoNames) vs. distributed datasets (e.g. FOAF-o-sphere, SIOC-land) • syntactic and semantic analysis of the LOD dataset [Hausenblas, Halb, Raimond, Heath 08] 32
  • 33. Statistics of the data web • today’s Linked Data is very different than the first- generation data web [Halpin 09] • LOD data accounts for the vast majority of data • power-law distributions are emerging • data web is not growing organically • Web standards are generally adhered to • is Linked Data useful to ordinary users? • sampling of Linked Data using Live.com query logs and FALCON-S semantic search engine 33
  • 34. Query popularity follows a power law • ... 34
  • 35. URI frequency... not so much • ... 35
  • 36. Data publishing lacks a “long tail” • ... 36
  • 37. A few dominant ontologies are emerging # of URIs by vocabulary 37
  • 38. (DBpedia bias) # of URIs by domain name 38
  • 39. Graph analysis for the data web • common network analysis techniques can be used to investigate interoperability and structural patterns of the LOD cloud [Rodriguez 09] • results based on March 2009 statistics of the LOD data set graph: • LOD graph is not strongly connected • diameter of 8 is large given relatively small size of the cloud • data sets have nearly identical incoming and outgoing link patterns (⇒ majority of reciprocal owl:sameAs links) 39
  • 40. Ranking and clustering of LOD data sets 40
  • 41. Original slide show: • http://tw.rpi.edu/proj/portal.wiki/images/f/f0/ LinkedData.pdf • References: • http://tw.rpi.edu/proj/portal.wiki/images/e/e0/ LinkedDataSurvey.pdf • BibTeX: • http://tw.rpi.edu/proj/portal.wiki/images/3/37/ LinkedDataSurvey.bbl 41