10th International Conference on Web Engineering, Vienna
July 5-9, 2010
Ranking the Linked Data:
the case of DBpedia
Roberto Mirizzi¹, Azzurra Ragone¹ ², Tommaso Di Noia¹, Eugenio Di Sciascio¹
¹ Politecnico di Bari, Via Orabona, 4, 70125 Bari (ITALY)
² University of Trento, Via Sommarive, 14, 38100 Trento (ITALY)
Outline
• Tags are all around
• NOT (Not Only Tag): what is it?
• NOT: a look behind the curtains
– Ranking of RDF resources: a hybrid approach
• Evaluation
• Conclusion and Future Work
Tags are all around
Tag cloud
and many more…
Tagging: two faces
• Annotation phase
• Retrieval phase
Problems with annotation
• Insert as many tags as possible (time-consuming):
– different versions of the same tag to catch all
possible searches
– multilingual tags
Problem with retrieval
• Exact (syntactic) match among tags: "web service"
is different from "web services", "webservices", …
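A minimal Python sketch of the retrieval problem, assuming tags are stored as plain strings. The function name and the sample tags are hypothetical and only illustrate why exact matching misses close variants.

```python
def syntactic_match(query_tag: str, resource_tags: set[str]) -> bool:
    """Return True only if the query tag is literally one of the resource's tags."""
    return query_tag in resource_tags


# Hypothetical tags attached to one bookmarked resource.
resource_tags = {"web services", "soa"}

# The literal tag is found, but its singular and fused variants are not.
for query in ("web services", "web service", "webservices"):
    print(query, syntactic_match(query, resource_tags))
```

With purely syntactic matching, the annotator has to add every variant by hand, which is the annotation problem described on the previous slide.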
Why not use semantic tags?
• Plugged into the Web 3.0
• Disambiguation
• Relations among tags
• Machine-understandable
NOT: Not Only Tag
http://sisinflab.poliba.it/not-only-tag/
Demo
• Let's imagine tagging this book:
NOT
http://sisinflab.poliba.it/not-only-tag/
Smarter tagging
• Annotation phase
• Retrieval phase
What is behind NOT?
• DBpedia graph exploration
• Computation of a similarity value between each
pair of RDF resources using external
information sources (search engines,
bookmarking systems)
What is behind NOT? (II)
What is behind NOT? (III)
What is behind NOT? (IV)
[Figure: fragment of the DBpedia graph. Legend: Article, Category, skos:subject, skos:broader]
Nodes labelled with DBpedia identifiers: Semantic_Web, XML-based_standards, Knowledge_representation, Data_management, Internet_architecture, Triplestores, Folksonomy, XML, Computer_and_telecommunication_standards, Web_services, User_interface_markup_languages, Scalable_Vector_Graphics, Microformats, …
Nodes labelled with article titles: Resource Description Framework, Microformat, RDFa, …
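The graph exploration behind NOT can be sketched as a breadth-first walk over skos:subject and skos:broader links. This is a minimal sketch under stated assumptions: the edges below are hypothetical (they are not read off the slide), links are followed in both directions, and the `max_depth` cut-off is an illustrative parameter rather than a documented setting of the system.

```python
from collections import defaultdict, deque

# Hypothetical edges, for illustration only.
# article -> categories (skos:subject)
SKOS_SUBJECT = {
    "Resource_Description_Framework": ["Semantic_Web", "Knowledge_representation"],
    "RDFa": ["Semantic_Web", "Microformats"],
    "Microformat": ["Microformats"],
}
# category -> broader categories (skos:broader)
SKOS_BROADER = {
    "Semantic_Web": ["Internet_architecture"],
    "Microformats": ["XML-based_standards"],
    "XML-based_standards": ["XML"],
}


def build_neighbours(*edge_maps):
    """Merge the edge maps into one undirected adjacency map."""
    neighbours = defaultdict(set)
    for edges in edge_maps:
        for source, targets in edges.items():
            for target in targets:
                neighbours[source].add(target)
                neighbours[target].add(source)
    return neighbours


def explore(root, neighbours, max_depth):
    """Breadth-first exploration: map each reached node to its hop distance from root."""
    depth = {root: 0}
    queue = deque([root])
    while queue:
        node = queue.popleft()
        if depth[node] == max_depth:
            continue  # do not expand nodes sitting on the depth limit
        for nxt in sorted(neighbours.get(node, ())):
            if nxt not in depth:
                depth[nxt] = depth[node] + 1
                queue.append(nxt)
    return depth


NEIGHBOURS = build_neighbours(SKOS_SUBJECT, SKOS_BROADER)
print(explore("RDFa", NEIGHBOURS, 2))
```

Every pair made of the root and a reached node is then a candidate for the similarity computation.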
DBpedia-Ranker: hybrid ranking
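[Figure: resources ?r1 and ?r2 linked by isSimilar, with hasValue v]
External sources-based ranking
\[
sim(r_1, r_2) = \frac{f(r_1, r_2)}{f(r_1)} + \frac{f(r_1, r_2)}{f(r_2)}
\]
Graph-based ranking
\[
wikilinkScore(r_1, r_2) =
\begin{cases}
0, & \text{no wikilink between } r_1 \text{ and } r_2 \\
1, & \text{wikilink between } r_1 \text{ and } r_2 \text{ or vice versa} \\
2, & \text{wikilink between } r_1 \text{ and } r_2 \text{ and vice versa}
\end{cases}
\]
\[
abstractScore(r_1, r_2) = \frac{l(r_2, r_1)}{l(r_2)}
\]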
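The three scores on this slide can be sketched in Python as follows. The sketch rests on assumptions that the slide does not spell out: f(r) and f(r1, r2) are taken to be hit counts returned by an external source for one resource and for the pair, l(r2) is read as the number of words in the label of r2, and l(r2, r1) as the number of those words that also occur in the abstract of r1. All function and parameter names are hypothetical, and how the scores are combined into the final value v is not shown on the slide, so the sketch keeps them separate.

```python
def sim(f1: int, f2: int, f12: int) -> float:
    """External-source similarity: f(r1, r2)/f(r1) + f(r1, r2)/f(r2)."""
    if f1 == 0 or f2 == 0:
        return 0.0  # assumed convention when a resource has no hits at all
    return f12 / f1 + f12 / f2


def wikilink_score(wikilinks: set, r1: str, r2: str) -> int:
    """0 if no wikilink, 1 if linked in one direction, 2 if linked in both."""
    return int((r1, r2) in wikilinks) + int((r2, r1) in wikilinks)


def abstract_score(label_r2: str, abstract_r1: str) -> float:
    """Fraction of the words in r2's label that also occur in r1's abstract (assumed reading of l)."""
    label_words = label_r2.lower().split()
    if not label_words:
        return 0.0
    abstract_words = set(abstract_r1.lower().split())
    return sum(word in abstract_words for word in label_words) / len(label_words)


# Hypothetical counts and links, for illustration only.
links = {("RDFa", "Resource_Description_Framework")}
print(sim(100, 50, 25))
print(wikilink_score(links, "RDFa", "Resource_Description_Framework"))
print(abstract_score("Semantic Web", "RDFa adds structured data to the semantic web"))
```

Note that sim ranges from 0 to 2, because each of its two terms is at most 1 when the pair count never exceeds either single count.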
Functional Architecture
[Architecture diagram; labels shown: Back-end, Query engine, Storage, Cloud Generator, GUI, Graph Explorer, SPARQL, Context Analyzer, Ranker]
External information sources and services shown: DBpedia Lookup Service, Delicious, Yahoo!, Bing, Google
Offline computation
1. Linked Data graph exploration
2. Rank nodes exploiting external information
3. Store results as pairs of nodes together with their similarity
Runtime search
1. Start typing a tag
2. Query the system for relevant tags (corresponding to DBpedia resources)
3. Show the semantic tag cloud
Evaluation
We evaluate five different algorithms:
1. DBpediaRanker
2. DBpediaRanker minus Wikipedia information
3. DBpediaRanker minus external information sources
4. Co-occurrence
5. Similarity Distance
\[
coOcc(r_1, r_2) = \frac{f(r_1, r_2)}{f(r_1) + f(r_2) - f(r_1, r_2)}
\]
\[
ngd(r_1, r_2) = \frac{\max\{\log f(r_1), \log f(r_2)\} - \log f(r_1, r_2)}{\log N - \min\{\log f(r_1), \log f(r_2)\}}
\]
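The two baselines in the evaluation, co-occurrence and similarity distance, can be sketched in Python. The sketch assumes that f(r) and f(r1, r2) are hit counts from an external source and that N is the total number of indexed documents, larger than any single count. Function names and the guard values returned for degenerate inputs are illustrative choices, not part of the slides.

```python
import math


def co_occurrence(f1: int, f2: int, f12: int) -> float:
    """coOcc = f(r1, r2) / (f(r1) + f(r2) - f(r1, r2))."""
    denominator = f1 + f2 - f12
    return f12 / denominator if denominator > 0 else 0.0


def ngd(f1: int, f2: int, f12: int, n: int) -> float:
    """ngd = (max(log f1, log f2) - log f12) / (log N - min(log f1, log f2))."""
    if min(f1, f2, f12) <= 0:
        return math.inf  # assumed convention: terms that never co-occur are infinitely distant
    log_f1, log_f2, log_f12 = math.log(f1), math.log(f2), math.log(f12)
    denominator = math.log(n) - min(log_f1, log_f2)
    if denominator == 0:
        return math.inf  # degenerate case where N equals the smaller count
    return (max(log_f1, log_f2) - log_f12) / denominator


# Hypothetical hit counts, for illustration only.
print(co_occurrence(100, 50, 25))
print(ngd(1000, 100, 10, 10**6))
```

The two measures point in opposite directions: a higher co-occurrence means more related, whereas a lower distance means more related, so a distance has to be inverted or sorted ascending before it is compared with the other rankings.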
Evaluation (II)
http://sisinflab.poliba.it/evaluation
• 50 volunteers (researchers in the ICT area)
• 244 votes collected (on average 5 votes per user)
• Time to vote: 1 min 40 s
Evaluation (III)
http://sisinflab.poliba.it/evaluation/data
3.91 - Good
Conclusion
• NOT *is* useful in the annotation phase:
– suggestion of semantically related tags
– tag enrichment
• NOT *is* useful in the retrieval phase:
– semantic matching among tags
Future Work
Impakt Revolution
http://sisinflab.poliba.it/impakt-revolution/
Inspiration: Google Wonder Wheel
Exploratory search in Google…
…nice, but there is no "semantics" in it.
You cannot discover new knowledge by exploiting the meaning of a term (keyword/tag/query).
SWOC: Semantic Wonder Cloud
http://sisinflab.poliba.it/semantic-wonder-cloud/index/
Q&A
a.ragone@poliba.it
Thanks for being here on Friday! :-)
http://sisinflab.poliba.it/not-only-tag/
http://sisinflab.poliba.it/semantic-wonder-cloud/index/
http://sisinflab.poliba.it/impakt-revolution/
Conclusion
• NOT: a tool for smarter tagging
• Ranking algorithm for RDF graphs
Future work
• Test our algorithms on different domains
• Extract more fine-grained contexts
• Enrich the extracted context by also using relevant properties
• Integrate our approach with existing real-world systems
• Use the core system to automatically extract relevant tags
(concepts) from a document (or from a collection of
documents), exploiting tools for named-entity extraction
Editor's notes

  1. Search for: owl. Then add rdf. Then add owl.