Discovery Hub: on-the-fly linked data exploratory search
Nicolas Marie, Fabien Gandon, Myriam Ribière
Florentin Rodio, Damien Legrand
CONTEXT
PROPOSITION
EVALUATION
CONCLUSION
Search…
Lookup: precise information need, e.g. « members » + « The Beatles »
Exploratory: fuzzy information need (???)
you are here
related work…
Criterion | Aemoo | Kaminskas & al. | LED | MORE | Seevl | Yovisto
Purpose | Exploratory search | Cross-domain recommendation | Exploratory search on ICT domain | Film recommendation | Musical recommendation | Video exploratory search
Data | DBpedia EN + external services | DBpedia EN subset | DBpedia + external services | DBpedia EN subset | DBpedia EN subset | DBpedia EN+DE subset
Multi-domain | Yes | Cross two domains | No | No, cinema | No, music | Yes
Query | Entity search | Entity selection in a pre-processed list | Entity search | Entity search | Entity recognition from YouTube | Entity recognition in keywords
Algorithm | EKP filtered view | weighted activation | DBpedia Ranker | sVSM algo. | DBrec algorithm | Set of heuristics
Ranking | No | Yes | Yes | Yes | Yes | Yes
Explanations | Wikipedia-based | Path-based | No | Shared properties | Shared properties | No
Offline proc. | Yes, EKP part | Yes | Yes | Yes | Yes | Yes
goal: domain-independent, customizable, on the fly, remote sources
composite interest queries
Knowing my interest in X and Y, what can I discover or learn that is related to all these resources?
Example: The Beatles + Ken Loach
PROPOSITION
principle
1. results selection
2. ranking
3. sorting/categorization
4. explanations
http://dbpedia.org/resource/Ken_Loach
…dbpedia.org/resource/The_Beatles
research questions
1. How can we discover linked resources of interest to be explored?
2. How to address remote LOD sources for this?
3. How to present and explain the results to the user for an exploratory objective?
http://fr.dbpedia.org/sparql
http://es.dbpedia.org/sparql
http://it.dbpedia.org/sparql
semantic adaptation of spreading activation
[Figure: graph whose nodes carry activation values between 0.1 and 1]
example of semantic spreading activation
propagation domain: Album, Band, Film, Musical Artist, Music Genre, Person, Radio Station, Single, Song, Television Show
propagation domain: Company, Election, Film, Journalist, Musical Artist, Newspaper, Office Holder, Organisation, Politician, School, Single, Television Show, Writer
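The idea of restricting propagation to a set of types can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the toy graph, the type names, the `decay` factor, the even split of activation among neighbours, and the way the composite query sums per-seed activations are all hypothetical, since the slides do not give the actual weighting function w(i,o).

```python
from collections import defaultdict

# Invented toy graph (undirected adjacency lists) and node types.
GRAPH = {
    "seed_a": ["film_1", "artist_1", "city_1"],
    "seed_b": ["film_1", "film_2"],
    "film_1": ["seed_a", "seed_b"],
    "film_2": ["seed_b"],
    "artist_1": ["seed_a"],
    "city_1": ["seed_a"],
}
TYPES = {
    "seed_a": "Band",
    "seed_b": "Person",
    "film_1": "Film",
    "film_2": "Film",
    "artist_1": "MusicalArtist",
    "city_1": "City",
}


def spread(seed, domain, max_pulse=2, decay=0.5):
    """Propagate activation from one seed, only through nodes whose type is in `domain`."""
    activation = {seed: 1.0}
    for _ in range(max_pulse):
        incoming = defaultdict(float)
        for node, value in activation.items():
            # Semantic filter: neighbours outside the propagation domain are ignored.
            targets = [n for n in GRAPH[node] if TYPES[n] in domain]
            for target in targets:
                incoming[target] += decay * value / len(targets)
        for node, value in incoming.items():
            activation[node] = activation.get(node, 0.0) + value
    return activation


def composite_query(seeds, domain, max_pulse=2):
    """Rank the non-seed nodes that were activated from every seed."""
    runs = [spread(seed, domain, max_pulse) for seed in seeds]
    shared = set.intersection(*(set(run) for run in runs)) - set(seeds)
    scores = {node: sum(run[node] for run in runs) for node in shared}
    return sorted(scores, key=scores.get, reverse=True)


DOMAIN = {"Band", "Person", "Film", "MusicalArtist"}
print(composite_query(["seed_a", "seed_b"], DOMAIN))  # ['film_1']
```

In this toy run `city_1` is never activated because its type lies outside the propagation domain, and `film_1` is the only result because it is the only non-seed node reached from both seeds within two pulses.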
research question 2: How to address remote LOD sources for this?
sampling algorithm
sparql endpoint = http://xxx/sparql
seeds = xxx/The_Beatles, xxx/Ken_Loach
compute the propagation domain (w(i,o))
find a path between the seeds
import path nodes & their neighbors
for (i = 1; i <= maxPulse; i++) {
    pulse();
    if (sampleSize <= maxSampleSize) {
        extend the sample
    }
}
iterative import: Online LOD source → Local Kgram instance
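The sampling loop can be sketched in Python as follows. This is a minimal sketch, not the authors' implementation: the remote source is replaced by an in-memory dictionary, `fetch_neighbours` is a hypothetical stand-in for a SPARQL call, the pulse itself is omitted, and "extend the sample" is assumed to mean importing the neighbours of every node already held locally.

```python
from collections import deque

# Stand-in for the online LOD source: an invented graph as adjacency lists.
REMOTE = {
    "a": ["b", "x"],
    "b": ["a", "c", "y"],
    "c": ["b", "z"],
    "x": ["a", "p"],
    "y": ["b", "q"],
    "z": ["c"],
    "p": ["x"],
    "q": ["y"],
}


def fetch_neighbours(node):
    """Hypothetical remote call; a real system would query the SPARQL endpoint."""
    return REMOTE.get(node, [])


def find_path(start, goal):
    """Breadth-first search for a path between the two seeds."""
    parents = {start: None}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parents[node]
            return path[::-1]
        for nxt in fetch_neighbours(node):
            if nxt not in parents:
                parents[nxt] = node
                queue.append(nxt)
    return []


def build_sample(seed_a, seed_b, max_pulse=3, max_sample_size=7):
    """Import the path between the seeds, then grow the local sample pulse by pulse."""
    sample = set()

    def import_with_neighbours(node):
        sample.add(node)
        sample.update(fetch_neighbours(node))

    for node in find_path(seed_a, seed_b):
        import_with_neighbours(node)
    for _ in range(max_pulse):
        # pulse(): one spreading-activation step over the local sample (omitted here).
        if len(sample) <= max_sample_size:
            for node in list(sample):
                import_with_neighbours(node)
    return sample


print(sorted(build_sample("a", "c")))
```

Because the size check happens before each extension, the sample can end slightly above `max_sample_size`, which mirrors the order of the test in the pseudocode.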
magic numbers: maxPulse and maxSampleSize in the sampling algorithm above
[Figure: Sample size influence on top 100 results (maxSampleSize). X axis: triples loading limit (0 to 20000); Y axes: Kendall Tau (0 to 1) and response time (0 to 4500).]
[Figure: Convergence, top 100 results (maxPulse). X axis: iterations (1 to 10); Y axes: Kendall Tau (0 to 1) and shared results (0 to 100).]
[Figure: Queries response time histogram. X axis: seconds.]
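The parameter studies report Kendall Tau and the number of shared results between top-100 lists. The slides do not say exactly how the two lists were compared, so the following Python sketch rests on an assumption: the correlation is computed only over items present in both lists, with no ties. The function names are hypothetical.

```python
from itertools import combinations


def shared_results(list_a, list_b):
    """Number of items present in both result lists."""
    return len(set(list_a) & set(list_b))


def kendall_tau(list_a, list_b):
    """Kendall tau-a over the items the two rankings share (no ties assumed)."""
    pos_a = {item: i for i, item in enumerate(list_a)}
    pos_b = {item: i for i, item in enumerate(list_b)}
    common = [item for item in list_a if item in pos_b]
    pairs = list(combinations(common, 2))
    if not pairs:
        raise ValueError("need at least two shared items")
    score = 0
    for x, y in pairs:
        # Same relative order in both lists: concordant (+1); otherwise discordant (-1).
        same_order = (pos_a[x] - pos_a[y]) * (pos_b[x] - pos_b[y]) > 0
        score += 1 if same_order else -1
    return score / len(pairs)


reference = ["r1", "r2", "r3", "r4"]
sampled = ["r1", "r3", "r2", "r5"]
print(shared_results(reference, sampled))  # 3
print(kendall_tau(reference, sampled))
```

A value of 1 means the shared items appear in the same order in both lists, and -1 means the order is fully reversed.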
research question 3: How to present and explain the results to the user for an exploratory objective?
Discovery Hub 1.0
1. Start from what you like or are interested in
2. Explore, understand, discover
3. Be redirected on third-party platforms to continue the discovery experience
Discovery Hub 1.0: short demo
EVALUATION
composite queries
• built by randomly combining the Facebook likes of 12 users
• two queries per participant, who judged the top 20 results on two statements:
- The result interests me [Strongly Disagree … Strongly Agree]
- The result is unexpected [Strongly Disagree … Strongly Agree]
[Figure: rating scale from Not interesting at all to Very interesting]
overall
• 61.6% of the results were rated as strongly relevant or relevant by the participants.
• 65% of the results were rated as strongly unexpected or unexpected.
• 35.42% of the results were rated both as strongly relevant or relevant and strongly unexpected or unexpected.
Explanatory features evaluation
[Figure: helpfulness ratings, from Not helpful at all to Very Helpful, for Common prop., Wiki-based and Graph-based explanations, and Overall]
comparison SSA (Discovery Hub) vs. sVSM (MORE)
• Hypothesis 1: SSA gives results at least as relevant as sVSM.
• Hypothesis 2: SSA degrades less than sVSM towards the end of the lists (better end-lists).
• Hypothesis 3: results at the end of the lists are less relevant but newer to users.
• Hypothesis 4: advanced search gives better results than the standard query.
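Measure | Algo | Rank | Mean | St. Dev.
Relevance | SSA | 1-10 | 1.54 | 0.305
Relevance | SSA | 11-20 | 1.28 | 0.243
Relevance | sVSM | 1-10 | 1.42 | 0.294
Relevance | sVSM | 11-20 | 0.93 | 0.228
Discovery | SSA | 1-10 | 1.10 | 0.247
Discovery | SSA | 11-20 | 1.21 | 0.228
Discovery | sVSM | 1-10 | 1.14 | 0.251
Discovery | sVSM | 11-20 | 1.50 | 0.205
[Figure: score (0 to 2) of SSA vs. sVSM per query: 2001, Erin, Term, Princess, Fight, Overall]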
CONCLUSION
• semantic spreading activation algorithm coupled to graph sampling to address remote LOD sources.
• faceted browsing and multiple explanations of the results.
• ongoing extensive user evaluation
• publicly available: http://discoveryhub.co
Discovery Hub: enabling exploratory search starting from several interests using linked data sources
current work:
- propagation over multiple data sources in parallel.
- redesign of the interface: Discovery Hub 2.0 released
perspective: other applications of semantic spreading activation
multi-lingual mode
dbpedia:Charles_Baudelaire sameAs fr.dbpedia:Charles_Baudelaire
French
English
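The sameAs link above is what lets a resource in one language chapter be matched to its counterpart in another. How Discovery Hub exploits these links is not detailed in the slides; the Python sketch below only illustrates the general idea of collecting every identifier equivalent to a resource, relying on the fact that owl:sameAs is symmetric and transitive. The `equivalents` helper and the in-memory link list are hypothetical.

```python
# sameAs statements as (subject, object) pairs; the single pair comes from the slide.
SAME_AS = [
    ("dbpedia:Charles_Baudelaire", "fr.dbpedia:Charles_Baudelaire"),
]


def equivalents(resource, links):
    """All identifiers reachable from `resource` through sameAs links."""
    found = {resource}
    frontier = [resource]
    while frontier:
        current = frontier.pop()
        for subject, obj in links:
            # Follow the link in both directions, since sameAs is symmetric.
            for source, target in ((subject, obj), (obj, subject)):
                if source == current and target not in found:
                    found.add(target)
                    frontier.append(target)
    return found


print(sorted(equivalents("fr.dbpedia:Charles_Baudelaire", SAME_AS)))
```

With such a set, a seed chosen in the French chapter could be resolved to its English identifier before querying the English endpoint, or the other way round.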
http://discoveryhub.co/
@discovery_hub
werarediscoveryhub@gmail.com

Discovery Hub: on-the-fly linked data exploratory search

  • 1. Discovery Hub: on-the-fly linked data exploratory search. Nicolas Marie, Fabien Gandon, Myriam Ribière, Florentin Rodio, Damien Legrand
  • 3. Search… Lookup: precise information need (« members » + « The Beatles »). Exploratory: fuzzy information need (???). You are here.
  • 4. related work…

    |               | Aemoo                          | Kaminskas & al.                          | LED                               | MORE                | Seevl                           | Yovisto                        |
    |---------------|--------------------------------|------------------------------------------|-----------------------------------|---------------------|---------------------------------|--------------------------------|
    | Purpose       | Exploratory search             | Cross-domain recommendation              | Exploratory search on ICT domain  | Film recommendation | Musical recommendation          | Video exploratory search       |
    | Data          | DBpedia EN + external services | DBpedia EN subset                        | DBpedia + external services       | DBpedia EN subset   | DBpedia EN subset               | DBpedia EN+DE subset           |
    | Multi-domain  | Yes                            | Cross two domains                        | No                                | No, cinema          | No, music                       | Yes                            |
    | Query         | Entity search                  | Entity selection in a pre-processed list | Entity search                     | Entity search       | Entity recognition from YouTube | Entity recognition in keywords |
    | Algorithm     | EKP filtered view              | Weighted activation                      | DBpedia Ranker                    | sVSM algo.          | DBrec algorithm                 | Set of heuristics              |
    | Ranking       | No                             | Yes                                      | Yes                               | Yes                 | Yes                             | Yes                            |
    | Explanations  | Wikipedia-based                | Path-based                               | No                                | Shared prop.        | Shared properties               | No                             |
    | Offline proc. | Yes, EKP part                  | Yes                                      | Yes                               | Yes                 | Yes                             | Yes                            |

    goal: domain-independent, customizable, on the fly, remote sources
  • 5. composite interest queries: knowing my interest in X and Y, what can I discover/learn that is related to all these resources? Example: The Beatles, Ken Loach
  • 8. research questions: 1. How can we discover linked resources of interest to be explored? 2. How to address remote LOD sources for this? 3. How to present and explain the results to the user for an exploratory objective? Endpoints shown: http://fr.dbpedia.org/sparql, http://es.dbpedia.org/sparql, http://it.dbpedia.org/sparql
  • 9. semantic adaptation of spreading activation [figure: graph annotated with values ranging from 0,1 to 1]
  • 10. example of semantic spreading activation
  • 11. propagation domain: Album, Band, Film, Musical Artist, Music Genre, Person, Radio Station, Single, Song, Television Show. propagation domain: Company, Election, Film, Journalist, Musical Artist, Newspaper, Office Holder, Organisation, Politician, School, Single, Television Show, Writer.
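
To make the idea of a propagation domain concrete, here is a minimal sketch of spreading activation in which only nodes whose type belongs to the domain can receive activation. It is a simplification under stated assumptions: the function name `spread`, the uniform split among neighbours, and the `decay` factor are all hypothetical, and the slides' weighting function w(i,o) is not reproduced.

```python
from collections import defaultdict

def spread(graph, types, seed, domain, pulses=3, decay=0.5):
    """Toy spreading activation restricted to a propagation domain.

    graph:  node -> list of neighbour nodes
    types:  node -> type label
    domain: set of type labels allowed to receive activation
    (uniform weights and `decay` are assumptions, not the authors' formula)
    """
    activation = defaultdict(float)
    activation[seed] = 1.0
    for _ in range(pulses):
        incoming = defaultdict(float)
        for node, value in activation.items():
            # Only neighbours whose type is in the domain are eligible.
            targets = [n for n in graph.get(node, []) if types.get(n) in domain]
            for n in targets:
                # Each node splits a decayed share of its activation evenly.
                incoming[n] += decay * value / len(targets)
        for n, v in incoming.items():
            activation[n] += v
    activation.pop(seed, None)  # the seed itself is not a result
    return sorted(activation.items(), key=lambda kv: kv[1], reverse=True)

# Hypothetical toy graph: the city is filtered out by the domain.
graph = {"The_Beatles": ["John_Lennon", "Liverpool", "Abbey_Road"]}
types = {"John_Lennon": "Musical Artist", "Liverpool": "City", "Abbey_Road": "Album"}
ranked = spread(graph, types, "The_Beatles", {"Musical Artist", "Album"}, pulses=1)
```

With one pulse, the two eligible neighbours each receive 0.25 and `Liverpool` receives nothing, which is the filtering effect the propagation domain is meant to have.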
  • 12. research questions: 1. How can we discover linked resources of interest to be explored? 2. How to address remote LOD sources for it? 3. How to present and explain the results to the user for an exploratory objective? Endpoints shown: http://fr.dbpedia.org/sparql, http://es.dbpedia.org/sparql, http://it.dbpedia.org/sparql
  • 13. sampling algorithm
      sparql endpoint = http://xxx/sparql
      seeds = xxx/The_Beatles, xxx/Ken_Loach
      compute the propagation domain (w(i,o))
      find a path between the seeds
      import path nodes & their neighbors
      for (i = 1; i <= maxPulse; i++) {
        pulse();
        if (sampleSize <= maxSampleSize) {
          extend the sample
        }
      }
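
The sampling steps on this slide can be sketched as follows. This is an illustration only, not the Discovery Hub implementation: `fetch` is a hypothetical callable standing in for a remote SPARQL lookup, the sample size is counted in nodes rather than triples, the pulse itself is left as a placeholder comment, and extending the sample by importing the current frontier is an assumption, since the slide does not say how the sample is extended.

```python
from collections import deque

def find_path(fetch, start, goal):
    """Breadth-first search for a path between the two seeds."""
    parents, queue = {start: None}, deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parents[node]
            return path[::-1]
        for nxt in fetch(node):
            if nxt not in parents:
                parents[nxt] = node
                queue.append(nxt)
    return []

def build_sample(fetch, seeds, max_pulse, max_sample_size):
    """Import a local sample of a remote graph, one pulse at a time."""
    sample = {}  # local copy: node -> neighbours

    def import_node(node):
        # Stop importing once the size limit is reached.
        if node not in sample and len(sample) < max_sample_size:
            sample[node] = list(fetch(node))

    # Import the path nodes (or just the seeds if no path exists).
    for node in find_path(fetch, seeds[0], seeds[1]) or seeds:
        import_node(node)
    # Import the neighbours of the path nodes.
    for node in list(sample):
        for nxt in sample[node]:
            import_node(nxt)
    for _ in range(max_pulse):
        # Placeholder: a real pulse would propagate activation here.
        if len(sample) >= max_sample_size:
            break
        frontier = [n for node in list(sample) for n in sample[node] if n not in sample]
        if not frontier:
            break
        for n in frontier:
            import_node(n)
    return sample

# Hypothetical in-memory stand-in for a remote LOD source.
remote = {
    "Beatles": ["Lennon"],
    "Lennon": ["Beatles", "Loach_film"],
    "Loach_film": ["Lennon", "Loach"],
    "Loach": ["Loach_film", "Cannes"],
    "Cannes": ["Loach"],
}
fetch = lambda node: remote.get(node, [])
sample = build_sample(fetch, ["Beatles", "Loach"], max_pulse=2, max_sample_size=10)
```

The two loop bounds, `max_pulse` and `max_sample_size`, correspond to the `maxPulse` and `maxSampleSize` parameters in the slide's pseudocode.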
  • 14. iterative import Local Kgram instance Online LOD source
  • 15. magic numbers
      sparql endpoint = http://xxx/sparql
      seeds = xxx/The_Beatles, xxx/Ken_Loach
      compute the propagation domain (w(i,o))
      find a path between the seeds
      import path nodes & their neighbors
      for (i = 1; i <= maxPulse; i++) {
        pulse();
        if (sampleSize <= maxSampleSize) {
          extend the sample
        }
      }
  • 16. Sample size influence on top 100 results, maxSampleSize [plot: Kendall Tau (0 to 1) and response time (0 to 4500) against triples loading limit (0 to 20000)]
  • 17. Convergence, top 100 results, maxPulse [plot: Kendall-Tau (0 to 1) and shared results (0 to 100) against iterations (1 to 10)]
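
The two measures named on these plots, shared results and Kendall tau, compare a top-k list against a reference list. The sketch below shows one plausible way to compute them; the function name `compare_top_k` and the choice to compute tau only over the items both lists share are assumptions, since the slides do not state how the values were obtained.

```python
from itertools import combinations

def compare_top_k(reference, candidate):
    """Return (number of shared results, Kendall tau over the shared items).

    Tau is (concordant - discordant) / number of pairs, computed on the items
    present in both lists; it is None when fewer than two items are shared.
    """
    position = {item: rank for rank, item in enumerate(candidate)}
    shared = [item for item in reference if item in position]
    if len(shared) < 2:
        return len(shared), None
    concordant = discordant = 0
    # `shared` is in reference order, so a always precedes b in the reference.
    for a, b in combinations(shared, 2):
        if position[a] < position[b]:
            concordant += 1
        else:
            discordant += 1
    return len(shared), (concordant - discordant) / (concordant + discordant)

shared_count, tau = compare_top_k(["a", "b", "c", "d"], ["a", "c", "b", "e"])
```

In this toy case three items are shared and one of the three pairs is swapped, so tau is (2 - 1) / 3.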
  • 19. research questions: 1. How can we discover linked resources of interest to be explored? 2. How to address remote LOD sources for it? 3. How to present and explain the results to the user for an exploratory objective? Endpoints shown: http://fr.dbpedia.org/sparql, http://es.dbpedia.org/sparql, http://it.dbpedia.org/sparql
  • 20. Discovery Hub 1.0: 1. Start from what you like or are interested in. 2. Explore, understand, discover… 3. Be redirected to third-party platforms to continue the discovery experience.
  • 24. composite queries
      • randomly combining Facebook likes of 12 users
      • two queries for each participant, who judged the top 20 results:
        - The result interests me [Strongly Disagree … Strongly Agree]
        - The result is unexpected [Strongly Disagree … Strongly Agree]
      [chart scale: Very interesting … Not interesting at all]
  • 25. overall
      • 61.6% of the results were rated as strongly relevant or relevant by the participants.
      • 65% of the results were rated as strongly unexpected or unexpected.
      • 35.42% of the results were rated both as strongly relevant or relevant and strongly unexpected or unexpected.
  • 26. Explanatory features evaluation [chart: helpfulness, from "Not helpful at all" to "Very Helpful", for Common prop., Wiki-based, Graph-based, and Overall]
  • 27. comparison SSA (Discovery Hub) vs. sVSM (MORE)
      • Hypothesis 1: SSA gives results at least as relevant as sVSM.
      • Hypothesis 2: SSA has a weaker degradation than sVSM (better end-lists).
      • Hypothesis 3: results less relevant but newer to users at the end of the lists.
      • Hypothesis 4: advanced search gives better results compared to standard query.

    | Measure   | Algo | Rank  | Mean | St. Dev. |
    |-----------|------|-------|------|----------|
    | Relevance | SSA  | 1-10  | 1.54 | 0.305    |
    | Relevance | SSA  | 11-20 | 1.28 | 0.243    |
    | Relevance | sVSM | 1-10  | 1.42 | 0.294    |
    | Relevance | sVSM | 11-20 | 0.93 | 0.228    |
    | Discovery | SSA  | 1-10  | 1.10 | 0.247    |
    | Discovery | SSA  | 11-20 | 1.21 | 0.228    |
    | Discovery | sVSM | 1-10  | 1.14 | 0.251    |
    | Discovery | sVSM | 11-20 | 1.50 | 0.205    |

    [chart: score (0 to 2) for SSA and sVSM on 2001, Erin, Term, Princess, Fight, and Overall]
  • 29. Discovery Hub: enabling exploratory search starting from several interests using linked data sources
      • semantic spreading activation algorithm coupled with graph sampling to address remote LOD sources.
      • faceted browsing and multiple explanations of the results.
      • ongoing extensive user evaluation.
      • publicly available: http://discoveryhub.co
  • 30. current work:
      - propagation over multiple data sources in parallel.
      - redesign of the interface: Discovery Hub 2.0 released
    perspective: other applications of semantic spreading activation
  • 31. multi-lingual mode: dbpedia:Charles_Baudelaire (English) sameAs fr.dbpedia:Charles_Baudelaire (French)
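
As an illustration of how the sameAs link on this slide could be followed, the sketch below builds a SPARQL query that looks up the `owl:sameAs` counterparts of a resource in another DBpedia chapter, plus the GET URL that would carry it. The helper names `same_as_query` and `request_url` are hypothetical, the use of the `http://dbpedia.org/sparql` endpoint is an assumption, and nothing is sent over the network.

```python
from urllib.parse import urlencode

def same_as_query(resource_uri, target_prefix):
    """Build a SPARQL query for owl:sameAs links into another chapter."""
    return (
        "PREFIX owl: <http://www.w3.org/2002/07/owl#>\n"
        "SELECT ?other WHERE {\n"
        f"  <{resource_uri}> owl:sameAs ?other .\n"
        f'  FILTER(STRSTARTS(STR(?other), "{target_prefix}"))\n'
        "}"
    )

def request_url(endpoint, query):
    """Encode the query as a GET request URL (SPARQL protocol `query` parameter)."""
    return endpoint + "?" + urlencode({"query": query})

query = same_as_query(
    "http://dbpedia.org/resource/Charles_Baudelaire",  # English resource
    "http://fr.dbpedia.org/resource/",                 # keep only French counterparts
)
url = request_url("http://dbpedia.org/sparql", query)  # assumed endpoint
```

The resulting URL can be passed to any HTTP client; the counterpart URI returned would then serve as the seed on the French endpoint.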