SlideShare una empresa de Scribd logo
1 de 39
MOMENT – TEMPORAL META-FACT
GENERATION AND PROPAGATION IN
KNOWLEDGE GRAPHS
FATIHA SAÏS, JOANA E. GONZALES MALAVERRI,
AND GIANLUCA QUERCINI
LRI, CNRS & Paris Saclay University, France
ACM SAC-SWA, March 31h, 2020
1
KNOWLEDGE GRAPHS (KGS)
"Linking Open Data cloud diagram 2017, by Andrejs Abele, John
P. McCrae, Paul Buitelaar, Anja Jentzsch and Richard Cyganiak.
http://lod-cloud.net/" 2
facts
KNOWLEDGE GRAPHS (KGS)
"Linking Open Data cloud diagram 2017, by Andrejs Abele, John
P. McCrae, Paul Buitelaar, Anja Jentzsch and Richard Cyganiak.
http://lod-cloud.net/" 3
.
.
.
KNOWLEDGE GRAPHS (KGS)
4
5
2012
2013
2015 2016
2013
2007
2007
2007
2012
Academic side Commercial side
WHO IS DEVELOPING
KNOWLEDGE GRAPHS?
 KG Creation
 Information extraction from web pages (DBpedia, Yago),
collaborative (Wikidata), ...
 KG Enrichment and Expansion
 Content prediction, data fusion, data/knowledge linking, ...
 KG Validity
 Dealing with errors and ambiguities
 Fact validity
 Timeliness: truth of statements often changes with time:
 E.g., Currently, who is the US president?
KNOWLEDGE GRAPHS: SOME
KEY PROBLEMS
6
KGS – WHY TIME MATTERS?
 Time period during which a fact is valid: validity time.
 f1: <#Obama presidentOf US> is true in the temporal context [2008-2016]
 f2: <#Trump presidentOf US> is true in the temporal context [2016-now]
 Query answering
 Consistency checking
 Sort the sequence of facts.
7
PROBLEM AND PROPOSED APPROACH
Problem
 How can we capture fact validity time using only
the facts and the structure of the KGs?
8
PROBLEM AND PROPOSED APPROACH
Problem
 How can we capture fact validity time using only
the facts and the structure of the KGs?
Approach – Seed meta-fact generation (step 1)
9Knowledge Patterns
PROBLEM AND PROPOSED APPROACH
Problem
 How can we capture fact validity time using only
the facts and the structure of the KGs?
Approach – Seed meta-fact generation (step 1)
10Knowledge Patterns
PROBLEM AND PROPOSED APPROACH
Temporal meta-fact
Problem
 How can we capture fact validity time using only
the facts and the structure of the KGs?
Approach – Seed meta-fact generation (step 1)
11Knowledge Patterns
1. SEED META-FACT GENERATION
 Focus on predicates that may change over time:
 Brad Pitt acted in:
 the Fight Club in 1999
 the Curious Case of Benjamin Button in 2008
 Paul McCartney was/is married with:
 Heather Mills from 2002 to 2008
 Nancy Shevell since 2011
12
1. SEED META-FACT GENERATION
Algorithm: Query generation
 For each knowledge pattern, in the form of:
13
1. SEED META-FACT GENERATION
Algorithm: Query generation
 For each knowledge pattern, in the form of:
 Generate a SPARQL query (graph pattern = rule premise)
14
1. SEED META-FACT GENERATION
Algorithm: Meta-fact generation
 On the set of facts obtained from SPARQL queries, generate the
set of temporal meta-facts
15
1. SEED META-FACT GENERATION
Algorithm: Meta-fact generation
 On the set of facts obtained from SPARQL queries, generate the
set of temporal meta-facts
 For interval knowledge patterns: compute an intersection
between the temporal intervals associated to the subject and the
object of a fact.
16
1. SEED META-FACT GENERATION
Algorithm: Meta-fact generation
 On the set of facts obtained from SPARQL queries, generate the
set of temporal meta-facts
 For interval knowledge patterns: compute an intersection
between the temporal intervals associated to the subject and the
object of a fact.
Start-date
End-date 17
TEMPORAL META-FACT
EXPANSION
18
2. TEMPORAL META-FACT EXPANSION
 Use a set of Horn rules (generated by AMIE) to propagate existing
temporal meta-facts
19
2. TEMPORAL META-FACT EXPANSION
 Use a set of Horn rules (generated by AMIE) to propagate existing
temporal meta-facts
 Step 1: Rule selection and instantiation
 Keep only rule with PCA-confidence > theta
 Select those premises contain a predicate in the set of seed
meta-facts
 Rule instantiation on a set of facts and seed meta-facts
(generated or already existing in the KG)
20
2. TEMPORAL META-FACT EXPANSION
 Use a set of Horn rules (generated by AMIE) to propagate existing
temporal meta-facts
 Step 2: Temporal meta-fact propagation
 Case 1: only one meta-fact in the premise, then propagate the
date attached to the conclusion
 Case 2: if several meta-facts in the premise, then apply
temporal combination constraints
21
2. TEMPORAL COMBINATION
CONSTRAINTS
22
2. TEMPORAL COMBINATION
CONSTRAINTS
 Use of Allen’s interval relationships: during(TI, TI’), overlaps(TI, TI’),
meets(TI, TI’), …
 Apply one of the twelve combination constraints we defined, such as:
TI1 TI2
S(TI1) E(TI1)S(TI2) E(TI2)
TI3
S(TI3) E(TI3)

Inferred time interval
23
2. META-FACT CONFIDENCE
Assign for each inferred meta-fact a confidence value in [0;1], using:
 Quality scores of the used rule: Head-Coverage, PCA-confidence
 K1: #meta-facts associated with the atoms of the rule premise
 K2: the sum of the distances between the temporal meta-facts of
the premise
24
EXPERIMENTS
25
 Yago: contains information extracted from
Wikipedia infoboxes, WordNet, and GeoNames,
 > 10 million entities (persons, cities,
organizations),
 > 120 million facts about these entities
 Attaches temporal and spatial dimensions to
some facts and entities.
1. SEED META-FACT GENERATION:
EXPERIMENTS ON YAGO
Time-sensitive predicates
26
1. SEED META-FACT GENERATION:
EXPERIMENTS ON YAGO
Quantitative results of seed meta-fact generation
27
x 8479
x 228
x 100
x 6
x 67
1. SEED META-FACT GENERATION:
EXPERIMENTS ON YAGO
Qualitative evaluation on Movie case study
- Use of IMDb as a ground truth
- Comparaison with yago meta-facts IMDb - page
28
 Movie domain data in YAGO:
 # of films: 151427
 # of actors: 47800
 # of release dates available: 136234
1. SEED META-FACT GENERATION:
EXPERIMENTS ON YAGO
29
1. SEED META-FACT GENERATION:
EXPERIMENTS ON YAGO
 Evaluation of 5200 meta-facts of actedin predidcate
 Movie-title and actor-name equal IMDb title and one of the
IMDb actors’ name
 25% of release date in Yago are different with IMDb release
date: premiere date in IMDb and shown in a festival date in
yago
30
1. SEED META-FACT GENERATION:
EXPERIMENTS ON YAGO
 Evaluation of 5200 meta-facts of actedin predidcate
 Movie-title and actor-name equal IMDb title and one of the
IMDb actors’ name
 25% of release date in Yago are different with IMDb release
date: premiere date in IMDb and shown in a festival date in
yago
 Results:
 Reach of 75% of precision for timestamp meta-facts.
 Some meta-facts are not inferred, some films have no release
date in Yago.
 Some obtained interval meta-facts are too large: need to be
refined
31
2. TEMPORAL META-FACT
EXPANSION: EXPERIMENTS
 Dbpedia as a KG to be enriched with temporal meta-facts
 As seed meta-facts:
 Yago meta-facts: datasets A
 Seed meta-facts: dataset B
 Use of AMIE horn rules
 Wikidata as a ground truth (qualifiers)
 Use of Wikidata endpoint *
* https://query.wikidata.org/ 32
2. TEMPORAL META-FACT
EXPANSION: EXPERIMENTS
 Quantitive results on dataset A (Yago meta-facts)
33
2. TEMPORAL META-FACT
EXPANSION: EXPERIMENTS
 Quantitive results on dataset B (Seed meta-facts, step1)
34
2. TEMPORAL META-FACT
EXPANSION: EXPERIMENTS
Qualitative results: Considered timestamp meta-facts and full date
 Dataset A gets better results: better accuracy of seed meta-facts
 Dataset B reaches 0.67 of F-measure
35
CONCLUSION
 An approach for seed temporal meta-fact
generation for time-sensitive predicates
 A rule-based approach for temporal meta-fact
propagation
 A first quantitative and qualitative evaluation
36
FUTURE WORK
 Develop an approach to learn time-sensitive (evolving)
predicates.
 Improvements are needed for seed interval meta-facts
generation
 Fact validity assessment using temporal-meta facts
 Combine temporal and spatial reasoning:
 E.g.: Film release dates are associated to specific
locations (country, city).
37
ACKNOWLEDGMENTS
 Labex DigiCosme (Paris Sacaly): VASTE
Project
 CNRS: RegleX PEPS Project
38
MOMENT –TEMPORAL META-FACT
GENERATION AND PROPAGATION IN
KNOWLEDGE GRAPHS
FATIHA SAÏS, JOANA E. GONZALES MALAVERRI,
AND GIANLUCA QUERCINI
LRI, CNRS & Paris Saclay University, France
ACM SAC-SWA, March 31h, 2020
39

Más contenido relacionado

Similar a MOMENT: Temporal Meta-Fact Generation and Propagation in Knowledge Graphs

How it works- Data Science
How it works- Data ScienceHow it works- Data Science
How it works- Data ScienceEdureka!
 
A factorial study of neural network learning from differences for regression
A factorial study of neural network learning from  differences for regressionA factorial study of neural network learning from  differences for regression
A factorial study of neural network learning from differences for regressionMathieu d'Aquin
 
Data Mining Association Analysis Basic Concepts a
Data Mining Association Analysis Basic Concepts aData Mining Association Analysis Basic Concepts a
Data Mining Association Analysis Basic Concepts aOllieShoresna
 
Analytics of analytics pipelines: from optimising re-execution to general Dat...
Analytics of analytics pipelines:from optimising re-execution to general Dat...Analytics of analytics pipelines:from optimising re-execution to general Dat...
Analytics of analytics pipelines: from optimising re-execution to general Dat...Paolo Missier
 
Comparing Scalable Predictive Analysis using Spark XGBoost Platforms
Comparing Scalable Predictive Analysis using Spark XGBoost PlatformsComparing Scalable Predictive Analysis using Spark XGBoost Platforms
Comparing Scalable Predictive Analysis using Spark XGBoost PlatformsJongwook Woo
 
FDS Module I 20.1.2022.ppt
FDS Module I 20.1.2022.pptFDS Module I 20.1.2022.ppt
FDS Module I 20.1.2022.pptPerumalPitchandi
 
Bionimbus - Northwestern CGI Workshop 4-21-2011
Bionimbus - Northwestern CGI Workshop 4-21-2011Bionimbus - Northwestern CGI Workshop 4-21-2011
Bionimbus - Northwestern CGI Workshop 4-21-2011Robert Grossman
 
Data Provenance for Data Science
Data Provenance for Data ScienceData Provenance for Data Science
Data Provenance for Data SciencePaolo Missier
 
Overview of Data Mining
Overview of Data MiningOverview of Data Mining
Overview of Data MiningBowo Prasetyo
 
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...Paragon_Science_Inc
 
Aaas Data Intensive Science And Grid
Aaas Data Intensive Science And GridAaas Data Intensive Science And Grid
Aaas Data Intensive Science And GridIan Foster
 
Grid Projects In The US July 2008
Grid Projects In The US July 2008Grid Projects In The US July 2008
Grid Projects In The US July 2008Ian Foster
 
A framework and approaches to develop an in-house CAT with freeware and open ...
A framework and approaches to develop an in-house CAT with freeware and open ...A framework and approaches to develop an in-house CAT with freeware and open ...
A framework and approaches to develop an in-house CAT with freeware and open ...Tetsuo Kimura
 
Big Data Lessons from the Cloud
Big Data Lessons from the CloudBig Data Lessons from the Cloud
Big Data Lessons from the CloudMapR Technologies
 
AdClickFraud_Bigdata-Apic-Ist-2019
AdClickFraud_Bigdata-Apic-Ist-2019AdClickFraud_Bigdata-Apic-Ist-2019
AdClickFraud_Bigdata-Apic-Ist-2019Neha gupta
 
A Cooperative Coevolutionary Approach to Maximise Surveillance Coverage of UA...
A Cooperative Coevolutionary Approach to Maximise Surveillance Coverage of UA...A Cooperative Coevolutionary Approach to Maximise Surveillance Coverage of UA...
A Cooperative Coevolutionary Approach to Maximise Surveillance Coverage of UA...Daniel H. Stolfi
 
Mateo Valero - Big data: de la investigación científica a la gestión empresarial
Mateo Valero - Big data: de la investigación científica a la gestión empresarialMateo Valero - Big data: de la investigación científica a la gestión empresarial
Mateo Valero - Big data: de la investigación científica a la gestión empresarialFundación Ramón Areces
 
IRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
IRJET- Object Detection and Recognition using Single Shot Multi-Box DetectorIRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
IRJET- Object Detection and Recognition using Single Shot Multi-Box DetectorIRJET Journal
 

Similar a MOMENT: Temporal Meta-Fact Generation and Propagation in Knowledge Graphs (20)

10probs.ppt
10probs.ppt10probs.ppt
10probs.ppt
 
How it works- Data Science
How it works- Data ScienceHow it works- Data Science
How it works- Data Science
 
A factorial study of neural network learning from differences for regression
A factorial study of neural network learning from  differences for regressionA factorial study of neural network learning from  differences for regression
A factorial study of neural network learning from differences for regression
 
Data Mining Association Analysis Basic Concepts a
Data Mining Association Analysis Basic Concepts aData Mining Association Analysis Basic Concepts a
Data Mining Association Analysis Basic Concepts a
 
Analytics of analytics pipelines: from optimising re-execution to general Dat...
Analytics of analytics pipelines:from optimising re-execution to general Dat...Analytics of analytics pipelines:from optimising re-execution to general Dat...
Analytics of analytics pipelines: from optimising re-execution to general Dat...
 
Comparing Scalable Predictive Analysis using Spark XGBoost Platforms
Comparing Scalable Predictive Analysis using Spark XGBoost PlatformsComparing Scalable Predictive Analysis using Spark XGBoost Platforms
Comparing Scalable Predictive Analysis using Spark XGBoost Platforms
 
FDS Module I 20.1.2022.ppt
FDS Module I 20.1.2022.pptFDS Module I 20.1.2022.ppt
FDS Module I 20.1.2022.ppt
 
Bionimbus - Northwestern CGI Workshop 4-21-2011
Bionimbus - Northwestern CGI Workshop 4-21-2011Bionimbus - Northwestern CGI Workshop 4-21-2011
Bionimbus - Northwestern CGI Workshop 4-21-2011
 
Glasgow University Geo Metadata Workshop
Glasgow University Geo Metadata WorkshopGlasgow University Geo Metadata Workshop
Glasgow University Geo Metadata Workshop
 
Data Provenance for Data Science
Data Provenance for Data ScienceData Provenance for Data Science
Data Provenance for Data Science
 
Overview of Data Mining
Overview of Data MiningOverview of Data Mining
Overview of Data Mining
 
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
 
Aaas Data Intensive Science And Grid
Aaas Data Intensive Science And GridAaas Data Intensive Science And Grid
Aaas Data Intensive Science And Grid
 
Grid Projects In The US July 2008
Grid Projects In The US July 2008Grid Projects In The US July 2008
Grid Projects In The US July 2008
 
A framework and approaches to develop an in-house CAT with freeware and open ...
A framework and approaches to develop an in-house CAT with freeware and open ...A framework and approaches to develop an in-house CAT with freeware and open ...
A framework and approaches to develop an in-house CAT with freeware and open ...
 
Big Data Lessons from the Cloud
Big Data Lessons from the CloudBig Data Lessons from the Cloud
Big Data Lessons from the Cloud
 
AdClickFraud_Bigdata-Apic-Ist-2019
AdClickFraud_Bigdata-Apic-Ist-2019AdClickFraud_Bigdata-Apic-Ist-2019
AdClickFraud_Bigdata-Apic-Ist-2019
 
A Cooperative Coevolutionary Approach to Maximise Surveillance Coverage of UA...
A Cooperative Coevolutionary Approach to Maximise Surveillance Coverage of UA...A Cooperative Coevolutionary Approach to Maximise Surveillance Coverage of UA...
A Cooperative Coevolutionary Approach to Maximise Surveillance Coverage of UA...
 
Mateo Valero - Big data: de la investigación científica a la gestión empresarial
Mateo Valero - Big data: de la investigación científica a la gestión empresarialMateo Valero - Big data: de la investigación científica a la gestión empresarial
Mateo Valero - Big data: de la investigación científica a la gestión empresarial
 
IRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
IRJET- Object Detection and Recognition using Single Shot Multi-Box DetectorIRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
IRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
 

Último

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 

Último (20)

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 

MOMENT: Temporal Meta-Fact Generation and Propagation in Knowledge Graphs

  • 1. MOMENT – TEMPORAL META-FACT GENERATION AND PROPAGATION IN KNOWLEDGE GRAPHS FATIHA SAÏS, JOANA E. GONZALES MALAVERRI, AND GIANLUCA QUERCINI LRI, CNRS & Paris Saclay University, France ACM SAC-SWA, March 31h, 2020 1
  • 2. KNOWLEDGE GRAPHS (KGS) "Linking Open Data cloud diagram 2017, by Andrejs Abele, John P. McCrae, Paul Buitelaar, Anja Jentzsch and Richard Cyganiak. http://lod-cloud.net/" 2
  • 3. facts KNOWLEDGE GRAPHS (KGS) "Linking Open Data cloud diagram 2017, by Andrejs Abele, John P. McCrae, Paul Buitelaar, Anja Jentzsch and Richard Cyganiak. http://lod-cloud.net/" 3 . . .
  • 5. 5 2012 2013 2015 2016 2013 2007 2007 2007 2012 Academic side Commercial side WHO IS DEVELOPING KNOWLEDGE GRAPHS?
  • 6.  KG Creation  Information extraction from web pages (DBpedia, Yago), collaborative (Wikidata), ...  KG Enrichment and Expansion  Content prediction, data fusion, data/knowledge linking, ...  KG Validity  Dealing with errors and ambiguities  Fact validity  Timeliness: truth of statements often changes with time:  E.g., Currently, who is the US president? KNOWLEDGE GRAPHS: SOME KEY PROBLEMS 6
  • 7. KGS – WHY TIME MATTERS?  Time period during which a fact is valid: validity time.  f1: <#Obama presidentOf US> is true in the temporal context [2008-2016]  f2: <#Trump presidentOf US> is true in the temporal context [2016-now]  Query answering  Consistency checking  Sort the sequence of facts. 7
  • 8. PROBLEM AND PROPOSED APPROACH Problem  How can we capture fact validity time using only the facts and the structure of the KGs? 8
  • 9. PROBLEM AND PROPOSED APPROACH Problem  How can we capture fact validity time using only the facts and the structure of the KGs? Approach – Seed meta-fact generation (step 1) 9Knowledge Patterns
  • 10. PROBLEM AND PROPOSED APPROACH Problem  How can we capture fact validity time using only the facts and the structure of the KGs? Approach – Seed meta-fact generation (step 1) 10Knowledge Patterns
  • 11. PROBLEM AND PROPOSED APPROACH Temporal meta-fact Problem  How can we capture fact validity time using only the facts and the structure of the KGs? Approach – Seed meta-fact generation (step 1) 11Knowledge Patterns
  • 12. 1. SEED META-FACT GENERATION  Focus on predicates that may change over time:  Brad Pitt acted in:  the Fight Club in 1999  the Curious Case of Benjamin Button in 2008  Paul McCartney was/is married with:  Heather Mills from 2002 to 2008  Nancy Shevell since 2011 12
  • 13. 1. SEED META-FACT GENERATION Algorithm: Query generation  For each knowledge pattern, in the form of: 13
  • 14. 1. SEED META-FACT GENERATION Algorithm: Query generation  For each knowledge pattern, in the form of:  Generate a SPARQL query (graph pattern = rule premise) 14
  • 15. 1. SEED META-FACT GENERATION Algorithm: Meta-fact generation  On the set of facts obtained from SPARQL queries, generate the set of temporal meta-facts 15
  • 16. 1. SEED META-FACT GENERATION Algorithm: Meta-fact generation  On the set of facts obtained from SPARQL queries, generate the set of temporal meta-facts  For interval knowledge patterns: compute an intersection between the temporal intervals associated to the subject and the object of a fact. 16
  • 17. 1. SEED META-FACT GENERATION Algorithm: Meta-fact generation  On the set of facts obtained from SPARQL queries, generate the set of temporal meta-facts  For interval knowledge patterns: compute an intersection between the temporal intervals associated to the subject and the object of a fact. Start-date End-date 17
  • 19. 2. TEMPORAL META-FACT EXPANSION  Use a set of Horn rules (generated by AMIE) to propagate existing temporal meta-facts 19
  • 20. 2. TEMPORAL META-FACT EXPANSION  Use a set of Horn rules (generated by AMIE) to propagate existing temporal meta-facts  Step 1: Rule selection and instantiation  Keep only rule with PCA-confidence > theta  Select those premises contain a predicate in the set of seed meta-facts  Rule instantiation on a set of facts and seed meta-facts (generated or already existing in the KG) 20
  • 21. 2. TEMPORAL META-FACT EXPANSION  Use a set of Horn rules (generated by AMIE) to propagate existing temporal meta-facts  Step 2: Temporal meta-fact propagation  Case 1: only one meta-fact in the premise, then propagate the date attached to the conclusion  Case 2: if several meta-facts in the premise, then apply temporal combination constraints 21
  • 23. 2. TEMPORAL COMBINATION CONSTRAINTS  Use of Allen’s interval relationships: during(TI, TI’), overlaps(TI, TI’), meets(TI, TI’), …  Apply one of the twelve combination constraints we defined, such as: TI1 TI2 S(TI1) E(TI1)S(TI2) E(TI2) TI3 S(TI3) E(TI3)  Inferred time interval 23
  • 24. 2. META-FACT CONFIDENCE Assign for each inferred meta-fact a confidence value in [0;1], using:  Quality scores of the used rule: Head-Coverage, PCA-confidence  K1: #meta-facts associated with the atoms of the rule premise  K2: the sum of the distances between the temporal meta-facts of the premise 24
  • 26.  Yago: contains information extracted from Wikipedia infoboxes, WordNet, and GeoNames,  > 10 million entities (persons, cities, organizations),  > 120 million facts about these entities  Attaches temporal and spatial dimensions to some facts and entities. 1. SEED META-FACT GENERATION: EXPERIMENTS ON YAGO Time-sensitive predicates 26
  • 27. 1. SEED META-FACT GENERATION: EXPERIMENTS ON YAGO Quantitative results of seed meta-fact generation 27 x 8479 x 228 x 100 x 6 x 67
  • 28. 1. SEED META-FACT GENERATION: EXPERIMENTS ON YAGO Qualitative evaluation on Movie case study - Use of IMDb as a ground truth - Comparaison with yago meta-facts IMDb - page 28
  • 29.  Movie domain data in YAGO:  # of films: 151427  # of actors: 47800  # of release dates available: 136234 1. SEED META-FACT GENERATION: EXPERIMENTS ON YAGO 29
  • 30. 1. SEED META-FACT GENERATION: EXPERIMENTS ON YAGO  Evaluation of 5200 meta-facts of actedin predidcate  Movie-title and actor-name equal IMDb title and one of the IMDb actors’ name  25% of release date in Yago are different with IMDb release date: premiere date in IMDb and shown in a festival date in yago 30
  • 31. 1. SEED META-FACT GENERATION: EXPERIMENTS ON YAGO  Evaluation of 5200 meta-facts of actedin predidcate  Movie-title and actor-name equal IMDb title and one of the IMDb actors’ name  25% of release date in Yago are different with IMDb release date: premiere date in IMDb and shown in a festival date in yago  Results:  Reach of 75% of precision for timestamp meta-facts.  Some meta-facts are not inferred, some films have no release date in Yago.  Some obtained interval meta-facts are too large: need to be refined 31
  • 32. 2. TEMPORAL META-FACT EXPANSION: EXPERIMENTS  Dbpedia as a KG to be enriched with temporal meta-facts  As seed meta-facts:  Yago meta-facts: datasets A  Seed meta-facts: dataset B  Use of AMIE horn rules  Wikidata as a ground truth (qualifiers)  Use of Wikidata endpoint * * https://query.wikidata.org/ 32
  • 33. 2. TEMPORAL META-FACT EXPANSION: EXPERIMENTS  Quantitive results on dataset A (Yago meta-facts) 33
  • 34. 2. TEMPORAL META-FACT EXPANSION: EXPERIMENTS  Quantitive results on dataset B (Seed meta-facts, step1) 34
  • 35. 2. TEMPORAL META-FACT EXPANSION: EXPERIMENTS Qualitative results: Considered timestamp meta-facts and full date  Dataset A gets better results: better accuracy of seed meta-facts  Dataset B reaches 0.67 of F-measure 35
  • 36. CONCLUSION  An approach for seed temporal meta-fact generation for time-sensitive predicates  A rule-based approach for temporal meta-fact propagation  A first quantitative and qualitative evaluation 36
  • 37. FUTURE WORK  Develop an approach to learn time-sensitive (evolving) predicates.  Improvements are needed for seed interval meta-facts generation  Fact validity assessment using temporal-meta facts  Combine temporal and spatial reasoning:  E.g.: Film release dates are associated to specific locations (country, city). 37
  • 38. ACKNOWLEDGMENTS  Labex DigiCosme (Paris Sacaly): VASTE Project  CNRS: RegleX PEPS Project 38
  • 39. MOMENT –TEMPORAL META-FACT GENERATION AND PROPAGATION IN KNOWLEDGE GRAPHS FATIHA SAÏS, JOANA E. GONZALES MALAVERRI, AND GIANLUCA QUERCINI LRI, CNRS & Paris Saclay University, France ACM SAC-SWA, March 31h, 2020 39

Notas del editor

  1. The KGs are published