Creating Knowledge out of Interlinked Data

DBpedia Community Meeting
DBpedia Internationalization+

30/01/2014 Amsterdam

Dimitris Kontokostas
DBpedia is a community project, please see http://dbpedia.org for a list of contributors
LOD2 Presentation . 02.09.2010 . Page

AKSW, Universität Leipzig

http://lod2.eu
Structure in Wikipedia

• Title
• Abstract
• Infoboxes
• Geo-coordinates
• Categories
• Images
• Links
• other language versions
• other Wikipedia pages
• To the Web
• Redirects
• Disambiguations
Infobox Templates
Wikitext-Syntax
{{Infobox Korean settlement
| title = Busan Metropolitan City
| img = Busan.jpg
| imgcaption = A view of the [[Geumjeong]] district in Busan
| hangul = 부산 광역시
...
| area_km2 = 763.46
| pop = 3635389
| popyear = 2006
| mayor = Hur Nam-sik
| divs = 15 wards (Gu), 1 county (Gun)
| region = [[Yeongnam]]
| dialect = [[Gyeongsang]]
}}
RDF representation
dbp:Busan dbp:title "Busan Metropolitan City"
dbp:Busan dbp:hangul "부산 광역시"@Hang
dbp:Busan dbp:area_km2 "763.46"^^xsd:float
dbp:Busan dbp:pop "3635389"^^xsd:int
dbp:Busan dbp:region dbp:Yeongnam
dbp:Busan dbp:dialect dbp:Gyeongsang
...
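
The step from infobox wikitext to raw RDF triples can be sketched roughly as follows. This is a minimal illustration and not the DBpedia extraction framework itself: the function name `infobox_to_triples`, the regular expressions, and the typing rules (integer, float, single wiki link, otherwise plain string) are assumptions made for the example.

```python
import re

# Assumed patterns for this sketch: one "| key = value" parameter per line,
# and a value that consists of exactly one [[wiki link]].
PARAM = re.compile(r"^\|\s*([^=]+?)\s*=\s*(.*?)\s*$")
WIKILINK = re.compile(r"^\[\[([^\]|]+)(?:\|[^\]]*)?\]\]$")


def infobox_to_triples(subject, wikitext):
    """Turn `| key = value` infobox lines into (s, p, o) string triples."""
    triples = []
    for line in wikitext.splitlines():
        match = PARAM.match(line.strip())
        if not match:
            continue  # template name, closing braces, "..." lines
        key, value = match.groups()
        if not value:
            continue  # empty parameters carry no information
        link = WIKILINK.match(value)
        if link:
            # A value that is exactly one wiki link becomes a resource.
            obj = "dbp:" + link.group(1).strip().replace(" ", "_")
        elif re.fullmatch(r"-?\d+", value):
            obj = f'"{value}"^^xsd:int'
        elif re.fullmatch(r"-?\d+\.\d+", value):
            obj = f'"{value}"^^xsd:float'
        else:
            obj = f'"{value}"'
        triples.append((subject, "dbp:" + key, obj))
    return triples


example = """{{Infobox Korean settlement
| title = Busan Metropolitan City
| area_km2 = 763.46
| pop = 3635389
| region = [[Yeongnam]]
}}"""

for triple in infobox_to_triples("dbp:Busan", example):
    print(" ".join(triple))
```

Because the property name is copied straight from the template parameter, every spelling variant of a parameter ends up as a different property, which is the problem the mappings address.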
A closer look at infoboxes

KAIST – LOD2 16.8..2011 . Page

Björk (Musician)
Occupation = Musician, Actor
Born = 21.12.1965, Reykjavík

Brown (Prime Minister)
office = Prime Minister of the UK
birth_date = 20.4.1951
birth_place = Govan

Romero (Actor)
occupation = Actor, Editor
birthdate = 4.2.1940
birthplace = New York


DBpedia – Collaborative Ontology Engineering

• Mappings Wiki
• http://mappings.dbpedia.org/
• Everybody can contribute new mappings or improve existing ones
• ~170 editors

• Correct Semantics:
• Combine what belongs together (birth_place, birthplace)
• Separate what is different (bornIn, birthplace)
• Big boost for Precision
• Recall is crowdsourced - Help us! :)
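
The idea of combining what belongs together can be illustrated with a small lookup table. This is a hedged sketch: real mappings are written as template mappings on the Mappings Wiki, not as Python dictionaries, and the table `PROPERTY_MAPPINGS` and the function `map_infobox` are hypothetical names introduced only for this example. The sample values are the ones from the Björk, Brown and Romero infoboxes.

```python
# Hypothetical mapping table: raw infobox keys -> one ontology property.
PROPERTY_MAPPINGS = {
    "birth_place": "dbo:birthPlace",
    "birthplace": "dbo:birthPlace",
    "birth_date": "dbo:birthDate",
    "birthdate": "dbo:birthDate",
    "occupation": "dbo:occupation",
}


def map_infobox(raw):
    """Return (mapped, unmapped) for a dict of raw infobox key/value pairs."""
    mapped, unmapped = {}, {}
    for key, value in raw.items():
        prop = PROPERTY_MAPPINGS.get(key.strip().lower())
        if prop is None:
            unmapped[key] = value  # no mapping yet: this is where recall is lost
        else:
            mapped[prop] = value
    return mapped, unmapped


brown = {"office": "Prime Minister of the UK",
         "birth_date": "20.4.1951", "birth_place": "Govan"}
romero = {"occupation": "Actor, Editor",
          "birthdate": "4.2.1940", "birthplace": "New York"}
bjork = {"Occupation": "Musician, Actor", "Born": "21.12.1965, Reykjavík"}

for name, infobox in [("Brown", brown), ("Romero", romero), ("Björk", bjork)]:
    mapped, unmapped = map_infobox(infobox)
    print(name, mapped, "unmapped:", sorted(unmapped))
```

In this sketch `birth_place` and `birthplace` land on the same property, while `Born` and `office` stay unmapped until someone contributes a mapping for them.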

DBpedia Community Meeting / Amsterdam 30.01.2014


DBpedia – Internationalization

DBpedia Internationalization:
an effort to port DBpedia to local (non-English) Wikipedias


DBpedia Internationalization (I18n)

• Local Wikipedias provide more information on local resources, e.g.:
• The French article on the Eiffel Tower is better than the English one
• Articles of local importance might not exist in English

• Multilingual extraction was limited to basic page structure
• Labels, categories, links, raw infobox extraction

• DBpedia I18n started in 2009 with German and Korean (PHP).
• Extended in 2010 with Greek (Scala); many languages followed.
• The default multilingual extraction is now enhanced, and some languages are tailored for even better extraction.
• Extractor tweaking / mapping definitions


DBpedia I18n – Overview

• DBpedia 3.7 (08/2011) was the first to introduce internationalized datasets
• Language-based namespaces (http://{lang}.dbpedia.org)
• Most are not dereferenceable (except local chapters)

• DBpedia 3.9 (08/2013) provides data in 191 languages
• Mappings are enabled for 28 languages (24 active)
• 15 local DBpedia chapters http://dbpedia.org/Internationalization/Chapters
• 12 from Europe, Indonesian, Japanese & Korean
• Provide dereferenceable URIs / IRIs
• Maintain their own domain and community
• Mappings coordination
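
A language-based namespace can be sketched as a small helper that builds resource identifiers. The function names below are hypothetical, and two things go beyond the slide and are stated here as assumptions: resources live under a `/resource/` path with spaces replaced by underscores, and the English edition uses `dbpedia.org` without a language prefix. The URI variant percent-encodes non-ASCII characters with the standard `urllib.parse.quote`.

```python
from urllib.parse import quote


def _base(lang):
    """Namespace for one language edition (assumed /resource/ layout)."""
    host = "dbpedia.org" if lang == "en" else f"{lang}.dbpedia.org"
    return f"http://{host}/resource/"


def _local_name(title):
    """Wikipedia-style local name: trim and replace spaces by underscores."""
    return title.strip().replace(" ", "_")


def resource_iri(lang, title):
    """IRI form: non-ASCII characters are kept as they are."""
    return _base(lang) + _local_name(title)


def resource_uri(lang, title):
    """URI form: non-ASCII characters are percent-encoded as UTF-8."""
    return _base(lang) + quote(_local_name(title))


print(resource_iri("nl", "Amsterdam"))
print(resource_iri("el", "Αθήνα"))
print(resource_uri("el", "Αθήνα"))
```

Whether a chapter publishes IRIs, URIs, or both is up to that chapter; the sketch only shows how the two forms relate.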


DBpedia I18n – Mapping Activity


DBpedia I18n – Mapping Statistics (2013.02)


DBpedia I18n – Mapping Statistics (v3.8)

To keep our Dutch audience happy :)
v3.9 (NL): 211.927 People, 861.633 Places, 16.733 Organizations, 92.314 Works


DBpedia I18n

Further information:
● http://wiki.dbpedia.org/Internationalization
● Jens Lehmann, Robert Isele, Max Jakob, Anja Jentzsch, Dimitris Kontokostas, Pablo N. Mendes, Sebastian Hellmann,
Mohamed Morsey, Patrick van Kleef, Sören Auer, Christian Bizer. DBpedia – A Large-scale, Multilingual Knowledge
Base Extracted from Wikipedia. To appear in the Semantic Web Journal.
● Dimitris Kontokostas, Charalampos Bratsas, Sören Auer, Sebastian Hellmann, Ioannis Antoniou, George Metakides,
Internationalization of Linked Data: The case of the Greek DBpedia edition, Web Semantics: Science, Services and
Agents on the World Wide Web, Volume 15, September 2012, Pages 51–61, ISSN 1570–8268,
10.1016/j.websem.2012.01.001.


DBpedia @ GSoC

5 successful GSoC DBpedia projects this year:
http://wiki.dbpedia.org/gsoc2013
● Type inference based on categories (Kasun Perera)
  ● Available at https://github.com/dbpedia/dbpedia-links
● New interactive DBpedia interface (Denis Lukovnikov)
  ● Available at http://live.dbpedia.org
● Wikidata integration (Hady Elsahar)
  ● Live Wikidata2DBpedia endpoint (2014)
● Power tool for DBpedia testing metadata (Lazaros Ioannidis)
  ● Using Databugger output: http://databugger.aksw.org
● Input format generalization for DBpedia Spotlight

Do you know any students for DBpedia @ GSoC 2014?

Quality @ DBpedia (soon)

Databugger + GSoC Power tool
SPARQL quality queries
(more than one birth date)
SELECT ?s WHERE {
  ?s dbo:birthDate ?d .
} GROUP BY ?s
HAVING (COUNT(?d) > 1)
=>
dbr:Phil_Cuzzi => Wikipedia error
dbr:Ivan_Cattaneo
dbr:Vijay_Ghate
dbr:William_Tempest
dbr:Cliff_Speegle
dbr:Arnold,_Duke_of_Guelders
dbr:Schuyler_Grant
dbr:Vlas_Chubar
dbr:Adrian_Peterson
...
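
The same check can be sketched outside a SPARQL endpoint, for example over triples already loaded into memory. This is an illustration under stated assumptions, not how Databugger works internally: the function name `subjects_with_multiple_values` is hypothetical and the sample triples use made-up `ex:` resources and dates. Values are collected into a set per subject, which matches the query because an RDF graph holds each triple at most once.

```python
from collections import defaultdict


def subjects_with_multiple_values(triples, predicate):
    """Return subjects that have more than one value for `predicate`."""
    values = defaultdict(set)
    for s, p, o in triples:
        if p == predicate:
            values[s].add(o)
    return sorted(s for s, objs in values.items() if len(objs) > 1)


# Made-up sample data, for illustration only.
sample = [
    ("ex:Person_A", "dbo:birthDate", "1951-04-20"),
    ("ex:Person_A", "dbo:birthDate", "1951-04-21"),
    ("ex:Person_B", "dbo:birthDate", "1940-02-04"),
    ("ex:Person_A", "dbo:birthPlace", "ex:Some_Town"),
]

print(subjects_with_multiple_values(sample, "dbo:birthDate"))
```

A hit does not say which value is wrong, or whether the error comes from the extraction or from Wikipedia itself; it only flags the resource for review.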


Thank you for your attention!


More Related Content

Similar to DBpedia i18n - Amsterdam Meeting (30/01/2014)

Linking Spatial Data From The Web
Linking Spatial Data From The WebLinking Spatial Data From The Web
Linking Spatial Data From The Webchristianhbecker
 
Wikidata Conference 2019 GLAM Panel - 20191025
Wikidata Conference 2019 GLAM Panel - 20191025Wikidata Conference 2019 GLAM Panel - 20191025
Wikidata Conference 2019 GLAM Panel - 20191025Beat Estermann
 
From Open Linked Data towards an Ecosystem of Interlinked Knowledge
From Open Linked Data towards an Ecosystem of Interlinked KnowledgeFrom Open Linked Data towards an Ecosystem of Interlinked Knowledge
From Open Linked Data towards an Ecosystem of Interlinked KnowledgeSören Auer
 
Wikipedia as source of collaboratively created Knowledge Organization Systems
Wikipedia as source of collaboratively created Knowledge Organization SystemsWikipedia as source of collaboratively created Knowledge Organization Systems
Wikipedia as source of collaboratively created Knowledge Organization SystemsJakob .
 
I Linked Open Data nei Beni Culturali, alcuni progetti e casi di studio
I Linked Open Data nei Beni Culturali, alcuni progetti e casi di studioI Linked Open Data nei Beni Culturali, alcuni progetti e casi di studio
I Linked Open Data nei Beni Culturali, alcuni progetti e casi di studioCulturaItalia
 
Nuts4nuts: geospatial information from Wikipedia (ECSS 2014)
Nuts4nuts: geospatial information from Wikipedia (ECSS 2014)Nuts4nuts: geospatial information from Wikipedia (ECSS 2014)
Nuts4nuts: geospatial information from Wikipedia (ECSS 2014)Cristian Consonni
 
DBpedia talk at Fjord Berlin
DBpedia talk at Fjord BerlinDBpedia talk at Fjord Berlin
DBpedia talk at Fjord BerlinGeorgi Kobilarov
 
Wikipedia as Knowledge Organization System
Wikipedia as Knowledge Organization SystemWikipedia as Knowledge Organization System
Wikipedia as Knowledge Organization SystemJakob .
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedSören Auer
 
Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021Antoine Isaac
 
Linked data and semantic wikis
Linked data and semantic wikisLinked data and semantic wikis
Linked data and semantic wikisSören Auer
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataSören Auer
 
Europeana Creative. EDM Endpoint. Custom Views
Europeana Creative. EDM Endpoint. Custom ViewsEuropeana Creative. EDM Endpoint. Custom Views
Europeana Creative. EDM Endpoint. Custom ViewsVladimir Alexiev, PhD, PMP
 

Similar to DBpedia i18n - Amsterdam Meeting (30/01/2014) (20)

Linking Spatial Data From The Web
Linking Spatial Data From The WebLinking Spatial Data From The Web
Linking Spatial Data From The Web
 
Wikidata Conference 2019 GLAM Panel - 20191025
Wikidata Conference 2019 GLAM Panel - 20191025Wikidata Conference 2019 GLAM Panel - 20191025
Wikidata Conference 2019 GLAM Panel - 20191025
 
From Open Linked Data towards an Ecosystem of Interlinked Knowledge
From Open Linked Data towards an Ecosystem of Interlinked KnowledgeFrom Open Linked Data towards an Ecosystem of Interlinked Knowledge
From Open Linked Data towards an Ecosystem of Interlinked Knowledge
 
The Semantic Data Web, Sören Auer, University of Leipzig
The Semantic Data Web, Sören Auer, University of LeipzigThe Semantic Data Web, Sören Auer, University of Leipzig
The Semantic Data Web, Sören Auer, University of Leipzig
 
Wikipedia as source of collaboratively created Knowledge Organization Systems
Wikipedia as source of collaboratively created Knowledge Organization SystemsWikipedia as source of collaboratively created Knowledge Organization Systems
Wikipedia as source of collaboratively created Knowledge Organization Systems
 
I Linked Open Data nei Beni Culturali, alcuni progetti e casi di studio
I Linked Open Data nei Beni Culturali, alcuni progetti e casi di studioI Linked Open Data nei Beni Culturali, alcuni progetti e casi di studio
I Linked Open Data nei Beni Culturali, alcuni progetti e casi di studio
 
Nuts4nuts: geospatial information from Wikipedia (ECSS 2014)
Nuts4nuts: geospatial information from Wikipedia (ECSS 2014)Nuts4nuts: geospatial information from Wikipedia (ECSS 2014)
Nuts4nuts: geospatial information from Wikipedia (ECSS 2014)
 
LOD2 Webinar Series: CubeViz
LOD2 Webinar Series: CubeViz LOD2 Webinar Series: CubeViz
LOD2 Webinar Series: CubeViz
 
Irish Digital Libraries Summit
Irish Digital Libraries SummitIrish Digital Libraries Summit
Irish Digital Libraries Summit
 
LOD2 Webinar Series FOX
LOD2 Webinar Series FOXLOD2 Webinar Series FOX
LOD2 Webinar Series FOX
 
Semantic Technologies for Cultural Heritage
Semantic Technologies for Cultural HeritageSemantic Technologies for Cultural Heritage
Semantic Technologies for Cultural Heritage
 
DBpedia talk at Fjord Berlin
DBpedia talk at Fjord BerlinDBpedia talk at Fjord Berlin
DBpedia talk at Fjord Berlin
 
Wikipedia as Knowledge Organization System
Wikipedia as Knowledge Organization SystemWikipedia as Knowledge Organization System
Wikipedia as Knowledge Organization System
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge stripped
 
Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021
 
Linked data and semantic wikis
Linked data and semantic wikisLinked data and semantic wikis
Linked data and semantic wikis
 
Resources, resources, resources: the three rs of the Web
Resources, resources, resources: the three rs of the WebResources, resources, resources: the three rs of the Web
Resources, resources, resources: the three rs of the Web
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
 
Europeana Creative. EDM Endpoint. Custom Views
Europeana Creative. EDM Endpoint. Custom ViewsEuropeana Creative. EDM Endpoint. Custom Views
Europeana Creative. EDM Endpoint. Custom Views
 
Linked Open Data stuff
Linked Open Data stuffLinked Open Data stuff
Linked Open Data stuff
 

More from Dimitris Kontokostas

Data quality assessment - connecting the pieces...
Data quality assessment - connecting the pieces...Data quality assessment - connecting the pieces...
Data quality assessment - connecting the pieces...Dimitris Kontokostas
 
Graph databases & data integration v2
Graph databases & data integration v2Graph databases & data integration v2
Graph databases & data integration v2Dimitris Kontokostas
 
PhD thesis defense: Large-scale multilingual knowledge extraction, publishin...
PhD thesis defense:  Large-scale multilingual knowledge extraction, publishin...PhD thesis defense:  Large-scale multilingual knowledge extraction, publishin...
PhD thesis defense: Large-scale multilingual knowledge extraction, publishin...Dimitris Kontokostas
 
8th DBpedia meeting / California 2016
8th DBpedia meeting /  California 20168th DBpedia meeting /  California 2016
8th DBpedia meeting / California 2016Dimitris Kontokostas
 
Semantically enhanced quality assurance in the jurion business use case
Semantically enhanced quality assurance in the jurion  business use caseSemantically enhanced quality assurance in the jurion  business use case
Semantically enhanced quality assurance in the jurion business use caseDimitris Kontokostas
 
Graph databases & data integration - the case of RDF
Graph databases & data integration - the case of RDFGraph databases & data integration - the case of RDF
Graph databases & data integration - the case of RDFDimitris Kontokostas
 
DBpedia+ / DBpedia meeting in Dublin
DBpedia+ / DBpedia meeting in DublinDBpedia+ / DBpedia meeting in Dublin
DBpedia+ / DBpedia meeting in DublinDimitris Kontokostas
 
NLP Data Cleansing Based on Linguistic Ontology Constraints
NLP Data Cleansing Based on Linguistic Ontology ConstraintsNLP Data Cleansing Based on Linguistic Ontology Constraints
NLP Data Cleansing Based on Linguistic Ontology ConstraintsDimitris Kontokostas
 
RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)
RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)
RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)Dimitris Kontokostas
 

More from Dimitris Kontokostas (14)

Introduction to apache kafka
Introduction to apache kafkaIntroduction to apache kafka
Introduction to apache kafka
 
Data quality assessment - connecting the pieces...
Data quality assessment - connecting the pieces...Data quality assessment - connecting the pieces...
Data quality assessment - connecting the pieces...
 
Graph databases & data integration v2
Graph databases & data integration v2Graph databases & data integration v2
Graph databases & data integration v2
 
PhD thesis defense: Large-scale multilingual knowledge extraction, publishin...
PhD thesis defense:  Large-scale multilingual knowledge extraction, publishin...PhD thesis defense:  Large-scale multilingual knowledge extraction, publishin...
PhD thesis defense: Large-scale multilingual knowledge extraction, publishin...
 
Data quality in Real Estate
Data quality in Real EstateData quality in Real Estate
Data quality in Real Estate
 
8th DBpedia meeting / California 2016
8th DBpedia meeting /  California 20168th DBpedia meeting /  California 2016
8th DBpedia meeting / California 2016
 
Semantically enhanced quality assurance in the jurion business use case
Semantically enhanced quality assurance in the jurion  business use caseSemantically enhanced quality assurance in the jurion  business use case
Semantically enhanced quality assurance in the jurion business use case
 
Graph databases & data integration - the case of RDF
Graph databases & data integration - the case of RDFGraph databases & data integration - the case of RDF
Graph databases & data integration - the case of RDF
 
DBpedia past, present & future
DBpedia past, present & futureDBpedia past, present & future
DBpedia past, present & future
 
DBpedia+ / DBpedia meeting in Dublin
DBpedia+ / DBpedia meeting in DublinDBpedia+ / DBpedia meeting in Dublin
DBpedia+ / DBpedia meeting in Dublin
 
DBpedia ♥ Commons
DBpedia ♥ CommonsDBpedia ♥ Commons
DBpedia ♥ Commons
 
NLP Data Cleansing Based on Linguistic Ontology Constraints
NLP Data Cleansing Based on Linguistic Ontology ConstraintsNLP Data Cleansing Based on Linguistic Ontology Constraints
NLP Data Cleansing Based on Linguistic Ontology Constraints
 
RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)
RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)
RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)
 
DBpedia Viewer - LDOW 2014
DBpedia Viewer - LDOW 2014DBpedia Viewer - LDOW 2014
DBpedia Viewer - LDOW 2014
 

Recently uploaded

Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 

Recently uploaded (20)

Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 

DBpedia i18n - Amsterdam Meeting (30/01/2014)

  • 1. Creating Knowledge out of Interlinked Data DBpedia Community Meeting DBpedia Internationalization+ 30/01/2014 Amsterdam Dimitris Kontokostas DBpedia is a community project, please see http://dbpedia.org for a List of contributors LOD2 Presentation . 02.09.2010 . Page AKSW, Universität Leipzig http://lod2.eu
  • 9. Infobox Templates Wikitext-Syntax {{Infobox Korean settlement | title = Busan Metropolitan City | img = Busan.jpg | imgcaption = A view of the [[Geumjeong]] district in Busan | hangul = 부산 광역시 ... | area_km2 = 763.46 | pop = 3635389 | popyear = 2006 | mayor = Hur Nam-sik | divs = 15 wards (Gu), 1 county (Gun) | region = [[Yeongnam]] | dialect = [[Gyeongsang]] }} RDF representation dbp:Busan dbp:Busan dbp:Busan dbp:Busan dbp:Busan dbp:Busan ... dbp:title dbp:hangul dbp:area_km2 dbp:pop dbp:region dbp:dialect ″Busan Metropolitan City″ ″ 부산 광역시″ @Hang ″763.46“^xsd:float ″3635389“^xsd:int dbp:Yeongnam dbp:Gyeongsang
  • 10. Creating Knowledge out of Interlinked Data A closer look at infoboxes KAIST – LOD2 16.8..2011 . Page 10 http://lod2.eu
  • 11. Creating Knowledge out of Interlinked Data A closer look at infoboxes KAIST – LOD2 16.8..2011 . Page 11 http://lod2.eu
  • 12. Creating Knowledge out of Interlinked Data A closer look at infoboxes KAIST – LOD2 16.8..2011 . Page 12 http://lod2.eu
  • 13–15.
      Björk (Musician)
        Occupation = Musician, Actor
        Born = 21.12.1965, Reykjavík
      Brown (Prime Minister)
        office = Prime Minister of the UK
        birth_date = 20.4.1951
        birth_place = Govan
      Romero (Actor)
        occupation = Actor, Editor
        birthdate = 4.2.1940
        birthplace = New York
  • 16. DBpedia – Collaborative Ontology Engineering
      • Mappings Wiki: http://mappings.dbpedia.org/
      • Everybody can contribute new mappings or improve existing ones
      • ~170 editors
      • Correct semantics:
          • Combine what belongs together (birth_place, birthplace)
          • Separate what is different (bornIn, birthplace)
      • Big boost for precision
      • Recall is crowdsourced. Help us! :)
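The idea of "combining what belongs together" can be sketched in Python as a lookup from raw infobox keys to one ontology property. This is only an illustration under assumptions: the `PROPERTY_MAPPING` table and the `normalize` function are hypothetical, and the real mappings are maintained on the Mappings Wiki rather than in a dictionary like this one.

```python
# Hypothetical mapping table: raw infobox key -> ontology property.
# The entries are illustrative; actual mappings are edited on the Mappings Wiki.
PROPERTY_MAPPING = {
    "birth_place": "dbo:birthPlace",
    "birthplace": "dbo:birthPlace",
    "birth_date": "dbo:birthDate",
    "birthdate": "dbo:birthDate",
    "occupation": "dbo:occupation",
}

def normalize(raw_attrs):
    """Split raw attributes into mapped properties and keys without a mapping."""
    mapped, unmapped = {}, []
    for key, value in raw_attrs.items():
        prop = PROPERTY_MAPPING.get(key.strip().lower())
        if prop is None:
            unmapped.append(key)  # a gap that a new mapping could close
        else:
            mapped.setdefault(prop, []).append(value)
    return mapped, unmapped

# The three infoboxes from the earlier slides.
bjork = {"Occupation": "Musician, Actor", "Born": "21.12.1965, Reykjavík"}
brown = {"office": "Prime Minister of the UK",
         "birth_date": "20.4.1951", "birth_place": "Govan"}
romero = {"occupation": "Actor, Editor",
          "birthdate": "4.2.1940", "birthplace": "New York"}

for label, infobox in (("Björk", bjork), ("Brown", brown), ("Romero", romero)):
    print(label, normalize(infobox))
```

In this sketch `birth_place` and `birthplace` end up under the same property, while keys such as `Born` and `office` stay unmapped. That matches the slide's point: mapped keys raise precision, and recall grows only as contributors add mappings.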
  • 17. DBpedia – Collaborative Ontology Engineering
  • 18. DBpedia – Internationalization: the effort to port DBpedia to local (non-English) Wikipedias
  • 19. DBpedia Internationalization (I18n)
      • Local Wikipedias provide more information on local resources, e.g.:
          • The French article on the Eiffel Tower is better than the English one
          • Articles of local importance might not exist in English
      • Multilingual extraction was limited to basic page structure:
          • Labels, categories, links, raw infobox extraction
      • DBpedia I18n started in 2009 with German and Korean (PHP).
      • It was extended in 2010 with Greek (Scala), and many languages followed.
      • The default multilingual extraction is now enhanced, and some languages are tailored for even better extraction:
          • Extractor tweaking / mapping definitions
  • 20. DBpedia I18n – Overview
      • DBpedia 3.7 (08/2011) was the first release to introduce internationalized datasets
          • Language-based namespaces (http://{lang}.dbpedia.org)
          • Most are not dereferenceable (except local chapters)
      • DBpedia 3.9 (08/2013) provides data in 191 languages
      • Mappings are enabled for 28 languages (24 active)
      • 15 local DBpedia chapters (http://dbpedia.org/Internationalization/Chapters)
          • 12 from Europe, plus Indonesian, Japanese & Korean
          • Provide dereferenceable URIs / IRIs
          • Maintain their own domain and community
          • Mappings coordination
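The language-based namespaces can be sketched as a small Python helper that builds a resource identifier from a language code and an article title. Several details are assumptions not stated on the slide: the helper name `resource_iri`, the `/resource/` path segment, the treatment of English as the un-prefixed host, and the choice of which characters stay unescaped in the URI form.

```python
from urllib.parse import quote

def resource_iri(lang, title, as_uri=False):
    """Build a resource identifier in the namespace of one language edition.

    Assumptions: English lives on the bare host, other editions on
    {lang}.dbpedia.org, and resources sit under /resource/.
    """
    host = "dbpedia.org" if lang == "en" else lang + ".dbpedia.org"
    local = title.strip().replace(" ", "_")
    if as_uri:
        # Percent-encode non-ASCII characters to get a plain URI instead of an IRI.
        local = quote(local, safe="_(),:'")
    return "http://%s/resource/%s" % (host, local)

print(resource_iri("en", "Eiffel Tower"))
print(resource_iri("fr", "Tour Eiffel"))
print(resource_iri("de", "Köln"))               # IRI form keeps the umlaut
print(resource_iri("de", "Köln", as_uri=True))  # URI form percent-encodes it
```

Whether such an identifier actually dereferences depends on the language: per the slide, only the local chapters serve their own namespace.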
  • 22. DBpedia I18n – Mapping Activity
  • 23–24. DBpedia I18n – Mapping Statistics (2013.02)
  • 25. DBpedia I18n – Mapping Statistics (v3.8)
  • 26. DBpedia I18n – Mapping Statistics (v3.8). To keep our Dutch audience happy :) v3.9 (NL): 211.927 People, 861.633 Places, 16.733 Organizations, 92.314 Works
  • 27–28. DBpedia I18n – Mapping Statistics (v3.8)
  • 29. DBpedia I18n – Further information:
      ● http://wiki.dbpedia.org/Internationalization
      ● Jens Lehmann, Robert Isele, Max Jakob, Anja Jentzsch, Dimitris Kontokostas, Pablo N. Mendes, Sebastian Hellmann, Mohamed Morsey, Patrick van Kleef, Sören Auer, Christian Bizer. DBpedia – A Large-scale, Multilingual Knowledge Base Extracted from Wikipedia. To appear in the Semantic Web Journal.
      ● Dimitris Kontokostas, Charalampos Bratsas, Sören Auer, Sebastian Hellmann, Ioannis Antoniou, George Metakides. Internationalization of Linked Data: The case of the Greek DBpedia edition. Web Semantics: Science, Services and Agents on the World Wide Web, Volume 15, September 2012, Pages 51–61, ISSN 1570–8268, 10.1016/j.websem.2012.01.001.
  • 30. DBpedia @ GSoC
      5 successful GSoC DBpedia projects this year (http://wiki.dbpedia.org/gsoc2013):
      ● Type inference based on categories (Kasun Perera)
      ● New interactive DBpedia interface (Denis Lukovnikov)
      ● Wikidata integration (Hady ElHasar)
          ● Live Wikidata2DBpedia endpoint (2014)
      ● Power tool for DBpedia testing metadata (Lazaros Ioannidis)
          ● Using Databugger output: http://databugger.aksw.org
      ● Input format generalization for DBpedia Spotlight
      Project outputs are available at http://live.dbpedia.org and https://github.com/dbpedia/dbpedia-links
      Do you know any students for DBpedia @ GSoC 2014?
  • 31. Quality @ DBpedia (soon)
      Databugger + GSoC Power tool
      SPARQL quality queries (more than one birth date) =>
        SELECT ?s WHERE { ?s dbo:birthDate ?d . } GROUP BY ?s HAVING (COUNT(?d) > 1)
      Results:
        dbr:Phil_Cuzzi => Wikipedia error
        dbr:Ivan_Cattaneo
        dbr:Vijay_Ghate
        dbr:William_Tempest
        dbr:Cliff_Speegle
        dbr:Arnold,_Duke_of_Guelders
        dbr:Schuyler_Grant
        dbr:Vlas_Chubar
        dbr:Adrian_Peterson
        ...
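The "more than one birth date" check groups statements by subject and keeps the subjects that have more than one value. A minimal Python sketch of the same logic over an in-memory list of triples follows. It is not Databugger: the function name `subjects_with_multiple_values` is hypothetical, and the `ex:` resources and dates are made-up test data, not DBpedia content.

```python
from collections import defaultdict

def subjects_with_multiple_values(triples, predicate):
    """Return subjects that have more than one distinct object for `predicate`.

    Mirrors the GROUP BY ?s ... HAVING (COUNT(?d) > 1) pattern of the quality query.
    """
    values = defaultdict(set)
    for subject, pred, obj in triples:
        if pred == predicate:
            values[subject].add(obj)
    return sorted(s for s, objs in values.items() if len(objs) > 1)

# Made-up data: ex:A carries two conflicting birth dates, ex:B only one.
triples = [
    ("ex:A", "dbo:birthDate", "1970-01-01"),
    ("ex:A", "dbo:birthDate", "1971-01-01"),
    ("ex:A", "dbo:birthPlace", "ex:Somewhere"),
    ("ex:B", "dbo:birthDate", "1980-05-05"),
]

print(subjects_with_multiple_values(triples, "dbo:birthDate"))
```

A hit only flags a candidate. As the annotation on the slide shows, the conflict can originate in Wikipedia itself rather than in the extraction.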
  • 32. DBpedia @ GSoC
  • 33. Thank you for your attention! DBpedia is a community project; please see http://dbpedia.org for a list of contributors.