SlideShare a Scribd company logo
1 of 45
Download to read offline
Wikidata
for
Libraries and Archives
Emw
Exploring Wikidata and the Semantic Web for Libraries
Metropolitan New York Library Council
2015-03-16
Wikidata is a free linked database that can be
read and edited by
humans and machines.
Wikidata's goals
● Centralize interwiki links
● Centralize Wikipedia infoboxes
● Provide an interface for rich queries
● Structure the sum of all human knowledge
What you'll learn from this talk
● How to edit Wikidata
● Authority control properties
● Ontology
● Wikidata vocabulary
● Querying and other tools
● Wikidata & Commons plans
Elements of a Wikidata statement
Example: New York City (Q60)
Items and properties
●
Each item and property has its own page
● Items
– Represent subjects: Douglas Adams, Challenger disaster
– Have identifiers like Q42, Q921090
–13,745,153 items
● Properties
– Represent attribute names: occupation, cause of
– Have identifiers like P106, P828
– 1,429 properties
Statements and claims
● Claims
– Claims are “triplets”
● Formally: subject, predicate, object
● In Wikidata: item, property, value
● Example: Douglas Adams, occupation, author
● Statements
– A claim is only part of a statement
– Statements also include:
● References
● Ranks
Qualifiers, ranks, references
● Qualifiers
– Qualifiers are properties used on claims rather than items
– “Yonkers population 12,733 at time (P585) 1860”
● Ranks
– Preferred, normal, deprecated
– Useful to mark outdated claims
● References
– Source of claim; provenance
– “... stated in (P248) 1860 United States Census”
More on Wikidata vocabulary
https://www.wikidata.org/wiki/Wikidata:Glossary
Wikipedia articles have a Wikidata item link in the
left navigation panel.
Wikidata link on Wikipedia
Getting to Wikidata from Wikipedia
Wikidata's instant search suggests items that have
labels or aliases matching your keyword.
Wikidata search
Search by label
Search by alias: “flu” -> influenza
Finding properties
● Is there a property for “number of windows”?
● What was the ID of that property, again?
● Search
– In main site search box, prefix search term with “P:”
– “P:number of”, “P:occupation”
– Instant search doesn't work for properties, only items
● Browse
– https://www.wikidata.org/wiki/Wikidata:List_of_properties
^ bookmark this!
Let's edit Wikidata
Waldseemuller map
https://www.wikidata.org/wiki/Q195801
● instance of (P31): map
● creator (P170): Martin Waldseemüller
● publication (P577): April 1507
● language of work (P407): Latin
● depicts (P180): Americas, Africa,
Europe, Asia, Pacific Ocean
● has part (P527): woodcut
● location (P276): Library of Congress
Tools
– Querying: Autolist, by Magnus Manske
● http://tools.wmflabs.org/autolist/autolist1.html
– Batch editing: Widar, by Magnus Manske
● https://tools.wmflabs.org/autolist/
– Software framework: Wikidata Toolkit, by Markus Kroetzsch et al.
● https://www.mediawiki.org/wiki/Wikidata_Toolkit
● https://github.com/Wikidata/Wikidata-Toolkit
Querying in Wikidata
Use case: Get a list of history books
Pseudo-query:
instance of: book AND genre: history
instance of: P31
book: Q571
genre: P136
history: Q309
Wikidata query in Autolist:
claim[31:571] AND claim[136:309]
http://tools.wmflabs.org/autolist/autolist1.html?q=claim[31:571]%20AND%20claim[136:309]
Authority control on Wikidata
● “Identifier” properties link Wikidata items to entities in
external databases
● Also known as authority control numbers
● Examples:
– VIAF identifier (P214)
– Freebase identifier (P646)
– MusicBrainz artist ID (P434)
Identifier properties on Wikidata
Top 10 ID properties on Wikidata as of 2015-03-15
1) Freebase identifier (P646) 1,153,786
2) China administrative division code (P442) 742,240
3) VIAF identifier (P214) 527,786
4) GeoNames ID (P1566) 490,444
5) GND identifier (P227) 345,247
6) ITIS TSN (P815) 334,053
7) PlantList-ID (P1070) 331,179
8) Tropicos taxon name identifier (P960) 318,719
9) IMDb identifier (P345) 286,637
10) IPNI taxon name identifier (P961) 276,515
https://www.wikidata.org/wiki/Wikidata:Database_reports/List_of_properties/Top100
Identifier properties on WikidataIdentifier properties on Wikidata
Ontology
● “A specification of a conceptualization”
● Rooted in long-standing philosophical traditions
● Useful for:
– building concept hierarchies
– making common knowledge computable
Tree of Porphyry
User:VoiceOfTheCommons, CC-BY-SA 3.0
Classes and instances
● Plato is a human is a animal
● Plato instance of human subclass of animal
● Instance: concrete object, individual
● Class: abstract object
Concept hierarchy: painting
http://tools.wmflabs.org/wikidata-todo/tree.html?q=3305213&rp=279&lang=en
Concept tree diagram: painting
http://tools.wmflabs.org/wikidata-todo/tree.html?q=3305213&rp=279&lang=en&method=d3
Ontology on Wikidata
● instance of (P31)
– rdf:type in RDF and OWL
– 12,692,181 usages
– Most popular Wikidata property
● subclass of (P279)
– “all instances of A are also instances of B”
– rdfs:subClassOf in RDF and OWL
– 190,095 usages
Ontology on Wikidata
● Last but not least: part of (P361)
– Third basic membership property
– Top-level “part-whole” relation
● Instance of, subclass of and part of are all transitive
● Transitive relation :
A subclass of B
B subclass of C
:. A subclass of C
https://www.wikidata.org/wiki/Help:Basic_membership_properties
subproperty of (P1647)
● How to link author (P50) and creator (P170)
– subproperty of (P1647)
– author subproperty of creator
● Wikidata supports claims about properties
● subproperty of (P1647) has semantics of
rdfs:subPropertyOf
Structured data for images
● Use case:
– A library, archive or other cultural institution wants to
make its images discoverable on Wikidata
● Structured Data on Wikimedia Commons
– Project of the Wikimedia Foundation (WMF)
Structured Data
for Wikimedia Commons
● Purpose
– use structured data for all media files on Wikimedia sites
– make it easier for users to read, translate and edit file
information
– enable developers to build better tools, both to
contribute and reuse Commons content.
https://commons.wikimedia.org/wiki/Commons:Structured_data/Overview
Structured Data for Commons: FAQ
● How long will it take?
–Completion will take years, but project will be released in stages.
–Forward-facing part of project is on hold.
● Is file meta data staying on Commons?
–Yes
● How will file information be structured?
–To be determined
● Will Commons still have templates?
–Yes
● Will Commons still have categories?
–Yes
Wikidata for Commons
Partial support already exists!
Wikidata links to Commons
Commons links to Wikidata
Library ontology on Wikidata
● What is a book?
● Different levels of existence (FRBR)
– Work
– Expression - e.g., translation for a language
– Manifestation - edition, thing with an ISBN
– Item - physical copy
FRBR example
● Work
– Bible (Q1845)
● Expression
– Latin translation of the Bible
● Manifestation
– Vulgate (Q131175): a particular Latin translation
● Item
– Gutenberg Bible (Q158075): a physical book in New York Public Library(!)
Linking FRBR levels in Wikidata
Done by the property edition or translation of (P629)
● Gutenberg Bible (item)
– edition or translation of: Vulgate
● Vulgate (manifestation)
– edition or translation of: Bible
● Bible (work)
Trouble: what is an instance?
● Thing with a unique location in space and time
● Bible instance of religious text: incorrect?
– Gutenberg Bible instance of religious text: all agree
● Solution: Bible as information artifact
Information Artifact Ontology
● IAO sets forth a basic ontological distinction:
– Entities in the world
● Actual people: Grace Hopper, Barack Obama
● That very old tree
● Gutenberg Bible
– Information artifacts
● Entities about something in reality
● Concretized by some information bearer
Information Artifact Ontology
● Addresses problems in Dublin Core Metadata Initiative
(DCMI)
– Smith (2014)
http://ncor.buffalo.edu/2014/IAOW/IAO-Tutorial-Smith-Rio-Sep-2014.pptx
● Rooted in Basic Formal Ontology (BFO), which is used in
many scienitific ontologies
– Smith et al. (2007). Nature Biotechnology.
http://www.nature.com/nbt/journal/v25/n11/full/nbt1346.html
Summary
● Wikidata is a free knowledge base that anyone can edit
● Search properties by “p:search term”
● Wikidata items link to external databases through
identifier properties, e.g. VIAF identifier (P214)
● Wikidata supports FRBR and other ontologies
Thank you!
https://www.wikidata.org/wiki/User:Emw

More Related Content

What's hot

FedX - Optimization Techniques for Federated Query Processing on Linked Data
FedX - Optimization Techniques for Federated Query Processing on Linked DataFedX - Optimization Techniques for Federated Query Processing on Linked Data
FedX - Optimization Techniques for Federated Query Processing on Linked Dataaschwarte
 
Semantic Web And Coldfusion
Semantic Web And ColdfusionSemantic Web And Coldfusion
Semantic Web And Coldfusionwilliam_greenly
 
Wikipedia infobox type_prediction_slides_dl4_k_gs
Wikipedia infobox type_prediction_slides_dl4_k_gsWikipedia infobox type_prediction_slides_dl4_k_gs
Wikipedia infobox type_prediction_slides_dl4_k_gsRussa Biswas
 
Web Data Management with RDF
Web Data Management with RDFWeb Data Management with RDF
Web Data Management with RDFM. Tamer Özsu
 
An Introduction to RDF and the Web of Data
An Introduction to RDF and the Web of DataAn Introduction to RDF and the Web of Data
An Introduction to RDF and the Web of DataOlaf Hartig
 
18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani
18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani
18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italianiDiego Valerio Camarda
 
Rich Data? Poor Data? Depends on...
Rich Data? Poor Data? Depends on...Rich Data? Poor Data? Depends on...
Rich Data? Poor Data? Depends on...Lars G. Svensson
 
ALEC (A List of Everything Cool)
ALEC (A List of Everything Cool)ALEC (A List of Everything Cool)
ALEC (A List of Everything Cool)Roderic Page
 
On the Way to a Holding Ontology
On the Way to a Holding OntologyOn the Way to a Holding Ontology
On the Way to a Holding OntologyJakob .
 
PyConUK 2016 - Writing English Right
PyConUK 2016  - Writing English RightPyConUK 2016  - Writing English Right
PyConUK 2016 - Writing English RightTatiana Al-Chueyr
 
Connections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystifiedConnections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystifiedJakob .
 
2011 4IZ440 Semantic Web – RDF, SPARQL, and software APIs
2011 4IZ440 Semantic Web – RDF, SPARQL, and software APIs2011 4IZ440 Semantic Web – RDF, SPARQL, and software APIs
2011 4IZ440 Semantic Web – RDF, SPARQL, and software APIsJosef Petrák
 
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...DeVonne Parks, CEM
 
DBpedia Archive using Memento, Triple Pattern Fragments, and HDT
DBpedia Archive using Memento, Triple Pattern Fragments, and HDTDBpedia Archive using Memento, Triple Pattern Fragments, and HDT
DBpedia Archive using Memento, Triple Pattern Fragments, and HDTHerbert Van de Sompel
 
Linked open data for cultural heritage
Linked open data for cultural heritageLinked open data for cultural heritage
Linked open data for cultural heritageAthanasios Velios
 

What's hot (18)

FedX - Optimization Techniques for Federated Query Processing on Linked Data
FedX - Optimization Techniques for Federated Query Processing on Linked DataFedX - Optimization Techniques for Federated Query Processing on Linked Data
FedX - Optimization Techniques for Federated Query Processing on Linked Data
 
Semantic Web And Coldfusion
Semantic Web And ColdfusionSemantic Web And Coldfusion
Semantic Web And Coldfusion
 
Wikipedia infobox type_prediction_slides_dl4_k_gs
Wikipedia infobox type_prediction_slides_dl4_k_gsWikipedia infobox type_prediction_slides_dl4_k_gs
Wikipedia infobox type_prediction_slides_dl4_k_gs
 
Web Data Management with RDF
Web Data Management with RDFWeb Data Management with RDF
Web Data Management with RDF
 
An Introduction to RDF and the Web of Data
An Introduction to RDF and the Web of DataAn Introduction to RDF and the Web of Data
An Introduction to RDF and the Web of Data
 
Keynote session - LOD2014 W3C event
Keynote session - LOD2014 W3C eventKeynote session - LOD2014 W3C event
Keynote session - LOD2014 W3C event
 
18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani
18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani
18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani
 
Rich Data? Poor Data? Depends on...
Rich Data? Poor Data? Depends on...Rich Data? Poor Data? Depends on...
Rich Data? Poor Data? Depends on...
 
ALEC (A List of Everything Cool)
ALEC (A List of Everything Cool)ALEC (A List of Everything Cool)
ALEC (A List of Everything Cool)
 
On the Way to a Holding Ontology
On the Way to a Holding OntologyOn the Way to a Holding Ontology
On the Way to a Holding Ontology
 
PyConUK 2016 - Writing English Right
PyConUK 2016  - Writing English RightPyConUK 2016  - Writing English Right
PyConUK 2016 - Writing English Right
 
Linked Open Data stuff
Linked Open Data stuffLinked Open Data stuff
Linked Open Data stuff
 
Clark - Metadata is the Message
Clark - Metadata is the MessageClark - Metadata is the Message
Clark - Metadata is the Message
 
Connections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystifiedConnections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystified
 
2011 4IZ440 Semantic Web – RDF, SPARQL, and software APIs
2011 4IZ440 Semantic Web – RDF, SPARQL, and software APIs2011 4IZ440 Semantic Web – RDF, SPARQL, and software APIs
2011 4IZ440 Semantic Web – RDF, SPARQL, and software APIs
 
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
 
DBpedia Archive using Memento, Triple Pattern Fragments, and HDT
DBpedia Archive using Memento, Triple Pattern Fragments, and HDTDBpedia Archive using Memento, Triple Pattern Fragments, and HDT
DBpedia Archive using Memento, Triple Pattern Fragments, and HDT
 
Linked open data for cultural heritage
Linked open data for cultural heritageLinked open data for cultural heritage
Linked open data for cultural heritage
 

Similar to Wikidata for libraries and archives

Up and running with Wikidata
Up and running with WikidataUp and running with Wikidata
Up and running with Wikidata_Emw
 
An Ambitious Wikidata Tutorial
An Ambitious Wikidata TutorialAn Ambitious Wikidata Tutorial
An Ambitious Wikidata Tutorial_Emw
 
VALA Tech Camp 2017: Intro to Wikidata & SPARQL
VALA Tech Camp 2017: Intro to Wikidata & SPARQLVALA Tech Camp 2017: Intro to Wikidata & SPARQL
VALA Tech Camp 2017: Intro to Wikidata & SPARQLJane Frazier
 
Quipu: Quechua Knowledge Graph [Pilot: Building virtual assistants based on Q...
Quipu: Quechua Knowledge Graph [Pilot: Building virtual assistants based on Q...Quipu: Quechua Knowledge Graph [Pilot: Building virtual assistants based on Q...
Quipu: Quechua Knowledge Graph [Pilot: Building virtual assistants based on Q...Elwin Huaman
 
2014-02-27 Wikidata talk Cambridge
2014-02-27 Wikidata talk Cambridge2014-02-27 Wikidata talk Cambridge
2014-02-27 Wikidata talk CambridgeMagnus Manske
 
Curation and Digital Storytelling
Curation and Digital StorytellingCuration and Digital Storytelling
Curation and Digital StorytellingShawn Day
 
Linked Open Data: A simple how-to
Linked Open Data: A simple how-toLinked Open Data: A simple how-to
Linked Open Data: A simple how-tonvitucci
 
2014 10-11 Wikidata talk London WMF UK
2014 10-11 Wikidata talk London WMF UK2014 10-11 Wikidata talk London WMF UK
2014 10-11 Wikidata talk London WMF UKMagnus Manske
 
The Nature.com ontologies portal - Linked Science 2015
The Nature.com ontologies portal - Linked Science 2015The Nature.com ontologies portal - Linked Science 2015
The Nature.com ontologies portal - Linked Science 2015Michele Pasin
 
The nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesThe nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesTony Hammond
 
A Little SPARQL in your Analytics
A Little SPARQL in your AnalyticsA Little SPARQL in your Analytics
A Little SPARQL in your AnalyticsDr. Neil Brittliff
 
Entity Retrieval (SIGIR 2013 tutorial)
Entity Retrieval (SIGIR 2013 tutorial)Entity Retrieval (SIGIR 2013 tutorial)
Entity Retrieval (SIGIR 2013 tutorial)krisztianbalog
 
Hacktoberfest 2020 - Intro to Knowledge Graphs
Hacktoberfest 2020 - Intro to Knowledge GraphsHacktoberfest 2020 - Intro to Knowledge Graphs
Hacktoberfest 2020 - Intro to Knowledge GraphsArangoDB Database
 
Analysing Structured Scholarly Data Embedded in Web Pages
Analysing Structured Scholarly Data Embedded in Web PagesAnalysing Structured Scholarly Data Embedded in Web Pages
Analysing Structured Scholarly Data Embedded in Web PagesUjwal Gadiraju
 
Knowledge Matters! The Role of Knowledge Graphs in Modern AI Systems
Knowledge Matters! The Role of Knowledge Graphs in Modern AI SystemsKnowledge Matters! The Role of Knowledge Graphs in Modern AI Systems
Knowledge Matters! The Role of Knowledge Graphs in Modern AI SystemsHeiko Paulheim
 
Ontologies introduction - ecoOnto meeting
Ontologies introduction - ecoOnto meetingOntologies introduction - ecoOnto meeting
Ontologies introduction - ecoOnto meetingjchabalier
 
Entity Retrieval (tutorial organized by Radialpoint in Montreal)
Entity Retrieval (tutorial organized by Radialpoint in Montreal)Entity Retrieval (tutorial organized by Radialpoint in Montreal)
Entity Retrieval (tutorial organized by Radialpoint in Montreal)krisztianbalog
 
Estermann wd glam-intro_20181204
Estermann wd glam-intro_20181204Estermann wd glam-intro_20181204
Estermann wd glam-intro_20181204Beat Estermann
 
Csvconf data hacking-with_wikimedia_projects
Csvconf data hacking-with_wikimedia_projectsCsvconf data hacking-with_wikimedia_projects
Csvconf data hacking-with_wikimedia_projectsmattsenate
 

Similar to Wikidata for libraries and archives (20)

Up and running with Wikidata
Up and running with WikidataUp and running with Wikidata
Up and running with Wikidata
 
An Ambitious Wikidata Tutorial
An Ambitious Wikidata TutorialAn Ambitious Wikidata Tutorial
An Ambitious Wikidata Tutorial
 
VALA Tech Camp 2017: Intro to Wikidata & SPARQL
VALA Tech Camp 2017: Intro to Wikidata & SPARQLVALA Tech Camp 2017: Intro to Wikidata & SPARQL
VALA Tech Camp 2017: Intro to Wikidata & SPARQL
 
Quipu: Quechua Knowledge Graph [Pilot: Building virtual assistants based on Q...
Quipu: Quechua Knowledge Graph [Pilot: Building virtual assistants based on Q...Quipu: Quechua Knowledge Graph [Pilot: Building virtual assistants based on Q...
Quipu: Quechua Knowledge Graph [Pilot: Building virtual assistants based on Q...
 
2014-02-27 Wikidata talk Cambridge
2014-02-27 Wikidata talk Cambridge2014-02-27 Wikidata talk Cambridge
2014-02-27 Wikidata talk Cambridge
 
Curation and Digital Storytelling
Curation and Digital StorytellingCuration and Digital Storytelling
Curation and Digital Storytelling
 
Linked Open Data: A simple how-to
Linked Open Data: A simple how-toLinked Open Data: A simple how-to
Linked Open Data: A simple how-to
 
2014 10-11 Wikidata talk London WMF UK
2014 10-11 Wikidata talk London WMF UK2014 10-11 Wikidata talk London WMF UK
2014 10-11 Wikidata talk London WMF UK
 
The Nature.com ontologies portal - Linked Science 2015
The Nature.com ontologies portal - Linked Science 2015The Nature.com ontologies portal - Linked Science 2015
The Nature.com ontologies portal - Linked Science 2015
 
The nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesThe nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologies
 
A Little SPARQL in your Analytics
A Little SPARQL in your AnalyticsA Little SPARQL in your Analytics
A Little SPARQL in your Analytics
 
Entity Retrieval (SIGIR 2013 tutorial)
Entity Retrieval (SIGIR 2013 tutorial)Entity Retrieval (SIGIR 2013 tutorial)
Entity Retrieval (SIGIR 2013 tutorial)
 
Hacktoberfest 2020 - Intro to Knowledge Graphs
Hacktoberfest 2020 - Intro to Knowledge GraphsHacktoberfest 2020 - Intro to Knowledge Graphs
Hacktoberfest 2020 - Intro to Knowledge Graphs
 
Analysing Structured Scholarly Data Embedded in Web Pages
Analysing Structured Scholarly Data Embedded in Web PagesAnalysing Structured Scholarly Data Embedded in Web Pages
Analysing Structured Scholarly Data Embedded in Web Pages
 
Knowledge Matters! The Role of Knowledge Graphs in Modern AI Systems
Knowledge Matters! The Role of Knowledge Graphs in Modern AI SystemsKnowledge Matters! The Role of Knowledge Graphs in Modern AI Systems
Knowledge Matters! The Role of Knowledge Graphs in Modern AI Systems
 
RDA
RDA RDA
RDA
 
Ontologies introduction - ecoOnto meeting
Ontologies introduction - ecoOnto meetingOntologies introduction - ecoOnto meeting
Ontologies introduction - ecoOnto meeting
 
Entity Retrieval (tutorial organized by Radialpoint in Montreal)
Entity Retrieval (tutorial organized by Radialpoint in Montreal)Entity Retrieval (tutorial organized by Radialpoint in Montreal)
Entity Retrieval (tutorial organized by Radialpoint in Montreal)
 
Estermann wd glam-intro_20181204
Estermann wd glam-intro_20181204Estermann wd glam-intro_20181204
Estermann wd glam-intro_20181204
 
Csvconf data hacking-with_wikimedia_projects
Csvconf data hacking-with_wikimedia_projectsCsvconf data hacking-with_wikimedia_projects
Csvconf data hacking-with_wikimedia_projects
 

Recently uploaded

PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxBoston Institute of Analytics
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...ssuserf63bd7
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 

Recently uploaded (20)

PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 

Wikidata for libraries and archives

  • 1. Wikidata for Libraries and Archives Emw Exploring Wikidata and the Semantic Web for Libraries Metropolitan New York Library Council 2015-03-16
  • 2. Wikidata is a free linked database that can be read and edited by humans and machines.
  • 3. Wikidata's goals ● Centralize interwiki links ● Centralize Wikipedia infoboxes ● Provide an interface for rich queries ● Structure the sum of all human knowledge
  • 4. What you'll learn from this talk ● How to edit Wikidata ● Authority control properties ● Ontology ● Wikidata vocabulary ● Querying and other tools ● Wikidata & Commons plans
  • 5. Elements of a Wikidata statement
  • 6. Example: New York City (Q60)
  • 7. Items and properties ● Each item and property has its own page ● Items – Represent subjects: Douglas Adams, Challenger disaster – Have identifiers like Q42, Q921090 –13,745,153 items ● Properties – Represent attribute names: occupation, cause of – Have identifiers like P106, P828 – 1,429 properties
  • 8. Statements and claims ● Claims – Claims are “triplets” ● Formally: subject, predicate, object ● In Wikidata: item, property, value ● Example: Douglas Adams, occupation, author ● Statements – A claim is only part of a statement – Statements also include: ● References ● Ranks
  • 9. Qualifiers, ranks, references ● Qualifiers – Qualifiers are properties used on claims rather than items – “Yonkers population 12,733 at time (P585) 1860” ● Ranks – Preferred, normal, deprecated – Useful to mark outdated claims ● References – Source of claim; provenance – “... stated in (P248) 1860 United States Census”
  • 10. More on Wikidata vocabulary https://www.wikidata.org/wiki/Wikidata:Glossary
  • 11. Wikipedia articles have a Wikidata item link in the left navigation panel. Wikidata link on Wikipedia
  • 12. Getting to Wikidata from Wikipedia
  • 13. Wikidata's instant search suggests items that have labels or aliases matching your keyword. Wikidata search
  • 15. Search by alias: “flu” -> influenza
  • 16. Finding properties ● Is there a property for “number of windows”? ● What was the ID of that property, again? ● Search – In main site search box, prefix search term with “P:” – “P:number of”, “P:occupation” – Instant search doesn't work for properties, only items ● Browse – https://www.wikidata.org/wiki/Wikidata:List_of_properties ^ bookmark this!
  • 18. Waldseemuller map https://www.wikidata.org/wiki/Q195801 ● instance of (P31): map ● creator (P170): Martin Waldseemüller ● publication (P577): April 1507 ● language of work (P407): Latin ● depicts (P180): Americas, Africa, Europe, Asia, Pacific Ocean ● has part (P527): woodcut ● location (P276): Library of Congress
  • 19. Tools – Querying: Autolist, by Magnus Manske ● http://tools.wmflabs.org/autolist/autolist1.html – Batch editing: Widar, by Magnus Manske ● https://tools.wmflabs.org/autolist/ – Software framework: Wikidata Toolkit, by Markus Kroetzsch et al. ● https://www.mediawiki.org/wiki/Wikidata_Toolkit ● https://github.com/Wikidata/Wikidata-Toolkit
  • 20. Querying in Wikidata Use case: Get a list of history books Pseudo-query: instance of: book AND genre: history instance of: P31 book: Q571 genre: P136 history: Q309 Wikidata query in Autolist: claim[31:571] AND claim[136:309]
  • 22. Authority control on Wikidata ● “Identifier” properties link Wikidata items to entities in external databases ● Also known as authority control numbers ● Examples: – VIAF identifier (P214) – Freebase identifier (P646) – MusicBrainz artist ID (P434)
  • 23. Identifier properties on Wikidata Top 10 ID properties on Wikidata as of 2015-03-15 1) Freebase identifier (P646) 1,153,786 2) China administrative division code (P442) 742,240 3) VIAF identifier (P214) 527,786 4) GeoNames ID (P1566) 490,444 5) GND identifier (P227) 345,247 6) ITIS TSN (P815) 334,053 7) PlantList-ID (P1070) 331,179 8) Tropicos taxon name identifier (P960) 318,719 9) IMDb identifier (P345) 286,637 10) IPNI taxon name identifier (P961) 276,515 https://www.wikidata.org/wiki/Wikidata:Database_reports/List_of_properties/Top100 Identifier properties on WikidataIdentifier properties on Wikidata
  • 24. Ontology ● “A specification of a conceptualization” ● Rooted in long-standing philosophical traditions ● Useful for: – building concept hierarchies – making common knowledge computable
  • 26. Classes and instances ● Plato is a human is a animal ● Plato instance of human subclass of animal ● Instance: concrete object, individual ● Class: abstract object
  • 28. Concept tree diagram: painting http://tools.wmflabs.org/wikidata-todo/tree.html?q=3305213&rp=279&lang=en&method=d3
  • 29. Ontology on Wikidata ● instance of (P31) – rdf:type in RDF and OWL – 12,692,181 usages – Most popular Wikidata property ● subclass of (P279) – “all instances of A are also instances of B” – rdfs:subClassOf in RDF and OWL – 190,095 usages
  • 30. Ontology on Wikidata ● Last but not least: part of (P361) – Third basic membership property – Top-level “part-whole” relation ● Instance of, subclass of and part of are all transitive ● Transitive relation : A subclass of B B subclass of C :. A subclass of C https://www.wikidata.org/wiki/Help:Basic_membership_properties
  • 31. subproperty of (P1647) ● How to link author (P50) and creator (P170) – subproperty of (P1647) – author subproperty of creator ● Wikidata supports claims about properties ● subproperty of (P1647) has semantics of rdfs:subPropertyOf
  • 32. Structured data for images ● Use case: – A library, archive or other cultural institution wants to make its images discoverable on Wikidata ● Structured Data on Wikimedia Commons – Project of the Wikimedia Foundation (WMF)
  • 33. Structured Data for Wikimedia Commons ● Purpose – use structured data for all media files on Wikimedia sites – make it easier for users to read, translate and edit file information – enable developers to build better tools, both to contribute and reuse Commons content. https://commons.wikimedia.org/wiki/Commons:Structured_data/Overview
  • 34. Structured Data for Commons: FAQ ● How long will it take? –Completion will take years, but project will be released in stages. –Forward-facing part of project is on hold. ● Is file meta data staying on Commons? –Yes ● How will file information be structured? –To be determined ● Will Commons still have templates? –Yes ● Will Commons still have categories? –Yes
  • 35. Wikidata for Commons Partial support already exists!
  • 36. Wikidata links to Commons
  • 37. Commons links to Wikidata
  • 38. Library ontology on Wikidata ● What is a book? ● Different levels of existence (FRBR) – Work – Expression - e.g., translation for a language – Manifestation - edition, thing with an ISBN – Item - physical copy
  • 39. FRBR example ● Work – Bible (Q1845) ● Expression – Latin translation of the Bible ● Manifestation – Vulgate (Q131175): a particular Latin translation ● Item – Gutenberg Bible (Q158075): a physical book in New York Public Library(!)
  • 40. Linking FRBR levels in Wikidata Done by the property edition or translation of (P629) ● Gutenberg Bible (item) – edition or translation of: Vulgate ● Vulgate (manifestation) – edition or translation of: Bible ● Bible (work)
  • 41. Trouble: what is an instance? ● Thing with a unique location in space and time ● Bible instance of religious text: incorrect? – Gutenberg Bible instance of religious text: all agree ● Solution: Bible as information artifact
  • 42. Information Artifact Ontology ● IAO sets forth a basic ontological distinction: – Entities in the world ● Actual people: Grace Hopper, Barack Obama ● That very old tree ● Gutenberg Bible – Information artifacts ● Entities about something in reality ● Concretized by some information bearer
  • 43. Information Artifact Ontology ● Addresses problems in Dublin Core Metadata Initiative (DCMI) – Smith (2014) http://ncor.buffalo.edu/2014/IAOW/IAO-Tutorial-Smith-Rio-Sep-2014.pptx ● Rooted in Basic Formal Ontology (BFO), which is used in many scienitific ontologies – Smith et al. (2007). Nature Biotechnology. http://www.nature.com/nbt/journal/v25/n11/full/nbt1346.html
  • 44. Summary ● Wikidata is a free knowledge base that anyone can edit ● Search properties by “p:search term” ● Wikidata items link to external databases through identifier properties, e.g. VIAF identifier (P214) ● Wikidata supports FRBR and other ontologies