The nature.com
ontologies portal
nature.com/ontologies
Tony Hammond, Michele Pasin
Macmillan Science and Education
Who we are
We are both part of Macmillan Science and Education*
-  Macmillan S&E is a global STM publisher
-  Tony Hammond is Data Architect, Technology
@tonyhammond
-  Michele Pasin is Information Architect, Product Office
@lambdaman
* We merged earlier this year (May 2015) with Springer Science+Business Media
to become Springer Nature. We are currently actively engaged in integrating our
businesses.
Macmillan: science and education brands
May 2015
We publish a lot of science! (1845-2015)
http://www.nature.com/developers/hacks/articles/by-year
1.2 million articles in total
Why we’re here today: to ask some questions
We have been making semantic data available in RDF models for a number of
years through our data.nature.com portal (2012–2015)
Big questions:
-  Is this data of any use to the Linked Science community?
-  Should Springer Nature continue to invest in LOD sharing?
More specifically:
-  Does the data contain enough items of interest? [Content]
-  Are the vocabularies understandable and useful? [Structure]
-  Are the data easy to get and to reuse? [Accessibility]
-  Is dereference / download / query the preferred option?
Our work so far
-  Step 1: Linked Data Platform (2012–2014)
-  datasets
-  downloads + SPARQL endpoint
-  linked data dereference
-  Step 2: Ontologies Portal (2015–)
-  datasets + models (core, domain)
-  downloads
-  extensive documentation
The Ontologies Portal
www.nature.com/ontologies
Our goals and rationale
-  Semantic technologies are an effective way to do enterprise metadata
management at web scale
-  Initially used primarily for data publishing / sharing (data.nature.com, 2011)
-  Since 2013, a core component of our digital publishing workflow (see ISWC14 paper)
-  Contributing to an emerging web of linked science data
-  As a major publisher since 1845, ideally positioned to bootstrap a science ‘publications hub’
-  Building on the fundamental ties that exist between the actual research works and the
publications that tell the story about it
The vision of a science graph
What’s available
The core ontology
-  Language: OWL 2, Profile: ALCHI(D)
-  Entities: ~50 classes, ~140 properties
-  Principles: Incremental Formalization / Enterprise Integration / Model Coherence
http://www.nature.com/ontologies/core/
The core ontology: mappings
[Diagram: core ontology classes and the external classes they map to]
-  Core classes: :Thing, :Asset, :Publication, :Concept, :Event, :Subject, :Type, :Agent, :ArticleType, :PublishingEvent, :AggregationEvent, :Component, :Document, :Serial
-  External classes: cidoc-crm:Information_Carrier, cidoc-crm:Conceptual_Object, dbpedia:Agent, dc:Agent, dcterms:Agent, cidoc-crm:Agent, vcard:Agent, foaf:Agent, event:Event, bibo:Event, schema:Event, cidoc-crm:TemporalEntity, cidoc-crm:Type, vcard:Type, fabio:SubjectTerm, bibo:Document, cidoc-crm:Document, foaf:Document, bibo:Periodical, fabio:Periodical, schema:Periodical, bibo:DocumentPart, fabio:Expression, cidoc-crm:InformationObject
-  Legend: = owl:equivalentClass
http://www.nature.com/ontologies/linksets/core/
Domain models: subjects ontology
-  Structure: SKOS, multi hierarchical tree, 6 branches, 7 levels of depth
-  Entities: ~2500 concepts
-  Mappings: 100% of terms, using skos:broadMatch or skos:closeMatch (DBpedia and MeSH)
www.nature.com/ontologies/models/subjects/
http://www.nature.com/developers/hacks/#1
Subjects visualizations
Datasets
-  Articles: 25m records (for 1.2m articles) with metadata such as title, publication, etc., but not authors
-  Contributors: 11m records (for 2.7m contributors) i.e. the article’s authors, structured and ordered
but not disambiguated
-  Citations: 218m records (for 9.3m citations) – from an earlier release
Datasets: articles-wikipedia links
How: data extracted using the Wikipedia search API, 51,309 links over 145 years
Quality: only ~900 were links to nature.com without a DOI; the rest all use DOIs correctly
Encoding: cito:isCitedBy => Wikipedia URL, foaf:topic => DBpedia URI
http://www.nature.com/developers/hacks/wikilinks
Data publishing: sources
Sources:
Ontologies (small scale; RDF native)
-  mastered as RDF data (Turtle)
-  managed in GitHub
-  in-memory RDF models built using Apache Jena
-  models augmented at build time using SPIN rules
-  deployed to MarkLogic as RDF/XML for query
-  exported as RDF dataset (Turtle) and as CSV
Documents (large scale; XML native)
-  mastered as XML data
-  managed in MarkLogic XML database
-  data mined from XML documents (1.2m articles) using Scala
-  in-memory RDF models built using Apache Jena
-  injected as RDF/XML sections into XML documents for query
-  exported as RDF dataset (N-Quads)
Organization:
Named graphs – one graph per class
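The "one graph per class" organization and the N-Quads export can be pictured with a small Python sketch. This is only an illustration under stated assumptions: the function name to_nquads, the example.org IRIs, and the graph_for_class mapping are hypothetical, every term is assumed to be an IRI (no literals or blank nodes), and every subject is assumed to carry an rdf:type. It is not the Jena-based build described above.

```python
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

def to_nquads(triples, graph_for_class):
    """Serialize IRI-only triples as N-Quads lines, one named graph per subject class."""
    # Look up each subject's class from its rdf:type triple (assumed present).
    subject_class = {s: o for s, p, o in triples if p == RDF_TYPE}
    lines = []
    for s, p, o in triples:
        # The fourth term of an N-Quads statement names the graph.
        graph = graph_for_class[subject_class[s]]
        lines.append("<%s> <%s> <%s> <%s> ." % (s, p, o, graph))
    return lines
```

Each output line carries the graph IRI as its fourth term, so a consumer can load or drop all instances of one class by graph name.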
Data publishing: workflows
Data publishing: rules (enrichment)
construct {
  ?s npg:publicationStartYear ?xds1 .
  ?s npg:publicationStartYearMonth ?xds2 .
  ?s npg:publicationStartDate ?xds3 .
  ?s npg:publicationEndYear ?xde1 .
  ?s npg:publicationEndYearMonth ?xde2 .
  ?s npg:publicationEndDate ?xde3 .
}
where {
  ?s a npg:Journal .
  optional { ?s npg:dateStart ?dateStart }
  optional { ?s npg:dateEnd ?dateEnd }
  {
    bind (if(regex(?dateStart, "^\\d{4}"), substr(?dateStart,1,4), "") as ?ds1)
    bind (xsd:gYear(?ds1) as ?xds1)
  } union {
    bind (if(regex(?dateStart, "^\\d{4}-\\d{2}"), substr(?dateStart,1,7), "") as ?ds2)
    bind (xsd:gYearMonth(?ds2) as ?xds2)
  } union {
    bind (if(regex(?dateStart, "^\\d{4}-\\d{2}-\\d{2}$"), substr(?dateStart,1,10), "") as ?ds3)
    bind (xsd:date(?ds3) as ?xds3)
  } union {
    …
  }
  filter (?xds1 != "" || ?xds2 != "" || ?xds3 != "" || ?xde1 != "" || ?xde2 != "" || ?xde3 != "")
}
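The enrichment rule derives coarser date values (year, year-month, full date) from a journal's date string. The same idea can be sketched in Python outside SPARQL. This is a minimal illustration, not the SPIN rule itself: the function name derive_granularities and the dictionary keys are hypothetical, and the input is assumed to be an ISO-like string.

```python
import re

# Each entry mirrors one branch of the rule: (name, pattern, prefix length).
_PATTERNS = [
    ("year", re.compile(r"^\d{4}"), 4),
    ("year_month", re.compile(r"^\d{4}-\d{2}"), 7),
    ("date", re.compile(r"^\d{4}-\d{2}-\d{2}$"), 10),
]

def derive_granularities(date_string):
    """Return the prefixes of date_string that match each date granularity."""
    derived = {}
    for name, pattern, length in _PATTERNS:
        if pattern.search(date_string):
            # Keep only the leading characters for this granularity.
            derived[name] = date_string[:length]
    return derived
```

A string holding only a year and month yields the year and year-month values but no full date, which matches the way the rule only emits the values it can actually derive.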
Data publishing: rules (validation)
construct {
  npgg:journals npg:hasConstraintViolation [
    a spin:ConstraintViolation ;
    npg:severityLevel "Warning" ;
    rdfs:label ?message ;
    spin:rule [ a sp:Construct ; sp:text ?query ; ] ;
  ] .
}
where {
  { select (count(?s) as ?count)
    where {
      ?s a npg:Journal .
      filter ( not exists { ?s bibo:shortTitle ?h . } )
    }
  }
  bind (concat("! Found ", str(?count), " journals with no short title") as ?message)
  bind ("""
    construct {
      npgg:journals npg:hasConstraintViolation [
        a spin:ConstraintViolation ;
        spin:violationRoot ?s ; … ] .
    } where { … }
  """ as ?query)
}
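The validation rule counts journals that lack a short title and reports the count as a warning. A rough Python equivalent over plain (subject, predicate, object) tuples looks like this. It is a sketch only: the function names are hypothetical, prefixed names are treated as opaque strings, and no SPIN machinery is involved.

```python
def journals_missing_short_title(triples):
    """Return the journals that have no bibo:shortTitle, sorted by identifier."""
    journals = {s for s, p, o in triples if p == "rdf:type" and o == "npg:Journal"}
    titled = {s for s, p, o in triples if p == "bibo:shortTitle"}
    return sorted(journals - titled)

def violation_message(triples):
    """Build the same warning label as the rule's ?message binding."""
    count = len(journals_missing_short_title(triples))
    return "! Found " + str(count) + " journals with no short title"
```

The first function plays the role of the inner select with its not-exists filter; the second reproduces the concat that builds the warning label.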
Data publishing: rules (contracts)
knowledge-bases:public
...
npg:hasContract [
rdfs:comment "Contract for ArticleTypes Ontology" ;
npg:graph npgg:article-types ;
npg:hasBinding [
npg:onOntology article-types: ;
npg:allowsPredicate
dc:creator , dc:date , dc:publisher , dc:rights , dcterms:license ,
npg:webpage , owl:imports , owl:versionInfo , rdf:type , rdfs:comment ,
skos:definition , skos:prefLabel , skos:note ,
vann:preferredNamespacePrefix , vann:preferredNamespaceUri
;
] , [
npg:onInstanceOf npg:ArticleType ;
npg:allowsPredicate
npg:hasRoot , npg:isPrimaryArticleType ,
npg:id , npg:isLeaf , npg:isRoot , npg:treeDepth ,
rdf:type , rdfs:isDefinedBy , rdfs:seeAlso ,
skos:broadMatch , skos:broader , skos:closeMatch ,
skos:definition , skos:exactMatch , skos:inScheme , skos:narrower ,
skos:prefLabel , skos:relatedMatch , skos:topConceptOf
;
] ;
] ;
...
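One way to read such a contract is as an allow-list: instances of a given class may only use the predicates listed for it. The Python sketch below checks triples against that reading. How the contracts are actually enforced is not stated in these slides, so the function name contract_violations, the dictionary layout, and the shortened predicate list are all assumptions made for illustration.

```python
# Abbreviated, hypothetical allow-list; the slide's binding lists more predicates.
CONTRACT = {
    "npg:ArticleType": {"rdf:type", "skos:prefLabel", "skos:broader", "npg:id"},
}

def contract_violations(triples, contract):
    """Return triples whose predicate is not allowed for the subject's class."""
    # Map each subject to the classes it is declared an instance of.
    classes = {}
    for s, p, o in triples:
        if p == "rdf:type":
            classes.setdefault(s, set()).add(o)
    violations = []
    for s, p, o in triples:
        for cls in classes.get(s, ()):
            allowed = contract.get(cls)
            # Classes without a contract entry are left unchecked.
            if allowed is not None and p not in allowed:
                violations.append((s, p, o))
                break
    return violations
```

Under this reading, a triple that uses an unlisted predicate on a contracted instance is reported, while instances of classes with no contract pass through untouched.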
Next steps
More features:
-  Linked data dereference
-  Richer dataset descriptions (VoID, PROV, HCLS Profile, etc.)
-  SPARQL endpoint?
-  JSON-LD API?
More data:
-  Adding extra data points (funding info, affiliations, …)
-  Revamp citations dataset
-  Longer term: extending archive to include Springer content
More feedback:
-  User testing around data accessibility
-  Surveying communities/users for this data
Looking ahead: how can a publisher make linked
science happen?
From a business perspective:
-  Finding adequate licensing solutions
-  Justifying the effort to publishers
-  What’s the ROI?
From a communities perspective:
-  Do we actually know who the users are?
-  How do we get more feedback/uptake?
-  Should we work more with non-linked-data communities?
Questions?

Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 

The Nature.com ontologies portal - Linked Science 2015

  • 1. The nature.com ontologies portal nature.com/ontologies Tony Hammond, Michele Pasin Macmillan Science and Education
  • 2. Who we are We are both part of Macmillan Science and Education* -  Macmillan S&E is a global STM publisher -  Tony Hammond is Data Architect, Technology @tonyhammond -  Michele Pasin is Information Architect, Product Office @lambdaman * We merged earlier this year (May 2015) with Springer Science+Business Media to become Springer Nature. We are currently actively engaged in integrating our businesses.
  • 3. Macmillan: science and education brands May 2015
  • 4. We publish a lot of science! (1845-2015) http://www.nature.com/developers/hacks/articles/by-year 1.2 million articles in total
  • 5. Why we’re here today: to ask some questions We have been making semantic data available in RDF models for a number of years through our data.nature.com portal (2012–2015) Big questions: -  Is this data of any use to the Linked Science community? -  Should Springer Nature continue to invest in LOD sharing? More specifically: -  Does the data contain enough items of interest? [Content] -  Are the vocabularies understandable and useful? [Structure] -  Are the data easy to get and to reuse? [Accessibility] -  Is dereference / download / query the preferred option?
  • 6. Our work so far -  Step 1: Linked Data Platform (2012–2014) -  datasets -  downloads + SPARQL endpoint -  linked data dereference -  Step 2: Ontologies Portal (2015–) -  datasets + models (core, domain) -  downloads -  extensive documentation
  • 8. Our goals and rationale -  Semantic technologies are an effective way to do enterprise metadata management at web scale -  Initially used primarily for data publishing / sharing (data.nature.com, 2011) -  Since 2013, a core component of our digital publishing workflow (see ISWC14 paper) -  Contributing to an emerging web of linked science data -  As a major publisher since 1845, ideally positioned to bootstrap a science ‘publications hub’ -  Building on the fundamental ties that exist between the actual research works and the publications that tell the story about it
  • 9. The vision of a science graph
  • 11. The core ontology -  Language: OWL 2, Profile: ALCHI(D) -  Entities: ~50 classes, ~140 properties -  Principles: Incremental Formalization/ Enterprise Integration / Model Coherence http://www.nature.com/ontologies/core/
  • 12. The core ontology: mappings :Asset :Thing :Publication :Concept :Event :Subject :Type :Agent :ArticleType :Publishing Event :Aggregation Event :Component :Document :Serial cidoc-crm: Information_Carrier cidoc-crm: Conceptual_Object dbpedia:Agent dc:Agent dcterms:Agent cidoc-crm:Agent vcard:Agent foaf:Agent event:Event bibo:Event schema:Event cidoc-crm: TemporalEntity cidoc-crm:Type vcard:Type fabio:SubjectTerm bibo:Document cidoc-crm:Document foaf:Document bibo:Periodical fabio:Periodical schema:Periodical bibo:DocumentPart fabio:Expression cidoc-crm:InformationObject = owl:equivalentClass http://www.nature.com/ontologies/linksets/core/
  • 13. Domain models: subjects ontology -  Structure: SKOS, multi-hierarchical tree, 6 branches, 7 levels of depth -  Entities: ~2500 concepts -  Mappings: 100% of terms, using skos:broadMatch or skos:closeMatch (DBpedia and MeSH) www.nature.com/ontologies/models/subjects/
  • 15. Datasets -  Articles: 25m records (for 1.2m articles) with metadata like title, publication, etc., except authors -  Contributors: 11m records (for 2.7m contributors) i.e. the article’s authors, structured and ordered but not disambiguated -  Citations: 218m records (for 9.3m citations) – from an earlier release
  • 16. Datasets: articles-wikipedia links How: data extracted using the Wikipedia search API, 51,309 links over 145 years Quality: only ~900 were links to nature.com without a DOI, the rest all use DOIs correctly Encoding: cito:isCitedBy => wiki URL, foaf:topic => DBpedia URI http://www.nature.com/developers/hacks/wikilinks
  • 17. Data publishing: sources Sources: Ontologies (small scale; RDF native) -  mastered as RDF data (Turtle) -  managed in GitHub -  in-memory RDF models built using Apache Jena -  models augmented at build time using SPIN rules -  deployed to MarkLogic as RDF/XML for query -  exported as RDF dataset (Turtle) and as CSV Documents (large scale; XML native) -  mastered as XML data -  managed in MarkLogic XML database -  data mined from XML documents (1.2m articles) using Scala -  in-memory RDF models built using Apache Jena -  injected as RDF/XML sections into XML documents for query -  exported as RDF dataset (N-Quads) Organization: Named graphs – one graph per class
  • 19. Data publishing: rules (enrichment) construct { ?s npg:publicationStartYear ?xds1 . ?s npg:publicationStartYearMonth ?xds2 . ?s npg:publicationStartDate ?xds3 . ?s npg:publicationEndYear ?xde1 . ?s npg:publicationEndYearMonth ?xde2 . ?s npg:publicationEndDate ?xde3 . } where { ?s a npg:Journal . optional { ?s npg:dateStart ?dateStart } optional { ?s npg:dateEnd ?dateEnd } { bind (if(regex(?dateStart, "^\\d{4}"), substr(?dateStart,1,4), "") as ?ds1) bind (xsd:gYear(?ds1) as ?xds1) } union { bind (if(regex(?dateStart, "^\\d{4}-\\d{2}"), substr(?dateStart,1,7), "") as ?ds2) bind (xsd:gYearMonth(?ds2) as ?xds2) } union { bind (if(regex(?dateStart, "^\\d{4}-\\d{2}-\\d{2}$"), substr(?dateStart,1,10), "") as ?ds3) bind (xsd:date(?ds3) as ?xds3) } union { … } filter (?xds1 != "" || ?xds2 != "" || ?xds3 != "" || ?xde1 != "" || ?xde2 != "" || ?xde3 != "") }
  • 20. Data publishing: rules (validation) construct { npgg:journals npg:hasConstraintViolation [ a spin:ConstraintViolation ; npg:severityLevel "Warning" ; rdfs:label ?message ; spin:rule [ a sp:Construct ; sp:text ?query ; ] ; ] . } where { { select (count(?s) as ?count) where { ?s a npg:Journal . filter ( not exists { ?s bibo:shortTitle ?h . } ) } } bind (concat("! Found ", str(?count), " journals with no short title") as ?message) bind (""" construct { npgg:journals npg:hasConstraintViolation [ a spin:ConstraintViolation ; spin:violationRoot ?s ; … ] . } where { … } """ as ?query) }
  • 21. Data publishing: rules (contracts) knowledge-bases:public ... npg:hasContract [ rdfs:comment "Contract for ArticleTypes Ontology" ; npg:graph npgg:article-types ; npg:hasBinding [ npg:onOntology article-types: ; npg:allowsPredicate dc:creator , dc:date , dc:publisher , dc:rights , dcterms:license , npg:webpage , owl:imports , owl:versionInfo , rdf:type , rdfs:comment , skos:definition , skos:prefLabel , skos:note , vann:preferredNamespacePrefix , vann:preferredNamespaceUri ; ] , [ npg:onInstanceOf npg:ArticleType ; npg:allowsPredicate npg:hasRoot , npg:isPrimaryArticleType , npg:id , npg:isLeaf , npg:isRoot , npg:treeDepth , rdf:type , rdfs:isDefinedBy , rdfs:seeAlso , skos:broadMatch , skos:broader , skos:closeMatch , skos:definition , skos:exactMatch , skos:inScheme , skos:narrower , skos:prefLabel , skos:relatedMatch , skos:topConceptOf ; ] ; ] ; ...
  • 22. Data publishing: rules (contracts)
  • 23. Next steps More features: -  Linked data dereference -  Richer dataset descriptions (VoID, PROV, HCLS Profile, etc.) -  SPARQL endpoint? -  JSON-LD API? More data: -  Adding extra data points (funding info, affiliations, …) -  Revamp citations dataset -  Longer term: extending archive to include Springer content More feedback: -  User testing around data accessibility -  Surveying communities/users for this data
  • 24. Looking ahead: how can a publisher make linked science happen? From a business perspective: -  Finding adequate licensing solutions -  Justifying the effort to publishers -  What’s the ROI? From a communities perspective: -  Do we actually know who are the users? -  How do we get more feedback/uptake? -  Should we work more with non-linked-data communities?
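
The enrichment and validation rules on slides 19 and 20 can be read as two small data transformations. The following is a minimal Python sketch of that logic, not the SPIN implementation used in the build. The dictionary keys (`dateStart`, `dateEnd`, `shortTitle`, `publicationStartYear` and so on) are simplified stand-ins for the `npg:` and `bibo:` properties, and the function names are hypothetical. The slide elides the end-date branches of the enrichment rule, so treating `dateEnd` the same way as `dateStart` is an assumption.

```python
import re

# (suffix, pattern, prefix length) for each granularity checked on slide 19.
PATTERNS = [
    ("Year", re.compile(r"^\d{4}"), 4),
    ("YearMonth", re.compile(r"^\d{4}-\d{2}"), 7),
    ("Date", re.compile(r"^\d{4}-\d{2}-\d{2}$"), 10),
]


def enrich_dates(journal):
    """Derive year, year-month and full-date values from a journal's dates."""
    derived = {}
    # Assumption: dateEnd is handled symmetrically to dateStart.
    for source, prefix in (("dateStart", "publicationStart"),
                           ("dateEnd", "publicationEnd")):
        value = journal.get(source)
        if value is None:
            continue  # both properties are optional in the rule
        for suffix, pattern, length in PATTERNS:
            if pattern.search(value):
                derived[prefix + suffix] = value[:length]
    return derived


def short_title_warning(journals):
    """Build the warning label used by the validation rule on slide 20."""
    count = sum(1 for j in journals if "shortTitle" not in j)
    return f"! Found {count} journals with no short title"
```

A record with only a start year yields only `publicationStartYear`, while a full ISO date yields all three start values, which matches the intent of emitting each granularity only when the source string supports it.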