Persistent Identifier
Linking
Tom Demeranville
THOR Senior Project Officer and ORCID
Software Engineer
https://orcid.org/0000-0003-0902-4386
Martin Fenner
DataCite Technical Director
https://orcid.org/0000-0003-1419-2405
Laura Rueda
DataCite Communications Director
https://orcid.org/0000-0001-5952-7630
Linking Data and Data
Challenges
How to cite data with the right granularity?
How to link data and contributors with the right
granularity?
Datasets that are part of larger datasets or
heterogeneous collections
Multiple versions of the same dataset
Dynamic data
Linking Data and Data
Challenges – Granularity of Data in ORCID Record
http://search.datacite.org/contributors/0000-0002-8635-8390
Linking Data and Data
Challenges – Versioned Data in ORCID Record
http://orcid.org/0000-0003-1419-2405
Linking Data and Data
Research – Granularity
Attribution vs. Specificity
Persistent identifiers for datasets need to
support different levels of granularity
Ideally this is done by multiple persistent
identifiers linked via Has Part/Is Part Of
relationship
Collections will play an increasingly
important role
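A minimal sketch of the Has Part/Is Part Of idea, assuming links are kept as simple (subject, relation, object) tuples. The DOIs and function names are hypothetical; only the relation names HasPart and IsPartOf are taken from the DataCite vocabulary.

```python
# Hypothetical DOIs, for illustration only.
COLLECTION = "10.1234/collection"
PARTS = ["10.1234/part-1", "10.1234/part-2"]

# The two relation types are inverses of each other.
INVERSE = {"HasPart": "IsPartOf", "IsPartOf": "HasPart"}


def part_links(collection, parts):
    """Return (subject, relation, object) links in both directions."""
    links = []
    for part in parts:
        links.append((collection, "HasPart", part))
        links.append((part, INVERSE["HasPart"], collection))
    return links


def identifiers_to_credit(doi, links):
    """Follow IsPartOf links upwards, so citing a part also reaches its collection."""
    found = [doi]
    for subject, relation, obj in links:
        if subject == doi and relation == "IsPartOf":
            found.extend(identifiers_to_credit(obj, links))
    return found
```

Citing the specific part keeps specificity, while walking the IsPartOf links lets attribution reach the collection as well.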
Linking Data and Data
Research – Data Versioning
Versioning of data is important for specificity and
verifiability
Practices and expectations for versioning of data vary
widely between communities and data centers
The data repository is ultimately responsible for
decisions about versioning
General recommendations can only include high-level
best practices and common vocabulary
Linking Data and Data
Research – Cross-Linking of Databases
Linking Data and Data
Implementation – Cross-Linking of Databases
Cross-linking between different databases is not conceptually different
from article-data linking; implementation should follow the same
principles (see next section)
Linking Data and Data
Implementation – Collections
http://search.datacite.org/works/10.1594/PANGAEA.611088
Linking Data and Data
Demo
Collection of climate data from ship logbooks
http://search.datacite.org/works/10.1594/PANGAEA.611088
Dryad Datasets associated with a specific publication
http://search.datacite.org/works/10.5061/DRYAD.9R161.1
Linking Data and Articles
Challenges
Data underlying the findings described in a manuscript
not always fully available
Data underlying the findings described in a manuscript
made available, but hidden in supplementary information
and not easily findable
Data underlying the findings described in a manuscript
made available, but not properly linked to/from article
Linking Data and Articles
Implementation – Follow FAIR Data Principles
From: http://slideshare.net/lshtm/preparing-data-for-sharing-the-fair-principles
Linking Data and Articles
Research - Conceptual Model
Linkage as Triples. In the form subject-predicate-object,
consistent with the Resource Description
Framework (RDF) data model.
Describing the relation. Additional information
such as relation type (e.g. A is new version of B)
and provenance.
Persistent Identifiers as HTTP URIs. This makes
them actionable, and compatible with the RDF
data model.
Centralized infrastructure for persistent identifier
linking. Provided for example by ORCID and
DataCite, facilitating discovery.
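The conceptual model above can be sketched as a small data structure. This is an illustration under stated assumptions, not a schema used by ORCID or DataCite: the Link type, the helper names, the DOIs and the source labels are all hypothetical.

```python
from typing import NamedTuple


class Link(NamedTuple):
    subject: str    # persistent identifier as an HTTP URI
    predicate: str  # relation type, e.g. "IsNewVersionOf"
    object: str     # persistent identifier as an HTTP URI
    source: str     # provenance: who asserted the link


def doi_uri(doi):
    """Express a DOI as an actionable HTTP URI."""
    return "https://doi.org/" + doi


# Hypothetical DOIs and sources, for illustration only.
links = [
    Link(doi_uri("10.1234/dataset.v2"), "IsNewVersionOf",
         doi_uri("10.1234/dataset.v1"), "example-repository"),
    Link(doi_uri("10.1234/article"), "References",
         doi_uri("10.1234/dataset.v2"), "example-publisher"),
]


def links_about(uri, all_links):
    """Return every link in which the URI is the subject or the object."""
    return [link for link in all_links if uri in (link.subject, link.object)]
```

Keeping the provenance next to each triple lets a central service tell apart the same link asserted by different parties.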
Linking Data with Articles
Implementation – Discover Article/Data Links
DataCite Event Data (https://eventdata.datacite.org)
Collect, aggregate and make available article/data links from DataCite
metadata and other sources
Crossref Event Data (https://api.eventdata.crossref.org)
Collect and make available article/data links from Crossref metadata
and other sources
OpenAIRE Data/Literature Linking Service (http://dliservice.research-infrastructures.eu)
Collect and make available article/data links from a variety of sources
Linking Data with Articles
Implementation – Exchange Article/Data Links
Standard metadata for exchanging Article/Data Links
Joint Collaboration within RDA/WDS Data Publishing Services WG
(http://www.scholix.org/guidelines)
Link Exchange between Crossref and DataCite
Using the same open source software
(https://github.com/lagotto/lagotto) for their respective Event Data
services
Linking Data with Articles
Demo
Supplementary Information hosted in Data Repository
http://search.datacite.org/works/10.6084/M9.FIGSHARE.3427304
Five datasets from Cambridge Crystallographic Data Centre linked to the same article
http://search.datacite.org/works/10.1021/acs.cgd.6b00527
Software library described in Journal of Open Source Software
http://search.datacite.org/works/10.21105/joss.00026
PLOS articles linked with at least one DataCite DOI
http://search.datacite.org/data-centers/340
DataCite DOI -> Crossref DOI links exported from DataCite to Crossref
http://api.eventdata.crossref.org/works?source_id=datacite_crossref
In practical terms...
Real interoperability is much more than a framework:
• Compatible data models
• Metadata quality
• Development effort
• Coordination
During this first year, THOR has:
• Assessed how artefacts, contributors, organisations and
others are modelled
• Explored different implementations (ADS, Dryad… )
• Proposed approaches to overcome mismatches
Metadata compatibility - ORCID/DataCite
• Personal names (single and multiple fields)
<creators>
  <creator>
    <creatorName>Miller, Elizabeth</creatorName>
    <givenName>Elizabeth</givenName>
    <familyName>Miller</familyName>
    <nameIdentifier
        schemeURI="http://orcid.org/"
        nameIdentifierScheme="ORCID">0000-0001-5000-0007</nameIdentifier>
    <affiliation>DataCite</affiliation>
  </creator>
</creators>
<creators>
  <creator>
    <creatorName>Miller, Elizabeth</creatorName>
    <nameIdentifier
        schemeURI="http://orcid.org/"
        nameIdentifierScheme="ORCID">0000-0001-5000-0007</nameIdentifier>
    <affiliation>DataCite</affiliation>
  </creator>
</creators>
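The two creator records above differ only in whether the name is also split into separate fields. One possible way to read both shapes is sketched below. It assumes un-namespaced XML as shown on the slide (real DataCite records carry an XML namespace) and a "Family, Given" convention for creatorName; the function name is hypothetical.

```python
import xml.etree.ElementTree as ET

MULTI = """<creator>
  <creatorName>Miller, Elizabeth</creatorName>
  <givenName>Elizabeth</givenName>
  <familyName>Miller</familyName>
</creator>"""

SINGLE = """<creator>
  <creatorName>Miller, Elizabeth</creatorName>
</creator>"""


def name_parts(creator_xml):
    """Return (given, family), preferring the explicit fields when present."""
    creator = ET.fromstring(creator_xml)
    given = creator.findtext("givenName")
    family = creator.findtext("familyName")
    if given is not None and family is not None:
        return given.strip(), family.strip()
    full = (creator.findtext("creatorName") or "").strip()
    if "," in full:
        # Assumed convention: "Family, Given".
        family, given = full.split(",", 1)
        return given.strip(), family.strip()
    # No comma: the parts cannot be told apart, so leave the given name empty.
    return "", full
```

The fallback is a guess by design: a single-field name with no comma (an organisation, for example) cannot be split reliably.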
Metadata compatibility - ORCID/DataCite
• Contributor roles
Metadata compatibility - ORCID/DataCite
• Relation types
Metadata compatibility - ORCID/DataCite
• Lack of standards
• Low adoption
• Organisations:
• ISNI / Ringgold / Others
• Open standard?
• Funding, projects:
• Crossref’s Open Funder Registry
• Coverage and quality?
The results!
• ORCID Auto-Update:
Whenever a publication or a dataset
receives a DOI and its metadata
contains ORCID iDs, the ORCID
record of the author(s) can be
updated automatically!
• Authors receive a notification (inbox)
• They can configure:
• Accept updates automatically
• Level of privacy
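Auto-update can only work if the ORCID iDs in the DOI metadata are well formed. The last character of an ORCID iD is an ISO 7064 MOD 11-2 check character, so a depositor could screen iDs before submitting metadata. The sketch below shows that check; the function names are hypothetical and this is not ORCID's own code.

```python
def orcid_check_character(base_digits):
    """Compute the ISO 7064 MOD 11-2 check character for the first 15 digits."""
    total = 0
    for ch in base_digits:
        total = (total + int(ch)) * 2
    result = (12 - total % 11) % 11
    return "X" if result == 10 else str(result)


def is_valid_orcid(orcid_id):
    """Check length, digits and check character of an iD such as 0000-0003-1419-2405."""
    compact = orcid_id.replace("-", "")
    if len(compact) != 16:
        return False
    if not all(ch in "0123456789" for ch in compact[:15]):
        return False
    return orcid_check_character(compact[:15]) == compact[15].upper()
```

A passing check only shows that the iD is well formed, not that it belongs to the named author.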
The results!
• DataCite and Crossref Event Data:
The results!
• EThOS is the UK’s thesis service,
offering search and discovery of all
UK theses, and direct access to all
those that are digitally, openly available.
The results!
• PANGAEA archives, publishes and
distributes geo-referenced data about
climate variability, the marine
environment and geological research.
• PANGAEA attempts to resolve ORCID
iDs and annotate author names using a
heuristic algorithm
• Data citations from literature are rare!
• PANGAEA is keeping track of the link from
datasets back to articles (“reverse links”)
Linking Data and Contributors
Implementation – ORCID Search and Link
http://search.datacite.org/works?query=martin+fenner
Linking Data and Contributors
Implementation – ORCID Auto-Update
https://profiles.datacite.org/users/me
Linking Data and Contributors
Demo
Link Works via ORCID record
https://orcid.org/my-orcid
DataCite/ORCID Search and Link after authenticating with ORCID
https://profiles.datacite.org/users/me
Linking identifier types
There are LOTS of
identifier types
Attempting to work with
them all raises LOTS of
questions
Remember this?
It’s the crosslinks between EMBL-
EBI databases
Most of those databases use
different identifier types
There are 560 collections!
This can make things tricky
Linking identifier types
Case study - identifier types in the life sciences
ORCID currently supports 33 identifier types, such as DOIs.
These are part of a fixed vocabulary, with associated rules about
validation and how to resolve them.
Adding a new one can be difficult; adding 500 is really difficult.
We now know that this does not scale.
But to fully realise our mission, we need to be able to do it, and
so do others.
Linking identifier types
Case study - External identifiers at ORCID
Linking identifier types
Challenges
Resolution
Equivalence
Maintenance
Usability
Linking identifier types
Challenges - Resolution
Not all of them are resolvable
Ideally, they’d already be URIs, but that’s not the case.
Mandating URIs is problematic as it could exclude large parts of
the community with established practice
How do we turn the “foo” identifier with value “bar” into a URI so
that the identifier can be resolved?
Do we need a set of transformation rules?
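One possible shape for such transformation rules is a table of URI templates keyed by identifier type. The table below is a hypothetical sketch: the rule names and the to_uri function are assumptions, and the templates simply reuse resolvers that appear elsewhere in these slides (doi.org, orcid.org, identifiers.org).

```python
from urllib.parse import quote

# Hypothetical rule table: identifier type -> URI template.
RULES = {
    "doi": "https://doi.org/{id}",
    "orcid": "https://orcid.org/{id}",
    "pdb": "http://identifiers.org/pdb/{id}",
}


def to_uri(id_type, value):
    """Turn (type, value) into a URI, or None when no rule is known."""
    template = RULES.get(id_type.lower())
    if template is None:
        return None
    # Percent-encode anything unsafe in a URL path, but keep "/" for DOIs.
    return template.format(id=quote(value.strip(), safe="/"))
```

With this table, to_uri("foo", "bar") returns None: an identifier type without a rule stays unresolvable, which is exactly the gap the questions above describe.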
Linking identifier types
Challenges - Equivalence
Identifiers as URIs can introduce another
problem: some have more than one
representation, in more than one place
The Protein Data Bank identifier (PDB)
“3coj” can be resolved in lots of places:
• PDB Europe: http://www.ebi.ac.uk/pdbe/entry/pdb/3coj
• PDB Japan: http://pdbj.org/mine/summary/3coj
• RCSB Protein Data Bank:
http://www.rcsb.org/pdb/explore/explore.do?structureId=3coj
• Proteopedia: http://proteopedia.org/wiki/index.php/3coj
• PDBsum: https://www.ebi.ac.uk/thornton-srv/databases/cgi-bin/pdbsum/GetPage.pl?pdbcode=3coj
Linking identifier types
Challenges - Equivalence (2)
These URLs all point at the same conceptual entity. But for
systems that group entities by identifiers, this can be a problem.
How do we check for equivalence?
How do we transform the URI into an identifier?
Can we separate the location of things from their identifier?
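A sketch of one way to check equivalence: map each known URL shape back to a (type, identifier) pair and compare the pairs. The patterns cover only the five PDB locations listed on the previous slide and assume four-character alphanumeric PDB identifiers; the function names are hypothetical.

```python
import re

PDB_ID = r"([0-9A-Za-z]{4})"

# One pattern per location from the slide; each captures the PDB identifier.
PDB_PATTERNS = [
    r"https?://www\.ebi\.ac\.uk/pdbe/entry/pdb/" + PDB_ID,
    r"https?://pdbj\.org/mine/summary/" + PDB_ID,
    r"https?://www\.rcsb\.org/pdb/explore/explore\.do\?structureId=" + PDB_ID,
    r"https?://proteopedia\.org/wiki/index\.php/" + PDB_ID,
    r"https?://www\.ebi\.ac\.uk/thornton-srv/databases/cgi-bin/pdbsum/GetPage\.pl\?pdbcode=" + PDB_ID,
]


def normalise(url):
    """Return ("pdb", id) for a recognised URL, or None otherwise."""
    for pattern in PDB_PATTERNS:
        match = re.fullmatch(pattern, url.strip())
        if match:
            return ("pdb", match.group(1).lower())
    return None


def equivalent(url_a, url_b):
    """Two URLs are equivalent when both normalise to the same identifier."""
    key_a, key_b = normalise(url_a), normalise(url_b)
    return key_a is not None and key_a == key_b
```

Every new location needs another hand-written pattern, which is the maintenance burden that motivates a shared registry of such rules.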
Linking identifier types
Challenges - Maintenance
People may define the same thing in different ways.
For example, the display name, validation rules or resolution URIs
Working with multiple identifiers from multiple sources quickly
becomes difficult. It’s a jumbled pile of bilateral agreements.
Who owns the definition, who updates it, where is it kept?
How do we handle overlaps and conflicts?
How do we make the process hassle free and timely?
Linking identifier types
Challenges - Usability
Presenting a list of a thousand identifier types
to a user is bad.
Where do definitions and display names come
from, what about internationalisation etc?
Are users expected to know the URI of their
identifiers or the identifier itself?
Should systems be able to recognise and
transform between representations?
Linking identifier types
What are we doing to address these issues?
1: ORCID are working with EBI to integrate with
systems such as MIRIAM and identifiers.org
2: Refactoring the ORCID registry to streamline the
addition of identifier types
3: Investigating how ORCID might enable member
defined identifier types
The life sciences community
realised the issues and did
something about it. They
developed the MIRIAM registry.
It provides the data required to
transform local identifiers into
URIs, enabling resolution of
metadata and the data itself.
It decouples the identification of an
entity from its location on the Web.
Linking identifier types
Integration - identifiers in the life sciences
Identifiers.org is a service built on
top of the MIRIAM registry
It turns the URNs used by
MIRIAM into URLs for the web
It provides persistent resolvable
identifiers. The PDB identifier
“3coj” can be resolved at
http://identifiers.org/pdb/3coj
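The URN-to-URL step can be sketched as a string transformation. This assumes MIRIAM URNs of the form urn:miriam:<namespace>:<id>, with reserved characters in the id percent-encoded; the function name is hypothetical and the sketch does not call the identifiers.org service.

```python
from urllib.parse import unquote


def miriam_urn_to_url(urn):
    """Convert an assumed urn:miriam:<namespace>:<id> into an identifiers.org URL."""
    parts = urn.split(":", 3)
    if len(parts) != 4 or parts[0].lower() != "urn" or parts[1].lower() != "miriam":
        raise ValueError("not a MIRIAM URN: " + urn)
    namespace, local_id = parts[2], unquote(parts[3])
    if not namespace or not local_id:
        raise ValueError("missing namespace or identifier: " + urn)
    return "http://identifiers.org/" + namespace + "/" + local_id
```

For the example on this slide, urn:miriam:pdb:3coj maps to http://identifiers.org/pdb/3coj.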
Linking identifier types
Integration - identifiers in the life sciences
Image from: Identifiers.org and MIRIAM Registry: community resources to provide persistent identification,
http://doi.org/10.1093/nar/gkr1097
Linking identifier types
Integration - identifiers in the life sciences
ORCID will reference these services for life science identifiers,
but there are still unanswered questions, which may have
multiple correct answers.
Does ORCID work with “3coj”, the identifier of type PDB?
Or with “http://identifiers.org/pdb/3coj”, of the type identifiers.org?
Or is it some hybrid system that works with both?
THOR provides the platform to help answer these types of
questions.
Controlled vocabularies can, in fact, impede interoperability by
restricting links to specific systems. Yet we need to know what
is valid and what isn’t.
ORCID is moving to a system whereby the identifier vocabulary is
well understood and defined, yet not fixed, and is easily extensible
on demand.
Clients can query the current list of identifier types using the
public API. We will soon add the rules associated with them
https://pub.sandbox.orcid.org/v2.0_rc2/#!/Identifier_API/viewIdentifierTypes
Linking identifier types
Integration - ‘un’controlled vocabularies identifier types
The communities that use identifiers and the databases that
create them are the best places to define and maintain their
definitions
We’re investigating whether the ORCID registry could enable external
clients to define identifier types and the rules that go with them,
on-the-fly, for re-use by themselves and others.
We’re evaluating whether this will meet the needs of scholarly
communication, including EBI, CERN, Dryad, PANGAEA and
the communities they serve.
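To make the idea of client-defined identifier types concrete, here is a purely hypothetical in-memory sketch. It is not the ORCID registry or its API: the class names, the fields and the conflict rule (reject a different definition under an existing name) are all assumptions chosen for illustration.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class IdentifierType:
    name: str
    pattern: str       # validation rule, as a regular expression
    uri_template: str  # resolution rule, with an {id} placeholder
    owner: str         # who defined and maintains the type


class IdentifierTypeRegistry:
    def __init__(self):
        self._types = {}

    def register(self, id_type):
        """Add a type; re-registering an identical definition is a no-op."""
        key = id_type.name.lower()
        existing = self._types.get(key)
        if existing is not None and existing != id_type:
            raise ValueError("conflicting definition for " + id_type.name)
        self._types[key] = id_type

    def names(self):
        """List the currently known identifier type names."""
        return sorted(self._types)

    def resolve(self, name, value):
        """Return a URI for a valid value, or None if unknown or invalid."""
        id_type = self._types.get(name.lower())
        if id_type is None or not re.fullmatch(id_type.pattern, value):
            return None
        return id_type.uri_template.format(id=value)
```

The vocabulary here is defined but not fixed: any client can add a type, while the conflict check keeps two parties from silently redefining the same name.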
Linking identifier types
Integration - ‘un’controlled vocabularies identifier types
Some of the images in these slides were designed by
freepik.com
THOR is funded by the European Commission under call H2020-EINFRA-2014-2, project
number 654039

 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
Unit5-Cloud.pptx for lpu course cse121 o
Unit5-Cloud.pptx for lpu course cse121 oUnit5-Cloud.pptx for lpu course cse121 o
Unit5-Cloud.pptx for lpu course cse121 o
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 

THOR Workshop - Persistent Identifier Linking

  • 2. Tom Demeranville THOR Senior Project Officer and ORCID Software Engineer https://orcid.org/0000-0003-0902-4386 Martin Fenner DataCite Technical Director https://orcid.org/0000-0003-1419-2405 Laura Rueda DataCite Communications Director https://orcid.org/0000-0001-5952-7630
  • 3. Linking Data and Data Challenges How to cite data with right granularity? How to link data and contributors with right granularity? Datasets that are part of larger datasets or heterogeneous collections Multiple versions of the same dataset Dynamic data
  • 4. Linking Data and Data Challenges – Granularity of Data in ORCID Record http://search.datacite.org/contributors/0000-0002-8635-8390
  • 5. Linking Data and Data Challenges – Versioned Data in ORCID Record http://orcid.org/0000-0003-1419-2405
  • 6. Linking Data and Data Research – Granularity Attribution vs. Specificity Persistent identifiers for datasets need to support different levels of granularity Ideally this is done by multiple persistent identifiers linked via Has Part/Is Part Of relationship Collections will play an increasingly important role
  • 7. Linking Data and Data Research – Data Versioning Versioning of data is important for specificity and verifiability Practices and expectations for versioning of data vary widely between communities and data centers The data repository is ultimately responsible for decisions about versioning General recommendations can only include high-level best practices and common vocabulary
  • 8. Linking Data and Data Research – Cross-Linking of Databases
  • 9. Linking Data and Data Implementation – Cross-Linking of Databases Cross-linking between different databases is not conceptually different from article-data linking; implementation should follow the same principles (see next section)
  • 10. Linking Data and Data Implementation – Collections http://search.datacite.org/works/10.1594/PANGAEA.611088
  • 11. Linking Data and Data Demo Collection of climate data from ship logbooks http://search.datacite.org/works/10.1594/PANGAEA.611088 Dryad Datasets associated with a specific publication http://search.datacite.org/works/10.5061/DRYAD.9R161.1
  • 12. Linking Data and Articles Challenges Data underlying the findings described in a manuscript not always fully available Data underlying the findings described in a manuscript made available, but hidden in supplementary information and not easily findable Data underlying the findings described in a manuscript made available, but not properly linked to/from article
  • 13. Linking Data and Articles Implementation – Follow FAIR Data Principles From: http://slideshare.net/lshtm/preparing-data-for-sharing-the-fair-principles
  • 14. Linking Data and Articles Research - Conceptual Model Linkage as Triples. In the form subject-predicate-object, consistent with the Resource Description Framework (RDF) data model. Describing the relation. Additional information such as relation type (e.g. A is new version of B) and provenance. Persistent Identifiers as HTTP URIs. This makes them actionable, and compatible with the RDF data model. Centralized infrastructure for persistent identifier linking. Provided for example by ORCID and DataCite, facilitating discovery.
  • 15. Linking Data with Articles Implementation – Discover Article/Data Links DataCite Event Data (https://eventdata.datacite.org) Collect, aggregate and make available article/data links from DataCite metadata and other sources Crossref Event Data (https://api.eventdata.crossref.org) Collect and make available article/data links from Crossref metadata and other sources OpenAIRE Data/Literature Linking Service (http://dliservice.research-infrastructures.eu) Collect and make available article/data links from a variety of sources
  • 16. Linking Data with Articles Implementation – Exchange Article/Data Links Standard metadata for exchanging Article/Data Links Joint Collaboration within RDA/WDS Data Publishing Services WG (http://www.scholix.org/guidelines) Link Exchange between Crossref and DataCite Using the same open source software (https://github.com/lagotto/lagotto) for their respective Event Data services
  • 17. Linking Data with Articles Demo Supplementary Information hosted in Data Repository http://search.datacite.org/works/10.6084/M9.FIGSHARE.3427304 Five datasets from Cambridge Crystallographic Data Centre linked to the same article http://search.datacite.org/works/10.1021/acs.cgd.6b00527 Software library described in Journal of Open Source Software http://search.datacite.org/works/10.21105/joss.00026 PLOS articles linked with at least one DataCite DOI http://search.datacite.org/data-centers/340 DataCite DOI -> Crossref DOI links exported from DataCite to Crossref http://api.eventdata.crossref.org/works?source_id=datacite_crossref
  • 18. In practical terms... Real interoperability is much more than a framework: • Compatible data models • Metadata quality • Development effort • Coordination During this first year, THOR has: • Assessed how artefacts, contributors, organisations and others are modelled • Explored different implementations (ADS, Dryad… ) • Proposed approaches to overcome mismatches
  • 19. Metadata compatibility - ORCID/DataCite • Personal names (single and multiple fields)
  • 20. Metadata compatibility - ORCID/DataCite • Personal names (single and multiple fields)
      <creators>
        <creator>
          <creatorName>Miller, Elizabeth</creatorName>
          <givenName>Elizabeth</givenName>
          <familyName>Miller</familyName>
          <nameIdentifier schemeURI="http://orcid.org/" nameIdentifierScheme="ORCID">0000-0001-5000-0007</nameIdentifier>
          <affiliation>DataCite</affiliation>
        </creator>
      </creators>
      <creators>
        <creator>
          <creatorName>Miller, Elizabeth</creatorName>
          <nameIdentifier schemeURI="http://orcid.org/" nameIdentifierScheme="ORCID">0000-0001-5000-0007</nameIdentifier>
          <affiliation>DataCite</affiliation>
        </creator>
      </creators>
  • 21. Metadata compatibility - ORCID/DataCite • Contributor roles
  • 22. Metadata compatibility - ORCID/DataCite • Relation types
  • 23. Metadata compatibility - ORCID/DataCite • Lack of standards • Low adoption • Organisations: • ISNI / Ringgold / Others • Open standard? • Funding, projects: • Crossref’s Open Funder Registry • Coverage and quality?
  • 24. The results! • ORCID Auto-Update: Whenever a publication or a dataset receives a DOI and its metadata contains ORCID iDs, the ORCID record of the author(s) can be updated automatically! • Authors receive a notification (inbox) • They can configure: • Accept updates automatically • Level of privacy
  • 25. The results! • DataCite and Crossref Event Data:
  • 26. The results! • EThOS is the UK’s thesis service, offering search and discovery of all UK theses, and direct access to all those that are digitally, openly available.
  • 27. The results! • PANGAEA archives, publishes and distributes geo-referenced data about climate variability, the marine environment and geological research. • PANGAEA attempts to resolve ORCID iDs and annotate author names using a heuristic algorithm • Data citations from literature are rare! • PANGAEA is keeping track of the link from datasets back to articles (“reverse links”)
  • 28. Linking Data and Contributors Implementation – ORCID Search and Link http://search.datacite.org/works?query=martin+fenner
  • 29. Linking Data and Contributors Implementation – ORCID Auto-Update https://profiles.datacite.org/users/me
  • 30. Linking Data and Contributors Demo Link Works via ORCID record https://orcid.org/my-orcid DataCite/ORCID Search and Link after authenticating with ORCID https://profiles.datacite.org/users/me
  • 31. Linking identifier types There are a LOT of identifier types Attempting to work with them all raises LOTS of questions
  • 32. Linking identifier types Case study - identifier types in the life sciences Remember this? It’s the crosslinks between EMBL-EBI databases Most of those databases use different identifier types There are 560 collections! This can make things tricky
  • 33. Linking identifier types Case study - External identifiers at ORCID ORCID currently supports 33 identifier types, such as DOIs. These are part of a fixed vocabulary, with associated rules about validation and how to resolve them. Adding a new one can be difficult, adding 500 is really difficult. We now know that this does not scale. But to fully realise our mission, we need to be able to do it, and so do others.
  • 35. Linking identifier types Challenges - Resolution Not all of them are resolvable Ideally, they’d already be URIs, but that’s not the case. Mandating URIs is problematic as it could exclude large parts of the community with established practice How do we turn the “foo” identifier with value “bar” into a URI so that the identifier can be resolved? Do we need a set of transformation rules?
  • 36. Linking identifier types Challenges - Equivalence Identifiers as URIs can introduce another problem: some have more than one representation, in more than one place The Protein Data Bank identifier (PDB) “3coj” can be resolved in lots of places: • PDB Europe: http://www.ebi.ac.uk/pdbe/entry/pdb/3coj • PDB Japan: http://pdbj.org/mine/summary/3coj • RCSB Protein Data Bank: http://www.rcsb.org/pdb/explore/explore.do?structureId=3coj • Proteopedia: http://proteopedia.org/wiki/index.php/3coj • PDBsum: https://www.ebi.ac.uk/thornton-srv/databases/cgi-bin/pdbsum/GetPage.pl?pdbcode=3coj
  • 37. Linking identifier types Challenges - Equivalence (2) These URLs all point at the same conceptual entity. But for systems that group entities by identifiers, this can be a problem. How do we check for equivalence? How do we transform the URI into an identifier? Can we separate the location of things from their identifier?
  • 38. Linking identifier types Challenges - Maintenance People may define the same thing in different ways, for example the display name, validation rules or resolution URIs. Working with multiple identifiers from multiple sources quickly becomes difficult. It’s a jumbled pile of bilateral agreements. Who owns the definition, who updates it, where is it kept? How do we handle overlaps and conflicts? How do we make the process hassle free and timely?
  • 39. Linking identifier types Challenges - Usability Presenting a list of a thousand identifier types to a user is bad. Where do definitions and display names come from, what about internationalisation etc? Are users expected to know the URI of their identifiers or the identifier itself? Should systems be able to recognise and transform between representations?
  • 40. Linking identifier types What are we doing to address these issues? 1: ORCID are working with EBI to integrate with systems such as MIRIAM and identifiers.org 2: Refactoring the ORCID registry to streamline the addition of identifier types 3: Investigating how ORCID might enable member defined identifier types
  • 41. Linking identifier types Integration - identifiers in the life sciences The life sciences community realised the issues and did something about it. They developed the MIRIAM registry. It provides the data required to transform local identifiers into URIs, enabling resolution of metadata and the data itself. Decouples the identification of an entity from its location on the Web.
  • 42. Linking identifier types Integration - identifiers in the life sciences Identifiers.org is a service built on top of the MIRIAM registry It turns the URNs used by MIRIAM into URLs for the web It provides persistent resolvable identifiers. The PDB identifier “3coj” can be resolved at http://identifiers.org/pdb/3coj Image from: Identifiers.org and MIRIAM Registry: community resources to provide persistent identification, http://doi.org/10.1093/nar/gkr1097
  • 43. Linking identifier types Integration - identifiers in the life sciences ORCID will reference these services for life science identifiers, but there are still unanswered questions, which may have multiple correct answers. Does ORCID work with the “3coj” the identifier of type PDB? or the “http://identifiers.org/pdb/3coj” of the type identifiers.org? or is it some hybrid system that works with both? THOR provides the platform to help answer these types of questions.
  • 44. Linking identifier types Integration - ‘un’controlled vocabularies identifier types Controlled vocabularies can, in fact, impede interoperability by restricting links to specific systems. Yet we need to know what is valid and what isn’t. ORCID is moving to a system whereby the identifier vocabulary is well understood and defined, yet not fixed and easily extensible in an on-demand manner. Clients can query the current list of identifier types using the public API. We will soon add the rules associated with them https://pub.sandbox.orcid.org/v2.0_rc2/#!/Identifier_API/viewIdentifierTypes
  • 45. Linking identifier types Integration - ‘un’controlled vocabularies identifier types The communities that use identifiers and the databases that create them are the best places to define and maintain their definitions. We’re investigating whether the ORCID registry could enable external clients to define identifier types and the rules that go with them, on-the-fly, for re-use by themselves and others. We’re evaluating whether this will meet the needs of scholarly communication including EBI, CERN, DRYAD, PANGAEA and the communities they serve.
  • 46. Some of the images in these slides were designed by freepik.com THOR is funded by the European Commission under call H2020-EINFRA-2014-2, project number 654039
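
The conceptual model on slide 14 (links as subject-predicate-object triples, with relation type, provenance and identifiers expressed as HTTP URIs) can be sketched roughly as follows. This is only an illustration, not the ORCID or DataCite implementation: the `Link` type, the `doi_uri` and `invert` helpers, the example DOIs and the source name are all hypothetical. The relation names are borrowed from the DataCite relationType vocabulary, and lowercasing the DOI is an assumed normalisation.

```python
from typing import NamedTuple

# Inverse pairs of relation types (names borrowed from the DataCite
# relationType vocabulary; the table itself is an illustrative assumption).
INVERSE = {
    "HasPart": "IsPartOf",
    "IsPartOf": "HasPart",
    "IsNewVersionOf": "IsPreviousVersionOf",
    "IsPreviousVersionOf": "IsNewVersionOf",
}


class Link(NamedTuple):
    """One link as a triple plus provenance (hypothetical structure)."""
    subject: str    # persistent identifier as an HTTP URI
    predicate: str  # relation type
    obj: str        # persistent identifier as an HTTP URI
    source: str     # provenance: who asserted the link


def doi_uri(doi: str) -> str:
    """Express a DOI as an actionable HTTP URI (lowercased, as an assumption)."""
    return "https://doi.org/" + doi.lower()


def invert(link: Link) -> Link:
    """Return the same link stated from the other direction."""
    return Link(link.obj, INVERSE[link.predicate], link.subject, link.source)


# Hypothetical example: a dataset that is part of a larger collection.
collection = doi_uri("10.1234/EXAMPLE.COLLECTION")
part = doi_uri("10.1234/EXAMPLE.PART1")
link = Link(part, "IsPartOf", collection, "hypothetical-repository")
print(invert(link))
```

Storing each link once and deriving the inverse on demand keeps the Has Part/Is Part Of pair from slide 6 consistent without both repositories having to assert it.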

Editor's notes

  1. For example: a user might wish to claim some sequencing data within their ORCID record (or the database might want to add it for them). How can this happen if they do not have a known identifier? Having other-id:218751258217 in a record doesn’t help anyone.
  2. We could treat PDB Europe identifiers as being conceptually as different from PDB Japan as DOIs are from Handles. This benefits from simplicity but effectively ignores the problem, providing no way of associating the two identifiers. Reverse lookup is required, the ability to query these resolving services in the reverse direction, such that, for example, a query for http://www.ebi.ac.uk/pdbe/entry/pdb/3coj points to a common ‘umbrella’/’collection’ identifier such as http://identifiers.org/pdb/3coj.
  3. (but is that now the identifier? A meta identifier)
  4. (yes, it does matter!) (compounded by the fact that some identifiers in identifiers.org already exist within ORCID, e.g. PMC identifiers. ARGH!)
  5. This will enable new identifier types and their associated metadata (for example multi-language descriptions) to be added to the registry in response to community needs.
  6. E.g. IGSN. Once we've done the evaluation, we will put it on the dev roadmap.
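
The transformation rules asked about on slide 35 and the reverse lookup described in note 2 might look something like the sketch below. It is not how MIRIAM, identifiers.org or ORCID actually work: the rule table, the regular expressions and the function names are assumptions made for illustration, built only from the PDB URLs listed on slide 36 and the identifiers.org URI on slide 42.

```python
import re
from typing import Optional, Tuple

# Hypothetical rule table: identifier type -> canonical URI template.
CANONICAL = {"pdb": "http://identifiers.org/pdb/{id}"}

# Hypothetical reverse-lookup rules: provider URL pattern -> identifier type.
PROVIDER_PATTERNS = [
    ("pdb", re.compile(r"^https?://www\.ebi\.ac\.uk/pdbe/entry/pdb/(?P<id>[0-9A-Za-z]{4})$")),
    ("pdb", re.compile(r"^https?://pdbj\.org/mine/summary/(?P<id>[0-9A-Za-z]{4})$")),
    ("pdb", re.compile(r"^https?://www\.rcsb\.org/pdb/explore/explore\.do\?structureId=(?P<id>[0-9A-Za-z]{4})$")),
    ("pdb", re.compile(r"^https?://identifiers\.org/pdb/(?P<id>[0-9A-Za-z]{4})$")),
]


def to_uri(id_type: str, value: str) -> str:
    """Turn an identifier of a known type into its canonical URI."""
    return CANONICAL[id_type.lower()].format(id=value.lower())


def from_url(url: str) -> Optional[Tuple[str, str]]:
    """Reverse lookup: recover (type, value) from a provider URL, if known."""
    for id_type, pattern in PROVIDER_PATTERNS:
        match = pattern.match(url)
        if match:
            return id_type, match.group("id").lower()
    return None


def equivalent(url_a: str, url_b: str) -> bool:
    """Two URLs are equivalent if they resolve to the same (type, value)."""
    id_a, id_b = from_url(url_a), from_url(url_b)
    return id_a is not None and id_a == id_b


pdbe = "http://www.ebi.ac.uk/pdbe/entry/pdb/3coj"
pdbj = "http://pdbj.org/mine/summary/3coj"
print(to_uri("PDB", "3coj"), from_url(pdbe), equivalent(pdbe, pdbj))
```

Keeping the (type, value) pair as the internal key, and treating every URL as one of several locations for it, is one way to separate the identity of a thing from where it is served, which is the question slide 37 raises.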