This project is funded from the EU Horizon 2020 Research and Innovation Programme (2014-2020) under Grant Agreement No. 823782
External controlled vocabularies support
in Dataverse
Slava Tykhonov
Senior Information Scientist (DANS-KNAW)
Ideas Challenge Meeting | eifl
April 8, 2021
About me: DANS-KNAW projects (2016-2021)
● CLARIAH+ (ongoing)
● EOSC Synergy (ongoing)
● SSHOC Dataverse (ongoing)
● CESSDA DataverseEU 2018
● Time Machine Europe Supervisor at DANS-KNAW
● PARTHENOS Horizon 2020
● CESSDA PID (Persistent Identifiers) Horizon 2020
● CLARIAH
● RDA (Research Data Alliance) PITTS Horizon 2020
● CESSDA SaW H2020-EU.1.4.1.1 Horizon 2020
What is Dataverse?
● Open source project developed by IQSS of Harvard University
● Great product with a very long history (since 2006), created by an experienced
Agile development team
● Clear vision and understanding of research communities' requirements, public
roadmap
● Well-developed architecture with rich APIs makes it possible to build application layers
around Dataverse
● The strong community behind Dataverse helps to improve the basic
functionality and develop it further.
● DANS-KNAW is leading the SSHOC WP5.2 task to deliver a production-ready
Dataverse repository for the CESSDA, CLARIN and DARIAH communities
DANS Data Stations - Future Data Services
Dataverse is an API-based data platform and a key framework for Open Innovation!
DANS-KNAW is one of the worldwide leaders in FAIR Data (FAIRsFAIR)
FAIR and Dataverse
Source:
Mercè Crosas,
“FAIR principles and beyond:
implementation in Dataverse”
Out of the box CV support in Dataverse (1)
Source: Dataverse Metadata Schema
Out of the box CV support in Dataverse (2)
Internal vocabularies are stored in Dataverse; we need more CVs!
The importance of standards and ontologies
Generic controlled vocabularies for linking metadata in bibliographic collections are well
known: ORCID, GRID, GeoNames, Getty.
Medical knowledge graphs are powered by:
● Biological Expression Language (BEL)
● Medical Subject Headings (MeSH®) by the U.S. National Library of Medicine (NIH)
● Wikidata (open ontology) - Wikipedia
Integration is based on metadata standards:
● MARC21, Dublin Core (DC), Data Documentation Initiative (DDI)
Most prominent ontologies are already available as web services with API
endpoints.
Simple Knowledge Organization System (SKOS)
SKOS models thesaurus-like resources:
- skos:Concepts have preferred labels and alternative labels (synonyms) attached to them
(skos:prefLabel, skos:altLabel).
- skos:Concepts can be related to each other with the skos:broader, skos:narrower and skos:related properties.
- terms and concepts can have more than one broader term or concept.
SKOS makes it possible to create a semantic layer on top of objects: a network of statements and
relationships.
A major difference between SKOS and formal ontologies is the logical “is-a” hierarchy: in thesauri the hierarchical relation can
represent anything from “is-a” to “part-of”.
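The SKOS building blocks listed above can be sketched in plain Python. This is only an illustration under assumptions: the class, the helper and the example.org concepts are hypothetical, and a real system would use an RDF library rather than dataclasses.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    """A minimal stand-in for skos:Concept (illustrative, not an RDF model)."""
    uri: str
    pref_label: str                                  # skos:prefLabel
    alt_labels: list = field(default_factory=list)   # skos:altLabel (synonyms)
    broader: list = field(default_factory=list)      # URIs of skos:broader concepts
    related: list = field(default_factory=list)      # URIs of skos:related concepts


def narrower_of(uri, concepts):
    """Derive skos:narrower as the inverse of skos:broader."""
    return sorted(c.uri for c in concepts if uri in c.broader)


# Hypothetical concepts: "wheel" has two broader concepts,
# one under a part-of reading and one under an is-a reading.
concepts = [
    Concept("http://example.org/car", "car", alt_labels=["automobile"]),
    Concept("http://example.org/round-object", "round object"),
    Concept("http://example.org/wheel", "wheel",
            broader=["http://example.org/car",
                     "http://example.org/round-object"]),
]
```

Nothing in the structure says whether a broader link means "is-a" or "part-of", which is exactly the looseness of thesaurus hierarchies described above.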
Global Research Identifier Database (GRID) in SKOS
We already have a lot of data in
the global Dataverse network.
Can we provide depositors with a
convenient web interface to link
their metadata to external
controlled vocabularies?
Is it possible to disambiguate
concepts and create links
automatically?
SKOSMOS framework to discover ontologies
● SKOSMOS is developed in
Europe by the National Library
of Finland (NLF)
● active global user community
● search and browsing interface
for SKOS concepts
● multilingual vocabularies
support
● used for different use cases
(publish vocabularies, build
discovery systems, vocabulary
visualization)
SKOSMOS API specification in Swagger
Source: Finto API
SKOSMOS API example for GRID ontology
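A request of this kind could be sketched as follows, assuming the SKOSMOS REST `search` method with its `query`, `vocab` and `lang` parameters. The server address, the vocabulary id `grid` and the sample response are hypothetical placeholders, not values taken from the slide; the response is hand-written and only mimics the `results` list that SKOSMOS returns.

```python
import json
from urllib.parse import urlencode


def build_search_url(api_base, query, vocab, lang="en"):
    """Build a SKOSMOS REST 'search' URL restricted to one vocabulary."""
    params = urlencode({"query": query, "vocab": vocab, "lang": lang})
    return api_base.rstrip("/") + "/search?" + params


def extract_matches(response_text):
    """Return (prefLabel, uri) pairs from a SKOSMOS-style search response."""
    data = json.loads(response_text)
    return [(hit["prefLabel"], hit["uri"]) for hit in data.get("results", [])]


# Hypothetical server and vocabulary id (assumptions, not from the slide).
url = build_search_url("https://skosmos.example.org/rest/v1/", "university", "grid")

# Hand-written placeholder response, not real GRID data.
sample = ('{"results": [{"uri": "http://example.org/grid/0001", '
          '"prefLabel": "Example University"}]}')
matches = extract_matches(sample)
```

Keeping URL construction separate from response parsing lets the parsing be checked without network access.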
Semantic Gateway as plugin app (in development)
Source: Dataverse gateway
Dataverse deposit form with selected CVs
Every field could be linked to controlled vocabularies in a FAIR way!
One metadata field linked to many ontologies
The language switch in Dataverse changes the language of the suggested terms!
Improved metadata schema with 4 child fields
Vocabulary and Term are selected by the user; Vocabulary URL and Term URL are filled in automatically.
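The four child fields can be pictured as one compound value. The sketch below is an assumption-laden illustration: the class name, the attribute names and the `lookup` mapping are hypothetical and do not reproduce the actual Dataverse metadata block.

```python
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class CvocValue:
    """One compound value with the four child fields named on the slide."""
    vocabulary: str       # chosen by the user
    term: str             # chosen by the user
    vocabulary_url: str   # filled in automatically
    term_url: str         # filled in automatically


def from_selection(vocabulary, term, lookup):
    """Complete the two URL fields from a (vocabulary, term) selection.

    `lookup` is a hypothetical mapping standing in for the vocabulary service.
    """
    vocabulary_url, term_url = lookup[(vocabulary, term)]
    return CvocValue(vocabulary, term, vocabulary_url, term_url)


# Placeholder data using example.org addresses.
lookup = {("demo", "archives"): ("http://example.org/demo/",
                                 "http://example.org/demo/c1")}
value = from_selection("demo", "archives", lookup)
```

A single selection determines all four fields, which is why the depositor only needs to touch two of them.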
Configuration for external controlled vocabularies
Pull Request to Dataverse core https://github.com/IQSS/dataverse/pull/7712
JavaScript interface
The CV interface is implemented in JavaScript
and placed outside of the Dataverse
application.
Internal:
"js-url": "/resources/js/cvoc-interface.js"
External:
"js-url": "https://raw.githubusercontent.com/Dans-labs/semantic-gateway/main/static/js/interface.js"
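The two `js-url` forms can be told apart with a small check. This is a sketch only: the `classify_js_url` helper is hypothetical, and accepting only `https` for external scripts is an assumption of this example, not a rule stated in the slides or in Dataverse.

```python
from urllib.parse import urlparse


def classify_js_url(entry):
    """Say whether a config entry loads the interface script locally or remotely."""
    parsed = urlparse(entry["js-url"])
    if not parsed.scheme and parsed.path.startswith("/"):
        return "internal"   # path served by the Dataverse application itself
    if parsed.scheme == "https" and parsed.netloc:
        return "external"   # script hosted outside of Dataverse
    raise ValueError("unsupported js-url: " + entry["js-url"])


internal = {"js-url": "/resources/js/cvoc-interface.js"}
external = {"js-url": "https://raw.githubusercontent.com/Dans-labs/"
                      "semantic-gateway/main/static/js/interface.js"}
```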
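SKOSMOS Python module (SKOSMOS-Client)
from skosmos_client import SkosmosClient
# then you can create your own client
skosmos = SkosmosClient(api_base='http://api.finto.fi/rest/v1/')
# list the vocabularies the server offers, with English titles
print("Finding the available vocabularies:")
for vocab in skosmos.vocabularies(lang='en'):
    print("Vocabulary id:", vocab['id'], "title:", vocab['title'])
Output:
Finding the available vocabularies:
Vocabulary id: afo title: AFO - Natural resource and environment ontology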
Vocabulary id: allars title: Allärs - General thesaurus in Swedish
Vocabulary id: cn title: Finnish Corporate Names
Vocabulary id: ic title: Iconclass
...
BARTOC in SKOSMOS
Source: http://bartoc-skosmos.unibas.ch/en/
Other SKOSMOS-supported services
● Finto (Finnish thesaurus and ontology service)
● CESSDA CV Service has implemented a SKOSMOS interface
● CESSDA ELSST (European Language Social Science Thesaurus)
● ACDH Vocabularies (Austrian Academy of Sciences)
● Thesaurus INRAE (Paris, France)
● AGROVOC Multilingual Thesaurus (United Nations)
● UNESCO Thesaurus
● European Space Agency ESA
NDE (Netwerk Digitaal Erfgoed) is working with DANS on (partial)
support of the SKOSMOS protocol to get a proper external CV connection to
the DANS Data Stations.
Use case: COVID-19 expert questions
Source: Epidemic Questions Answering
“In response to the COVID-19 pandemic, the Epidemic Question Answering (EPIC-QA) track challenges teams to
develop systems capable of automatically answering ad-hoc questions about the disease COVID-19, its causal virus
SARS-CoV-2, related corona viruses, and the recommended response to the pandemic. While COVID-19 has been an
impetus for a large body of emergent scientific research and inquiry, the response to COVID-19 raises questions for
consumers.”
COVID-19 questions in SKOSMOS framework
Source: COVID-19 questions in SKOSMOS
COVID-19 questions in Dataverse metadata
Source: COVID-19 European data hub in Harvard Dataverse
● COVID-19 ontologies can be hosted by
the SKOSMOS framework
● Researchers can enrich metadata by
adding standardized questions
provided by SKOSMOS ontologies
● rich metadata is exported back to the Linked
Open Data Cloud to increase its chance
of being found
● enriched metadata can be used for
further training of ML models
Known issues with external controlled vocabularies
● how CV support could be applied to any field
● support of any available vocabulary
● backward compatibility with fields from the old metadata schema
● clean UI experience (one selection can fill 4 child fields)
● whether non-managed vocabularies or free-text values can be used in the same
field
● concept drift (the change of meaning of concepts)
● interoperability across all Dataverse instances
● how to ensure CVs come from authoritative services
...
Issue: how could CV support be applied to any field?
Problem: the implementation would support the keyword field (with the addition of one child field) and any new or existing fields built
to have the 4 required child fields, for example subject, funder ID, grant ID, etc.
Possible solution: changes could be applied to existing text fields without modifying the metadata block
by adding concept URIs. However, this requires changes on the CV interface side.
Issue: support of any available vocabulary
Problem: currently the implementation is specific to the SKOSMOS protocol,
which handles many vocabularies.
Solution: the interface to external API endpoints with vocabularies has been
placed outside of Dataverse as external JavaScript and could be extended
to support any API.
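The idea of extending the interface to any API can be illustrated with a small registry of interchangeable backends. The presentation places this logic in external JavaScript; the Python below is only a sketch of the design, and every name in it (`VocabularyBackend`, `InMemoryBackend`, `register`, `suggest`) is hypothetical.

```python
from abc import ABC, abstractmethod


class VocabularyBackend(ABC):
    """Hypothetical common interface for any vocabulary API."""

    @abstractmethod
    def suggest(self, text):
        """Return a list of (label, uri) pairs matching the typed text."""


class InMemoryBackend(VocabularyBackend):
    """Stand-in backend; a real one would call SKOSMOS or another API."""

    def __init__(self, terms):
        self.terms = terms  # {label: uri}

    def suggest(self, text):
        needle = text.lower()
        return sorted((label, uri) for label, uri in self.terms.items()
                      if needle in label.lower())


BACKENDS = {}  # protocol name -> backend instance


def register(protocol, backend):
    """Make a backend available under a protocol name."""
    BACKENDS[protocol] = backend


def suggest(protocol, text):
    """Dispatch a lookup to whichever backend handles the protocol."""
    return BACKENDS[protocol].suggest(text)


register("demo", InMemoryBackend({"Archives": "http://example.org/c1",
                                  "Archaeology": "http://example.org/c2"}))
```

Supporting a new vocabulary service then means adding one backend class, while the calling code stays the same.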
Issue: Backward Compatibility
Problem: our implementation of external controlled vocabulary support requires 4
child fields instead of 3 (the default for Dataverse).
Possible solution: create a Flyway script to adapt existing field entries if the metadata
schema is extended with a new 4th field that holds the concept URIs. This 4th field
can’t be null.
Source: Harvard Dataverse
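The migration logic can be sketched in Python, leaving aside the actual Flyway SQL. Everything here is an assumption for illustration: the dictionary keys standing in for the three existing child fields, the `term_url` name for the 4th field, and the `resolve` mapping from (vocabulary, term) to a concept URI.

```python
def add_term_uri(entry, resolve):
    """Return a 4-field copy of a 3-field entry, or None if no URI is known."""
    uri = resolve.get((entry["vocabulary"], entry["term"]))
    if uri is None:
        return None  # the 4th field must not be null, so leave it for review
    return {**entry, "term_url": uri}


def migrate(entries, resolve):
    """Split entries into migrated ones and ones that need manual attention."""
    migrated, unresolved = [], []
    for entry in entries:
        new = add_term_uri(entry, resolve)
        if new is None:
            unresolved.append(entry)
        else:
            migrated.append(new)
    return migrated, unresolved


# Placeholder entries with hypothetical keys and example.org addresses.
entries = [
    {"vocabulary": "demo", "term": "archives",
     "vocabulary_url": "http://example.org/demo/"},
    {"vocabulary": "demo", "term": "unknown term",
     "vocabulary_url": "http://example.org/demo/"},
]
resolve = {("demo", "archives"): "http://example.org/demo/c1"}
migrated, unresolved = migrate(entries, resolve)
```

Because the new field can't be null, entries without a resolvable URI are set aside instead of being written with an empty value.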
Issue: clean User Interface experience
Problem: the display retains the 4 fields even though one selection determines
all 4. Could the other fields be hidden or disabled? With SKOSMOS-served vocabularies,
some child fields will be filled automatically.
Possible solution: use a more flexible configuration to define 2 child fields
(label/URI) instead of 4 where possible. If 4 fields are unavoidable, make 3 of them read-only and
managed by Dataverse, not by the user.
Issue: non-managed vocabularies or free-text values
Problem: can a user mix non-managed controlled vocabularies or free-text
values in the same field?
Possible solution: the input could allow disabling the selector with some
‘manual’ mode. If a user wants to store a term that doesn’t match any
entry in the CV, it could be stored as text. However, this is not a
sustainable solution: how would free-text terms be synced with external CVs?
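The ‘manual’ mode described above might look like the following sketch. The function, the returned keys and the `managed` flag are hypothetical; the flag is one possible way to keep free-text values identifiable so they could be reconciled with a CV later.

```python
def store_term(text, vocabulary_terms, allow_free_text=True):
    """Link the term to a concept URI when the CV has it, else keep plain text.

    `vocabulary_terms` is a hypothetical {lowercased label: uri} mapping.
    """
    term = text.strip()
    uri = vocabulary_terms.get(term.lower())
    if uri is not None:
        return {"term": term, "term_url": uri, "managed": True}
    if allow_free_text:
        # 'manual' mode: stored as text and flagged as unmanaged
        return {"term": term, "term_url": None, "managed": False}
    raise ValueError("term not found in the controlled vocabulary: " + term)


cv = {"archives": "http://example.org/demo/c1"}
linked = store_term(" Archives ", cv)
free = store_term("my own keyword", cv)
```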
Issue: how to ensure a CV is from an authoritative service?
Problem: since the service URL is part of the config, it could be configured to
use other services (a locally managed one, a mirror, etc.)
Possible solution: the admin is responsible for using an authoritative source.
However, we don’t know how to control this in a distributed network.
This could become a serious issue if a service moves from one
provider to another; mirrors should be considered here.
Issue: concept drift
Concept drift covers the cases where a concept takes over the meaning of
other concepts, or other concepts take over its meaning. It can lead to
data quality problems that are very difficult to trace and address.
Possible scenarios of concept drift:
- at the concept identifier level (label drift)
- in the basic properties of the concept (intensional drift)
- in the things the concept refers to (extensional drift)
Source: Detecting and Reporting Extensional Concept Drift in Statistical Linked Data
Possible solution: create and maintain a cache of every concept inside the data
repository
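A concept cache of this kind could be sketched as below. This is a minimal illustration with hypothetical names; it only detects label drift, the simplest of the three scenarios, by comparing the label stored at deposit time with the label the vocabulary service reports now.

```python
class ConceptCache:
    """Keeps the label each concept URI had when it was first stored."""

    def __init__(self):
        self._labels = {}  # uri -> label seen at deposit time

    def remember(self, uri, label):
        """Store the label once; later calls do not overwrite it."""
        self._labels.setdefault(uri, label)

    def label_drift(self, uri, current_label):
        """Return (cached, current) if the label changed, else None."""
        cached = self._labels.get(uri)
        if cached is not None and cached != current_label:
            return (cached, current_label)
        return None


cache = ConceptCache()
cache.remember("http://example.org/demo/c1", "archives")
# A later deposit must not overwrite the original label.
cache.remember("http://example.org/demo/c1", "records offices")
```

Intensional and extensional drift would need more than a label comparison, for example cached definitions or the set of datasets linked to each concept.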
Issue: interoperability across all Dataverse instances
Problem: this implementation requires the same configuration to import data
and metadata from another Dataverse instance. If it is not configured, the term shows as
4 child fields.
Possible solution: terms from unsupported (undefined) vocabularies would just
show as their URLs in another instance.
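The proposed fallback amounts to a small display rule, sketched here with hypothetical field names and a hypothetical set of vocabularies configured on the importing instance.

```python
def display_value(value, configured_vocabularies):
    """Show the term label when the vocabulary is configured, else the term URL."""
    if value["vocabulary"] in configured_vocabularies:
        return value["term"]
    return value["term_url"]


value = {"vocabulary": "demo", "term": "archives",
         "term_url": "http://example.org/demo/c1"}
```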
Linking data (files) to external CVs, not only metadata
Source: Scholars Portal’s Data Curation Tool (Canada)
Thank you for your attention!
Slava Tykhonov (DANS-KNAW)
vyacheslav.tykhonov@dans.knaw.nl
https://www.sshopencloud.eu
info@sshopencloud.eu
@SSHOpenCloud
/in/sshopencloud
Join our community

Más contenido relacionado

La actualidad más candente

Metaverse for Dataverse
Metaverse for DataverseMetaverse for Dataverse
Metaverse for Dataversevty
 
Ontologies, controlled vocabularies and Dataverse
Ontologies, controlled vocabularies and DataverseOntologies, controlled vocabularies and Dataverse
Ontologies, controlled vocabularies and Dataversevty
 
Automated CI/CD testing, installation and deployment of Dataverse infrastruct...
Automated CI/CD testing, installation and deployment of Dataverse infrastruct...Automated CI/CD testing, installation and deployment of Dataverse infrastruct...
Automated CI/CD testing, installation and deployment of Dataverse infrastruct...vty
 
Fighting COVID-19 with Artificial Intelligence
Fighting COVID-19 with Artificial IntelligenceFighting COVID-19 with Artificial Intelligence
Fighting COVID-19 with Artificial Intelligencevty
 
Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and DAN...
Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and DAN...Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and DAN...
Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and DAN...vty
 
Building COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science ProjectBuilding COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science Projectvty
 
Running Dataverse repository in the European Open Science Cloud (EOSC)
Running Dataverse repository in the European Open Science Cloud (EOSC)Running Dataverse repository in the European Open Science Cloud (EOSC)
Running Dataverse repository in the European Open Science Cloud (EOSC)vty
 
Integration of WORSICA’s thematic service in EOSC, Service QA and Dataverse
Integration of WORSICA’s thematic service in EOSC,  Service QA and DataverseIntegration of WORSICA’s thematic service in EOSC,  Service QA and Dataverse
Integration of WORSICA’s thematic service in EOSC, Service QA and Dataversevty
 
Building an electronic repository and archives on Dataverse in the European O...
Building an electronic repository and archives on Dataverse in the European O...Building an electronic repository and archives on Dataverse in the European O...
Building an electronic repository and archives on Dataverse in the European O...vty
 
Flexible metadata schemes for research data repositories - Clarin Conference...
Flexible metadata schemes for research data repositories  - Clarin Conference...Flexible metadata schemes for research data repositories  - Clarin Conference...
Flexible metadata schemes for research data repositories - Clarin Conference...Vyacheslav Tykhonov
 
Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and the...
Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and the...Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and the...
Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and the...Andrea Scharnhorst
 
Dataverse in the European Open Science Cloud
Dataverse in the European Open Science CloudDataverse in the European Open Science Cloud
Dataverse in the European Open Science Cloudvty
 
Dataverse opportunities
Dataverse opportunitiesDataverse opportunities
Dataverse opportunitiesvty
 

La actualidad más candente (20)

Metaverse for Dataverse
Metaverse for DataverseMetaverse for Dataverse
Metaverse for Dataverse
 
Ontologies, controlled vocabularies and Dataverse
Ontologies, controlled vocabularies and DataverseOntologies, controlled vocabularies and Dataverse
Ontologies, controlled vocabularies and Dataverse
 
Automated CI/CD testing, installation and deployment of Dataverse infrastruct...
Automated CI/CD testing, installation and deployment of Dataverse infrastruct...Automated CI/CD testing, installation and deployment of Dataverse infrastruct...
Automated CI/CD testing, installation and deployment of Dataverse infrastruct...
 
Fighting COVID-19 with Artificial Intelligence
Fighting COVID-19 with Artificial IntelligenceFighting COVID-19 with Artificial Intelligence
Fighting COVID-19 with Artificial Intelligence
 
Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and DAN...
Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and DAN...Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and DAN...
Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and DAN...
 
Building COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science ProjectBuilding COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science Project
 
Running Dataverse repository in the European Open Science Cloud (EOSC)
Running Dataverse repository in the European Open Science Cloud (EOSC)Running Dataverse repository in the European Open Science Cloud (EOSC)
Running Dataverse repository in the European Open Science Cloud (EOSC)
 
Integration of WORSICA’s thematic service in EOSC, Service QA and Dataverse
Integration of WORSICA’s thematic service in EOSC,  Service QA and DataverseIntegration of WORSICA’s thematic service in EOSC,  Service QA and Dataverse
Integration of WORSICA’s thematic service in EOSC, Service QA and Dataverse
 
Building an electronic repository and archives on Dataverse in the European O...
Building an electronic repository and archives on Dataverse in the European O...Building an electronic repository and archives on Dataverse in the European O...
Building an electronic repository and archives on Dataverse in the European O...
 
Flexible metadata schemes for research data repositories - Clarin Conference...
Flexible metadata schemes for research data repositories  - Clarin Conference...Flexible metadata schemes for research data repositories  - Clarin Conference...
Flexible metadata schemes for research data repositories - Clarin Conference...
 
Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and the...
Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and the...Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and the...
Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and the...
 
Dataverse in the European Open Science Cloud
Dataverse in the European Open Science CloudDataverse in the European Open Science Cloud
Dataverse in the European Open Science Cloud
 
Dataverse opportunities
Dataverse opportunitiesDataverse opportunities
Dataverse opportunities
 
LOD2 Webinar: SIREn
LOD2 Webinar: SIREnLOD2 Webinar: SIREn
LOD2 Webinar: SIREn
 
LOD2 Webinar Series: 3rd relase of the Stack
LOD2 Webinar Series: 3rd relase of the StackLOD2 Webinar Series: 3rd relase of the Stack
LOD2 Webinar Series: 3rd relase of the Stack
 
LOD2 Webinar Series: D2R and Sparqlify
LOD2 Webinar Series: D2R and SparqlifyLOD2 Webinar Series: D2R and Sparqlify
LOD2 Webinar Series: D2R and Sparqlify
 
LOD2 Webinar Series: CubeViz
LOD2 Webinar Series: CubeViz LOD2 Webinar Series: CubeViz
LOD2 Webinar Series: CubeViz
 
LOD2 Webinar Series Classification and Quality Analysis with DL Learner and ORE
LOD2 Webinar Series Classification and Quality Analysis with DL Learner and ORELOD2 Webinar Series Classification and Quality Analysis with DL Learner and ORE
LOD2 Webinar Series Classification and Quality Analysis with DL Learner and ORE
 
LOD2 Webinar: UnifiedViews
LOD2 Webinar: UnifiedViewsLOD2 Webinar: UnifiedViews
LOD2 Webinar: UnifiedViews
 
LOD2 Plenary Vienna 2012: WP3 - Knowledge Base Creation, Enrichment and Repair
LOD2 Plenary Vienna 2012: WP3 - Knowledge Base Creation, Enrichment and RepairLOD2 Plenary Vienna 2012: WP3 - Knowledge Base Creation, Enrichment and Repair
LOD2 Plenary Vienna 2012: WP3 - Knowledge Base Creation, Enrichment and Repair
 

Similar a External Controlled Vocabularies Support in Dataverse

Dataverse repository for research data in the COVID-19 Museum
Dataverse repository for research data  in the COVID-19 MuseumDataverse repository for research data  in the COVID-19 Museum
Dataverse repository for research data in the COVID-19 Museumvty
 
SSHOC Dataverse in the European Open Science Cloud
SSHOC Dataverse in the European Open Science CloudSSHOC Dataverse in the European Open Science Cloud
SSHOC Dataverse in the European Open Science Cloudvty
 
Requirements of DARIAH community for a Dataverse repository (SSHOC 2020)
Requirements of DARIAH community for a Dataverse repository (SSHOC 2020)Requirements of DARIAH community for a Dataverse repository (SSHOC 2020)
Requirements of DARIAH community for a Dataverse repository (SSHOC 2020)Péter Király
 
SSHOC at EOSC-hub Week - SSH & eInfra Projects - Daan Broeder
SSHOC at EOSC-hub Week - SSH & eInfra Projects - Daan BroederSSHOC at EOSC-hub Week - SSH & eInfra Projects - Daan Broeder
SSHOC at EOSC-hub Week - SSH & eInfra Projects - Daan BroederSSHOC
 
SSHOC General Presentation
SSHOC General PresentationSSHOC General Presentation
SSHOC General PresentationSSHOC
 
5. RDA community - Expectations from 14th plenary (Marieke Willems, Trust-IT)
5. RDA community - Expectations from 14th plenary (Marieke Willems, Trust-IT)5. RDA community - Expectations from 14th plenary (Marieke Willems, Trust-IT)
5. RDA community - Expectations from 14th plenary (Marieke Willems, Trust-IT)SSHOC
 
PaNOSC and ExPaNDS commitment to Open Science
PaNOSC and ExPaNDS commitment to Open SciencePaNOSC and ExPaNDS commitment to Open Science
PaNOSC and ExPaNDS commitment to Open SciencePaNOSC
 
PaNOSC and Research Data Management / Battery2030+ Initiative Workshop / 12 M...
PaNOSC and Research Data Management / Battery2030+ Initiative Workshop / 12 M...PaNOSC and Research Data Management / Battery2030+ Initiative Workshop / 12 M...
PaNOSC and Research Data Management / Battery2030+ Initiative Workshop / 12 M...PaNOSC
 
2. EOSC-hub (Daan Broeder, CLARIN ERIC)
2. EOSC-hub (Daan Broeder, CLARIN ERIC)2. EOSC-hub (Daan Broeder, CLARIN ERIC)
2. EOSC-hub (Daan Broeder, CLARIN ERIC)SSHOC
 
Overview of the Sustainability Plans of the ICT-29b) Projects
Overview of the Sustainability Plans of the ICT-29b) ProjectsOverview of the Sustainability Plans of the ICT-29b) Projects
Overview of the Sustainability Plans of the ICT-29b) ProjectsPretaLLOD
 
Sshoc kick off meeting - Work Package 5 Pitch - Annette Scherpenzeel
Sshoc kick off meeting - Work Package 5 Pitch - Annette ScherpenzeelSshoc kick off meeting - Work Package 5 Pitch - Annette Scherpenzeel
Sshoc kick off meeting - Work Package 5 Pitch - Annette ScherpenzeelSSHOC
 
Sshoc kick off meeting - Work package 3 Pitch - Daan Broeder - KNAW HuC/CLARI...
Sshoc kick off meeting - Work package 3 Pitch - Daan Broeder - KNAW HuC/CLARI...Sshoc kick off meeting - Work package 3 Pitch - Daan Broeder - KNAW HuC/CLARI...
Sshoc kick off meeting - Work package 3 Pitch - Daan Broeder - KNAW HuC/CLARI...SSHOC
 
Sshoc kick off meeting - Work Package 9 Pitch
Sshoc kick off meeting - Work Package 9 PitchSshoc kick off meeting - Work Package 9 Pitch
Sshoc kick off meeting - Work Package 9 PitchSSHOC
 
3. Collaboration with other cluster projects - Governance (Franciska de Jong,...
3. Collaboration with other cluster projects - Governance (Franciska de Jong,...3. Collaboration with other cluster projects - Governance (Franciska de Jong,...
3. Collaboration with other cluster projects - Governance (Franciska de Jong,...SSHOC
 
Introduction to RDF and related Vocabularies/Languages. Introduction to SPARQL
Introduction to RDF and related Vocabularies/Languages. Introduction to SPARQLIntroduction to RDF and related Vocabularies/Languages. Introduction to SPARQL
Introduction to RDF and related Vocabularies/Languages. Introduction to SPARQLPretaLLOD
 
ESCAPE Kick-off meeting - WP1 (Feb 2019 - 2nd day)
ESCAPE Kick-off meeting - WP1 (Feb 2019 - 2nd day) ESCAPE Kick-off meeting - WP1 (Feb 2019 - 2nd day)
ESCAPE Kick-off meeting - WP1 (Feb 2019 - 2nd day) ESCAPE EU
 
ESCAPE Kick-off meeting - Welcome (Feb 2019)
ESCAPE Kick-off meeting - Welcome (Feb 2019)ESCAPE Kick-off meeting - Welcome (Feb 2019)
ESCAPE Kick-off meeting - Welcome (Feb 2019)ESCAPE EU
 
PaNOSC: EOSC for Photon and Neutron Facilities Users
PaNOSC: EOSC for Photon and Neutron Facilities Users PaNOSC: EOSC for Photon and Neutron Facilities Users
PaNOSC: EOSC for Photon and Neutron Facilities Users EOSC-hub project
 
European Research Projects as EOSC Service Providers
European Research Projects as EOSC Service ProvidersEuropean Research Projects as EOSC Service Providers
European Research Projects as EOSC Service ProvidersPedro Príncipe
 

Similar a External Controlled Vocabularies Support in Dataverse (20)

Dataverse repository for research data in the COVID-19 Museum
Dataverse repository for research data  in the COVID-19 MuseumDataverse repository for research data  in the COVID-19 Museum
Dataverse repository for research data in the COVID-19 Museum
 
SSHOC Dataverse in the European Open Science Cloud
SSHOC Dataverse in the European Open Science CloudSSHOC Dataverse in the European Open Science Cloud
SSHOC Dataverse in the European Open Science Cloud
 
Requirements of DARIAH community for a Dataverse repository (SSHOC 2020)
Requirements of DARIAH community for a Dataverse repository (SSHOC 2020)Requirements of DARIAH community for a Dataverse repository (SSHOC 2020)
Requirements of DARIAH community for a Dataverse repository (SSHOC 2020)
 
SSHOC at EOSC-hub Week - SSH & eInfra Projects - Daan Broeder
SSHOC at EOSC-hub Week - SSH & eInfra Projects - Daan BroederSSHOC at EOSC-hub Week - SSH & eInfra Projects - Daan Broeder
SSHOC at EOSC-hub Week - SSH & eInfra Projects - Daan Broeder
 
SSH & eInfra projects
SSH & eInfra projectsSSH & eInfra projects
SSH & eInfra projects
 
SSHOC General Presentation
SSHOC General PresentationSSHOC General Presentation
SSHOC General Presentation
 
5. RDA community - Expectations from 14th plenary (Marieke Willems, Trust-IT)
5. RDA community - Expectations from 14th plenary (Marieke Willems, Trust-IT)5. RDA community - Expectations from 14th plenary (Marieke Willems, Trust-IT)
5. RDA community - Expectations from 14th plenary (Marieke Willems, Trust-IT)
 
PaNOSC and ExPaNDS commitment to Open Science
PaNOSC and ExPaNDS commitment to Open SciencePaNOSC and ExPaNDS commitment to Open Science
PaNOSC and ExPaNDS commitment to Open Science
 
PaNOSC and Research Data Management / Battery2030+ Initiative Workshop / 12 M...
External Controlled Vocabularies Support in Dataverse

  • 1. External controlled vocabularies support in Dataverse. Slava Tykhonov, Senior Information Scientist (DANS-KNAW). Ideas Challenge Meeting | eifl, April 8, 2021
  • 2. About me: DANS-KNAW projects (2016-2021)
    ● CLARIAH+ (ongoing)
    ● EOSC Synergy (ongoing)
    ● SSHOC Dataverse (ongoing)
    ● CESSDA DataverseEU 2018
    ● Time Machine Europe Supervisor at DANS-KNAW
    ● PARTHENOS Horizon 2020
    ● CESSDA PID (Personal Identifiers) Horizon 2020
    ● CLARIAH
    ● RDA (Research Data Alliance) PITTS Horizon 2020
    ● CESSDA SaW H2020-EU.1.4.1.1 Horizon 2020
  • 4. What is Dataverse?
    ● Open source project developed by IQSS at Harvard University
    ● Great product with a very long history (since 2006), created by an experienced, Agile development team
    ● Clear vision and understanding of research communities' requirements, public roadmap
    ● Well-developed architecture with rich APIs that allow application layers to be built around Dataverse
    ● A strong community behind Dataverse helps to improve the basic functionality and develop it further
    ● DANS-KNAW leads the SSHOC WP5.2 task to deliver a production-ready Dataverse repository for the CESSDA, CLARIN and DARIAH communities
  • 5. DANS Data Stations - Future Data Services. Dataverse is an API-based data platform and a key framework for Open Innovation!
  • 6. DANS-KNAW is one of the worldwide leaders in FAIR Data (FAIRsFAIR)
  • 7. FAIR and Dataverse. Source: Mercè Crosas, “FAIR principles and beyond: implementation in Dataverse”
  • 8. Out of the box CV support in Dataverse (1). Source: Dataverse Metadata Schema
  • 9. Out of the box CV support in Dataverse (2). Internal vocabularies are stored in Dataverse; we need more CVs!
  • 10. The importance of standards and ontologies
    Generic controlled vocabularies for linking metadata in bibliographic collections are well known: ORCID, GRID, GeoNames, Getty.
    Medical knowledge graphs are powered by:
    ● Biological Expression Language (BEL)
    ● Medical Subject Headings (MeSH®) by U.S. National Library of Medicine (NIH)
    ● Wikidata (Open ontology) - Wikipedia
    Integration is based on metadata standards:
    ● MARC21, Dublin Core (DC), Data Documentation Initiative (DDI)
    Most of the prominent ontologies are already available as web services with API endpoints.
  • 11. Simple Knowledge Organization System (SKOS)
    SKOS models thesaurus-like resources:
    - skos:Concept instances with preferred labels and alternative labels (synonyms) attached to them (skos:prefLabel, skos:altLabel).
    - a skos:Concept can be related to others through the skos:broader, skos:narrower and skos:related properties.
    - terms and concepts can have more than one broader term or concept.
    SKOS makes it possible to create a semantic layer on top of objects: a network of statements and relationships.
    A major difference between SKOS and formal ontologies lies in the logical “is-a” hierarchy: in thesauri, the hierarchical relation can represent anything from “is-a” to “part-of”.
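The SKOS structure described on this slide can be sketched in plain Python. This is only an illustration and not part of the original deck: the `Concept` class, the `ancestors` helper and the `http://example.org/` URIs are hypothetical; only the property names (prefLabel, altLabel, broader) come from SKOS.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    """Hypothetical in-memory stand-in for a skos:Concept."""
    uri: str
    pref_label: dict                                  # language -> skos:prefLabel
    alt_labels: list = field(default_factory=list)    # skos:altLabel (synonyms)
    broader: list = field(default_factory=list)       # URIs of skos:broader concepts


def ancestors(uri, concepts):
    """Collect every concept reachable by following skos:broader links."""
    seen = set()
    stack = list(concepts[uri].broader)
    while stack:
        current = stack.pop()
        if current in seen:
            continue
        seen.add(current)
        stack.extend(concepts[current].broader)
    return seen


# All URIs and labels below are made up for illustration.
concepts = {
    c.uri: c
    for c in [
        Concept("http://example.org/c/organisation", {"en": "Organisation"}),
        Concept("http://example.org/c/education", {"en": "Education"}),
        Concept(
            "http://example.org/c/university",
            {"en": "University", "nl": "Universiteit"},
            alt_labels=["Higher education institution"],
            # A concept may have more than one broader concept.
            broader=[
                "http://example.org/c/organisation",
                "http://example.org/c/education",
            ],
        ),
    ]
}

print(sorted(ancestors("http://example.org/c/university", concepts)))
```

Because `broader` is a list, the sketch allows the polyhierarchy mentioned above without saying whether a link means “is-a” or “part-of”.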
  • 12. Global Research Identifier Database (GRID) in SKOS
    We already have a lot of data in the global Dataverse network. Can we provide depositors with a convenient web interface to link their metadata to external controlled vocabularies? Is it possible to disambiguate concepts and create links automatically?
  • 13. SKOSMOS framework to discover ontologies
    ● SKOSMOS is developed in Europe by the National Library of Finland (NLF)
    ● active global user community
    ● search and browsing interface for SKOS concepts
    ● multilingual vocabularies support
    ● used for different use cases (publishing vocabularies, building discovery systems, vocabulary visualization)
  • 14. SKOSMOS API specification in Swagger. Source: Finto API
  • 15. SKOSMOS API example for GRID ontology
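A term lookup against a SKOSMOS REST API can be sketched as follows. This is a hedged illustration rather than the request shown on the slide: the host `skosmos.example.org`, the vocabulary id `grid` and the sample payload are assumptions, and the `results`/`uri`/`prefLabel` keys reflect the usual shape of a SKOSMOS `search` response, which should be checked against the Swagger specification of the service in use. The code only builds the URL and parses an already-decoded response, so it runs without network access.

```python
from urllib.parse import urlencode


def build_search_url(query, api_base, vocab=None, lang=None):
    """Build the URL for the SKOSMOS REST `search` method without calling it."""
    params = {"query": query}
    if vocab:
        params["vocab"] = vocab
    if lang:
        params["lang"] = lang
    return api_base.rstrip("/") + "/search?" + urlencode(params)


def labels_from_response(payload):
    """Extract (uri, prefLabel) pairs from a decoded search response."""
    return [(hit["uri"], hit.get("prefLabel")) for hit in payload.get("results", [])]


# Hypothetical deployment and vocabulary id, for illustration only.
url = build_search_url(
    "Utrecht",
    api_base="https://skosmos.example.org/rest/v1/",
    vocab="grid",
    lang="en",
)
print(url)

# Made-up payload in the assumed response shape.
sample = {"results": [{"uri": "http://example.org/grid/1", "prefLabel": "Example University"}]}
print(labels_from_response(sample))
```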
  • 16. Semantic Gateway as plugin app (in development). Source: Dataverse gateway
  • 17. Dataverse deposit form with selected CVs. Every field can be linked to controlled vocabularies in a FAIR way!
  • 18. One metadata field linked to many ontologies. The language switch in Dataverse changes the language of the suggested terms!
  • 19. Improved metadata schema with 4 child fields. Vocabulary and Term are selected by the user; Vocabulary URL and Term URL are filled in automatically.
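The four child fields can be pictured with a small sketch in which the user picks a vocabulary and a term and the two URLs are derived. All names here (`KeywordEntry`, `fill_entry`, the lookup tables and the example URLs) are hypothetical and do not reflect the actual Dataverse metadata block or the Semantic Gateway code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class KeywordEntry:
    """Hypothetical compound field with the 4 child fields from the slide."""
    vocabulary: str       # selected by the user
    term: str             # selected by the user
    vocabulary_url: str   # filled in automatically
    term_url: str         # filled in automatically


# Made-up lookup tables standing in for what a vocabulary service would return.
VOCABULARY_URLS = {"Example thesaurus": "https://vocab.example.org/thesaurus/"}
TERM_URLS = {
    ("Example thesaurus", "Migration"): "https://vocab.example.org/thesaurus/c_123",
}


def fill_entry(vocabulary, term, vocabulary_urls, term_urls):
    """Derive both URLs from the user's selection; refuse unknown selections."""
    try:
        vocabulary_url = vocabulary_urls[vocabulary]
        term_url = term_urls[(vocabulary, term)]
    except KeyError as missing:
        raise ValueError(f"no URL known for {missing}") from None
    return KeywordEntry(vocabulary, term, vocabulary_url, term_url)


entry = fill_entry("Example thesaurus", "Migration", VOCABULARY_URLS, TERM_URLS)
print(entry)
```

Raising on an unknown term keeps the URL fields from being stored empty; a deployment that allows free-text values would need a different policy.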
  • 20. Configuration for external controlled vocabularies. Pull Request to Dataverse core: https://github.com/IQSS/dataverse/pull/7712
  • 21. JavaScript interface
    The CV interface is implemented in JavaScript and placed outside of the Dataverse application.
    Internal: "js-url": "/resources/js/cvoc-interface.js"
    External: "js-url": "https://raw.githubusercontent.com/Dans-labs/semantic-gateway/main/static/js/interface.js"
  • 22. SKOSMOS Python module (SKOSMOS-Client)
    from skosmos_client import SkosmosClient
    # then you can create your own client
    skosmos = SkosmosClient(api_base='http://api.finto.fi/rest/v1/')
    Finding the available vocabularies:
    Vocabulary id: afo title: AFO - Natural resource and environment ontology
    Vocabulary id: allars title: Allärs - General thesaurus in Swedish
    Vocabulary id: cn title: Finnish Corporate Names
    Vocabulary id: ic title: Iconclass
    ...
  • 23. BARTOC in SKOSMOS. Source: http://bartoc-skosmos.unibas.ch/en/
  • 24. Other SKOSMOS supported services
    ● Finto (Finnish thesaurus and ontology service)
    ● CESSDA CV Service (has implemented a SKOSMOS interface)
    ● CESSDA ELSST (European Language Social Science Thesaurus)
    ● ACDH Vocabularies (Austrian Academy of Sciences)
    ● Thesaurus INRAE (Paris, France)
    ● AGROVOC Multilingual Thesaurus (United Nations)
    ● UNESCO Thesaurus
    ● European Space Agency (ESA)
    NDE (Netwerk Digitaal Erfgoed) is working with DANS on (partial) support of the SKOSMOS protocol to get a proper external CV connection to DANS Data Stations.
  • 25. Use case: COVID-19 expert questions. Source: Epidemic Questions Answering
    “In response to the COVID-19 pandemic, the Epidemic Question Answering (EPIC-QA) track challenges teams to develop systems capable of automatically answering ad-hoc questions about the disease COVID-19, its causal virus SARS-CoV-2, related corona viruses, and the recommended response to the pandemic. While COVID-19 has been an impetus for a large body of emergent scientific research and inquiry, the response to COVID-19 raises questions for consumers.”
  • 26. COVID-19 questions in SKOSMOS framework. Source: COVID-19 questions in SKOSMOS
  • 27. COVID-19 questions in Dataverse metadata. Source: COVID-19 European data hub in Harvard Dataverse
    ● COVID-19 ontologies can be hosted by the SKOSMOS framework
    ● Researchers can enrich metadata by adding standardized questions provided by SKOSMOS ontologies
    ● rich metadata is exported back to the Linked Open Data Cloud to increase its chance of being found
    ● enriched metadata can be used for further training of ML models
  • 28. Known issues with external controlled vocabularies
    ● how CV support can be applied to any field
    ● support of any available vocabulary
    ● backward compatibility with fields from the old metadata schema
    ● clean UI experience (one selection can fill 4 child fields)
    ● whether non-managed vocabularies or free-text values can be used in the same field
    ● concept drift (the change of meaning of concepts)
    ● interoperability across all Dataverse instances
    ● how to ensure CVs are coming from authoritative services
    ...
  • 29. Issue: how can CV support be applied to any field?
    Problem: the implementation would support the keyword field (with the addition of one child field) and any new or existing fields built to have the 4 required child fields. What about, for example, subject, funder ID, grant ID, etc.?
    Possible solution: changes could be applied to existing text fields without modifying the metadata block, by adding concept URIs. However, this requires changes on the CV interface side.
  • 30. Issue: support of any available vocabulary
    Problem: the current implementation is specific to the SKOSMOS protocol, which handles many vocabularies.
    Solution: the interface to external API endpoints with vocabularies has been placed outside of Dataverse as external JavaScript and can be extended to support any API.
  • 31. Issue: backward compatibility
    Problem: our implementation of external controlled vocabularies support requires 4 child fields instead of 3 (the default for Dataverse).
    Possible solution: create a Flyway script to adapt existing field entries if the metadata schema is extended with a new 4th field to hold the concept URIs. This 4th field can’t be null.
    Source: Harvard Dataverse
  • 32. Issue: clean User Interface experience
    Problem: the display retains the 4 fields even though one selection determines all 4. Could the other fields be hidden or disabled? With SKOSMOS-served vocabularies, some child fields will be filled automatically.
    Possible solution: use a more flexible configuration to define 2 child fields (label/URI) instead of 4 where possible. Where 4 fields are unavoidable, make 3 of them read-only and managed by Dataverse, not by the user.
  • 33. Issue: non-managed vocabularies or free-text values
    Problem: can a user mix non-managed controlled vocabularies or free-text values in the same field?
    Possible solution: the input could allow the selector to be disabled through a ‘manual’ mode. If a user wants to store a term that doesn’t match any entry in the CV, it could be stored as text. However, this is not a sustainable solution: how would free-text terms be synced with external CVs?
  • 34. Issue: how to ensure a CV is from an authoritative service?
    Problem: since the service URL is part of the configuration, Dataverse could be configured to use other services (a locally managed one, a mirror, etc.).
    Possible solution: the admin is responsible for using an authoritative source. However, we don’t know how to control this in a distributed network. This could become a serious issue if a service moves from one provider to another; mirrors should be considered here.
  • 35. Issue: concept drift
    Concept drift refers to cases where a concept replaces the meaning of other concepts, or other concepts take over its meaning. It can lead to data quality problems that are very difficult to trace and address.
    Possible scenarios of concept drift:
    - at the concept identifier level (label drift)
    - in the basic properties of the concept (intensional drift)
    - in the things the concept refers to (extensional drift)
    Source: Detecting and Reporting Extensional Concept Drift in Statistical Linked Data
    Possible solution: create and maintain a cache of every concept inside the data repository
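The caching idea can be sketched as a small history of labels per concept URI. This is an assumption-laden illustration, not the author's implementation: `ConceptCache`, its methods and the example URI are hypothetical, and the sketch only detects label drift, not intensional or extensional drift.

```python
from datetime import date


class ConceptCache:
    """Hypothetical local cache that keeps the label history of each concept URI."""

    def __init__(self):
        self._history = {}  # uri -> list of (date first seen, label)

    def record(self, uri, label, seen_on):
        """Store a snapshot only when the label differs from the last one seen."""
        history = self._history.setdefault(uri, [])
        if not history or history[-1][1] != label:
            history.append((seen_on, label))

    def original_label(self, uri):
        """Label as it was when the concept was first cached."""
        return self._history[uri][0][1]

    def has_label_drift(self, uri):
        """True when the external label changed after it was first cached."""
        return len(self._history.get(uri, [])) > 1


cache = ConceptCache()
uri = "http://example.org/c/42"  # made-up concept URI
cache.record(uri, "Label A", date(2020, 3, 1))
cache.record(uri, "Label A", date(2020, 6, 1))   # unchanged, no new snapshot
cache.record(uri, "Label B", date(2021, 4, 8))   # label changed upstream
print(cache.original_label(uri), cache.has_label_drift(uri))
```

Keeping the first snapshot lets a repository show the label that the depositor actually selected, even if the external service later changes it.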
  • 36. Issue: interoperability across all Dataverse instances
    Problem: this implementation requires the same configuration in order to import data and metadata from another Dataverse instance. If it is not configured, the value shows as 4 child fields.
    Possible solution: terms from unsupported (undefined) vocabularies would simply be shown as their URLs in another instance.
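The fallback proposed here can be expressed in a few lines. The function name, its parameters and the example URLs are hypothetical; the sketch only shows the decision rule (show the label when the vocabulary is configured locally, otherwise show the term URL), not how Dataverse renders fields.

```python
def display_term(term_label, term_url, vocabulary_url, configured_vocabularies):
    """Show the label for locally configured vocabularies, else the term URL."""
    if vocabulary_url in configured_vocabularies:
        return term_label
    return term_url


# Made-up configuration of the importing instance.
configured = {"https://vocab.example.org/thesaurus/"}

print(display_term(
    "Migration",
    "https://vocab.example.org/thesaurus/c_123",
    "https://vocab.example.org/thesaurus/",
    configured,
))
print(display_term(
    "Migratie",
    "https://other.example.org/cv/77",
    "https://other.example.org/cv/",
    configured,
))
```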
  • 37. Linking data (files) to external CVs, not only metadata. Source: Scholars Portal’s Data Curation Tool (Canada)
  • 38. Thank you for your attention! Slava Tykhonov (DANS-KNAW), vyacheslav.tykhonov@dans.knaw.nl. https://www.sshopencloud.eu, info@sshopencloud.eu, @SSHOpenCloud, /in/sshopencloud. Join our community