SlideShare una empresa de Scribd logo
1 de 38
Sean Bechhofer
sean.bechhofer@manchester.ac.uk
@seanbechhofer
Making Metadata Work, ISKO
London, 23rd June 2014
Metadata for
Research Objects
1
Publication
• Publications are about argumentation: Convince
the reader of the validity of a position
– Reproducible Results System: facilitates
enactment and publication of reproducible
research.
• Results are reinforced by reproducability
– Explicit representation of method.
• Verifiability as a key factor in scientific discovery.
J. Mesirov Accessible Reproducible Research Science 327(5964), p.415-416,
2010 doi:10.1126/science.1179653
Stodden et. al. Reproducible Research: Addressing the Need for Data and
Code Sharing in Computational Science Computing in Science and
Engineering 12(5), p.8-13, 2010 doi:10.1109/MCSE.2010.113
C.Goble et. al. Accelerating Scientists’ Knowledge Turns
Communications in Computer and Information Science Volume 348,
2013, pp 3-25 doi:10.1007/978-3-642-37186-8_1
Reproducible Science
3
Goble: SSI Collaborations Workshop 2014
Scientific Workflows
4
» Scientific workflows are at the heart of
experimental science
› Enable automation of scientific
methods
› Support experimental
reproducibility
› Encourage best practices
» There is then a need to preserve
these workflows
› Scientific development based on
method reuse and repurpose
› Conservation is key
» Workflow preservation is a
multidimensional challenge
› Representation of complex
objects
› Decay analysis, diagnosis, and
prevention
› Social Objects that can be
inspected, reused, repurposed
Preservation of scientific workflows in
data-intensive science
Preservation
Technical
Multi-step computational process
Repeatable and comparative
Explicate computation
Social
Virtual Witnessing
Transparent, precise, citable
documentation
Accurate provenance logs
Reusable protocols, know-how,
best practice
Can I review /
repeat your
method?
Can I defend
my method?
Can I reuse /
reproduce this
method?
Context: Semantic Web and Linked
Data
• SW: Explicit machine-readable representation of information
• LD: A set of best practices for publishing
and connecting data on the Web
1. Use URIs to name things
2. Use dereferencable HTTP URIs
3. Provide useful content on
lookup using standards
4. Include links to other stuff
6
• An aggregation object that bundles together experimental
resources that are essential to a computational scientific study
or investigation.
– data used
– results produced in an experiment study;
– (computational) methods employed to
produce and analyse that data;
– people involved in the investigation.
• Plus annotation information that provides additional
information about both the bundle itself and the resources of
the bundle
– descriptions
– provenance
Research Objects
7
ROs as a Currency
8
Creator
Contributor
Collaborator
Comparator
Re-User
Evaluator
Reviewer
Trainee
Trainer
Reader
Publisher
Curator
Librarian
Repository
Manager
• Three principles underlie the approach:
• Identity
– Referring to resources
(and the aggregation itself)
• Aggregation
– Describing the aggregation structure
and its constituent parts
• Annotation
– Associating information with aggregated resources.
Research Objects
9
Identity
• Mechanisms for referring to the resources that are aggregated
within a Research Object
• URIs
– Web Resources
• DOIs
– Documents/papers/datasets
• ORCID IDs
– Researchers
10
Identifier Issues
• HTTP URIs provide both access and identification
• PIDs: Persistent Identifiers (e.g.DOIs) tend to resolve to
human-readable landing pages
– With embedded links to further (possibly machine-
readable) resources
• ROs seen as non-information resources with descriptive
(RDF) metadata
– Redirection/negotiation
– Standard patterns for Linked Data resources
• Bidirectional mappings between URIs and PIDs
• Versioning through, e.g. Memento
11
H. Van de Sompel et. al. Persistent Identifiers for Scholarly Assets
and the Web: The Need for an Unambiguous Mapping 9th
International Digital Curation Conference
Aggregation
• Open Archives Initiation Object Reuse and Exchange (OAI
ORE) is a standard for describing aggregations of web
resources
– http://www.openarchives.org/ore/
• Uses a Resource Map to describe the aggregated resources
• Proxies allow for statements about the resources within the
aggregation
– Capturing context and viewpoints
• Several concrete serialisations
– RDF/XML, Atom, RDFa
12
Graceful Degradation
Annotation
• Open Annotation specification is a community developed data
model for annotation of web resources
– http://www.openannotation.org/spec/core/
• Developed by the W3C Open Annotation Community Group
• Allows for “stand-off” annotations
– Annotation as a first class citizen
• Developed to fit with Web Architecture
13
Graceful Degradation
Annotation Content
• Essential to the understanding and interpretation of the
scientific outcomes captured by a Research Object as well as
the reuse of the resources within it.
– Provenance information about the experiments, the study
or any other experimental resources
– Evolution information about the Research Object and its
resources,
– Descriptions of computational methods
or processes
– Dependency information or settings
about the experiment executions
14
Core & Extensions
• Core model provides support for aggregation and annotation
• Extensions provide additional vocabularies for domain specific
tasks
• Workflow Provenance
– Information capturing workflow executions
• Workflow Description
– Abstractions describing Processes, inputs and outputs
• Research Object Evolution
– Information describing change and “snapshots”
15
RO Model
16
Provenance
• W3C’s PROV model allows for capture of information relating
to
– Attribution
 Who did it?
– Derivation
 Data sources used
– Activities
 What happened
(and when)
• Significant eco-system (generators, viewers, consumers) has
grown up around PROV
– IPAW & TAPP
17
Copyright © 2013 W3C® (MIT, ERCIM, Keio, Beihang), All Rights
Reserved.
Tooling
18
preservation and access to preserved ROs as depicted in Figure 6. Optionally, an external repository may
used to support the frequently evolving research objects. The repositories may be housed in a single
multiple physical repositories, and use the same or differing technologies (e.g. a repository may use a dig
preservation solution for the Preservation Repository and specialized digital library solution for the Acce
Repository). Additionally, as the Preservation Repository does not have the same interactive u
requirements as the access and live repositories, it could be implemented with slower (or offline) stora
alternatives.
Figure 6. Conceptual Archival System Storage Architecture.
ROs and OAIS
• ROs as Information Packages in OAIS
• myExperiment as live/access repository
• ROHUB as archival repository
19
SCAPE: Planning and Watch
20
Watch
OperationsPlanning
Env &
Users
Repository
plan
deploy
monitor monitor
monitor
access
ingest,
harvest
execution
http://www.scape-project.eu/
• SCAPE project concerned with Digital Preservation.
• Planning and Watch infrastructure to helpmmonitor
the state of a repository and co-ordinate appropriate actions
• Driven by policies.
myExperiment and RODL
Decay, Service
Deprecation,
Data source monitoring,
Checklists,
Minimal Models
Wf4Ever: Monitoring and Watch
21
Watch
OperationsPlanning
Env &
Users
Repository
plan
deploy
monitor monitor
monitor
access
ingest,
harvest
execution
• Ideas applied to workflow preservation
Decay
• Survey of 92 Taverna workflows from myExperiment
• Volatile Third-Party
Resources
• Missing Data
• Missing Execution Environments
• Poor descriptions
22
Belhajjame et. al. Why workflows break — Understanding
and combating decay in Taverna workflows e-Science 2012
doi:10.1109/eScience.2012.6404482
(a) An overview of the decay causes. (b) Workflow decay due to third party resources.
Fig. 3. Summary of workflow decay causes.
Checklists and Validation
• Checklists widely used to support safety, quality and
consistency
• Common in experimental science
– Expressing minimum information
required
– Supporting “health” monitoring of
workflow-centric ROs.
• Checklists can be defined in terms of
the RO model and its annotations
– Generic checklist service then
executes against that model and
the given annotations
– Provenance 23
Minim Data Model
pliant” or “ minimally compliant” with a checklist if it satisfies all of its MAY,
SHOULD or MUST items respectively.
Fig. 1. An overview of the Minim model schema.
Checklist
Requirement
QueryTestRule SparqlQuery
Result modifier
(string)
Query pattern
(string)Rule
CardinalityTest
Min cardinality
(integer)
AggregationTest
URI template
(string)
Max cardinality
(integer)
min
max
affirmRuleaggregatesTemplate
hasRequirement:
hasMustRequirement
hasShouldRequirement
hasMayRequirement
isLiveTemplate
sparql_query result_mod
toModel Notation key:
Explicit entity Implicit (super)class
Literal value
(type)
property
query
graph
QueryResultTest
RuleTest
exists
0..1
0..1
1
1
0..1
0..1
1 1
1
1..*
SoftwareEnvRule
URI template
(string)
Query
AccessibilityTest
URI template
(string)
ExistsTest
Rule
max 1 1
Query
Model
isDerivedBy
1..1
Our Minim data model (see Figure 1) provides 4 core constructs to express
a quality requirement: 24
Zhao et. al. A Checklist-Based Approach for
Quality Assessment of Scientific Information
3rd In. Workshop on Linked Science, 2013
Checklist Evaluation
25
Checklist Evaluation
26
RO Bundle
• A single, transferable object encapsulating the description and
resources of an RO
– Download, transfer, publish
• ZIP-based format (resources) plus a manifest describing
aggregation and annotations (description)
– Unpack with standard tooling
• JSON-LD as a representation for manifest
– Lightweight linked-data format
– Compatible with existing JSON tooling and services
– PROV-O and OAC for annotations
27
http://wf4ever.github.io/ro/bundle/
Bundling via git/Zenodo/figshare
• Scientist works with local folder structure.
– Version management via github.
– Local tooling produces metadata description
– Metadata about the aggregation (and its resources)
provided by “hidden folder”
• Zenodo/figshare pull snapshot from github
– Providing DOIs for the aggregrations
– Additional release cycles can prompt new DOIs
28
Zenodo
29
figshare
30
ROs as RDFa
31
http://rohub.linkeddata.es
RDFa
32
http://rohub.linkeddata.es
Code as a Research Object
33
COMBINE Archive
34
http://co.mbine.org/documents/archive
GigaScience/ISA
35
http://isa-tools.github.io/soapdenovo2/
IPython
36
Wrap Up
• Aggregation objects bundling together experimental resources
that are essential to a computational scientific study or
investigation
– Intended to support greater transparency and
reproducability
• Annotations provide additional information
about the bundle and its contents
– Metadata is key here
• Use of existing standards, vocabularies and
infrastructure
• Nascent tooling to support creation,
management and publication
37
Thanks!
• All the members of the Wf4Ever team
– iSOCO: Intelligent Software Components S.A., Spain
– University of Manchester, School of Computer Science, Manchester, United
Kingdom
– University of Oxford, Department of Zoology, Oxford, UK
– Poznan Supercomputing and Networking Center. Poznan, Poland
– IAA: Instituto de Astrofísica de Andalucía, Granada, Spain
– Leiden University Medical Centre, Centre for Human and Clinical Genetics,
The Netherlands
• Colleagues in Manchester’s Information Management Group
• RO Advisory Board Members
38
http://www.researchobject.org
http://www.wf4ever-project.org

Más contenido relacionado

La actualidad más candente

The FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems BiologyThe FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems BiologyFAIRDOM
 
Gaining credit for sharing research data
Gaining credit for sharing research dataGaining credit for sharing research data
Gaining credit for sharing research dataVarsha Khodiyar
 
Citing data in research articles: principles, implementation, challenges - an...
Citing data in research articles: principles, implementation, challenges - an...Citing data in research articles: principles, implementation, challenges - an...
Citing data in research articles: principles, implementation, challenges - an...FAIRDOM
 
Using Open Science to advance science - advancing open data
Using Open Science to advance science - advancing open data Using Open Science to advance science - advancing open data
Using Open Science to advance science - advancing open data Robert Oostenveld
 
Data management (1)
Data management (1)Data management (1)
Data management (1)SM Lalon
 
On community-standards, data curation and scholarly communication - BITS, Ita...
On community-standards, data curation and scholarly communication - BITS, Ita...On community-standards, data curation and scholarly communication - BITS, Ita...
On community-standards, data curation and scholarly communication - BITS, Ita...Susanna-Assunta Sansone
 
Opportunistic Persistent Data Storage
Opportunistic Persistent Data StorageOpportunistic Persistent Data Storage
Opportunistic Persistent Data StorageLuke Weerasooriya
 
BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...Alejandra Gonzalez-Beltran
 
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...Amanda Whitmire
 
2016 Bio-IT World Cell Line Coordination Poster 2016-04-05v1
2016 Bio-IT World Cell Line Coordination Poster 2016-04-05v12016 Bio-IT World Cell Line Coordination Poster 2016-04-05v1
2016 Bio-IT World Cell Line Coordination Poster 2016-04-05v1Bruce Kozuma
 
re3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositoriesre3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data RepositoriesHeinz Pampel
 
FAIR sequencing data repository based on iRODS
FAIR sequencing data repository based on iRODSFAIR sequencing data repository based on iRODS
FAIR sequencing data repository based on iRODSFelipe Gutierrez
 
Dats nih-dccpc-kc7-april2018-prs-uoxf
Dats  nih-dccpc-kc7-april2018-prs-uoxfDats  nih-dccpc-kc7-april2018-prs-uoxf
Dats nih-dccpc-kc7-april2018-prs-uoxfPhilippe Rocca-Serra
 
Interlinking educational data to Web of Data (Thesis presentation)
Interlinking educational data to Web of Data (Thesis presentation)Interlinking educational data to Web of Data (Thesis presentation)
Interlinking educational data to Web of Data (Thesis presentation)Enayat Rajabi
 
Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Lucy McKenna
 
Use of Research (Meta-)Data - Finding researchers in/across organizations -
Use of Research (Meta-)Data  - Finding researchers in/across organizations -Use of Research (Meta-)Data  - Finding researchers in/across organizations -
Use of Research (Meta-)Data - Finding researchers in/across organizations - National Institute of Informatics (NII)
 

La actualidad más candente (20)

The Donders Repository
The Donders RepositoryThe Donders Repository
The Donders Repository
 
The FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems BiologyThe FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems Biology
 
Beyond the PDF 2, 2013
Beyond the PDF 2, 2013Beyond the PDF 2, 2013
Beyond the PDF 2, 2013
 
Gaining credit for sharing research data
Gaining credit for sharing research dataGaining credit for sharing research data
Gaining credit for sharing research data
 
OpenTox Europe 2013
OpenTox Europe 2013OpenTox Europe 2013
OpenTox Europe 2013
 
Semantic Technologies for Big Sciences including Astrophysics
Semantic Technologies for Big Sciences including AstrophysicsSemantic Technologies for Big Sciences including Astrophysics
Semantic Technologies for Big Sciences including Astrophysics
 
Citing data in research articles: principles, implementation, challenges - an...
Citing data in research articles: principles, implementation, challenges - an...Citing data in research articles: principles, implementation, challenges - an...
Citing data in research articles: principles, implementation, challenges - an...
 
Using Open Science to advance science - advancing open data
Using Open Science to advance science - advancing open data Using Open Science to advance science - advancing open data
Using Open Science to advance science - advancing open data
 
Data management (1)
Data management (1)Data management (1)
Data management (1)
 
On community-standards, data curation and scholarly communication - BITS, Ita...
On community-standards, data curation and scholarly communication - BITS, Ita...On community-standards, data curation and scholarly communication - BITS, Ita...
On community-standards, data curation and scholarly communication - BITS, Ita...
 
Opportunistic Persistent Data Storage
Opportunistic Persistent Data StorageOpportunistic Persistent Data Storage
Opportunistic Persistent Data Storage
 
BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...
 
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
 
2016 Bio-IT World Cell Line Coordination Poster 2016-04-05v1
2016 Bio-IT World Cell Line Coordination Poster 2016-04-05v12016 Bio-IT World Cell Line Coordination Poster 2016-04-05v1
2016 Bio-IT World Cell Line Coordination Poster 2016-04-05v1
 
re3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositoriesre3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositories
 
FAIR sequencing data repository based on iRODS
FAIR sequencing data repository based on iRODSFAIR sequencing data repository based on iRODS
FAIR sequencing data repository based on iRODS
 
Dats nih-dccpc-kc7-april2018-prs-uoxf
Dats  nih-dccpc-kc7-april2018-prs-uoxfDats  nih-dccpc-kc7-april2018-prs-uoxf
Dats nih-dccpc-kc7-april2018-prs-uoxf
 
Interlinking educational data to Web of Data (Thesis presentation)
Interlinking educational data to Web of Data (Thesis presentation)Interlinking educational data to Web of Data (Thesis presentation)
Interlinking educational data to Web of Data (Thesis presentation)
 
Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...
 
Use of Research (Meta-)Data - Finding researchers in/across organizations -
Use of Research (Meta-)Data  - Finding researchers in/across organizations -Use of Research (Meta-)Data  - Finding researchers in/across organizations -
Use of Research (Meta-)Data - Finding researchers in/across organizations -
 

Destacado

כלים ושיטות לבניית אתרים תקניים, נגישים ועשירים יותר
כלים ושיטות לבניית אתרים תקניים, נגישים ועשירים יותרכלים ושיטות לבניית אתרים תקניים, נגישים ועשירים יותר
כלים ושיטות לבניית אתרים תקניים, נגישים ועשירים יותרIsraeli Internet Association technology committee
 
Impact of climate change policy on the National Electricity Market
Impact of climate change policy on the National Electricity MarketImpact of climate change policy on the National Electricity Market
Impact of climate change policy on the National Electricity MarketEngineers Australia
 
Sharon Dawes (CTG Albany) Open data quality: a practical view
Sharon Dawes (CTG Albany) Open data quality: a practical viewSharon Dawes (CTG Albany) Open data quality: a practical view
Sharon Dawes (CTG Albany) Open data quality: a practical viewOpen City Foundation
 
Get on the Linked Data Web!
Get on the Linked Data Web!Get on the Linked Data Web!
Get on the Linked Data Web!Armin Haller
 
טכנולוגיות אינטרנט מתפתחות - אפליקציות ווב, מובייל, והווב הסמנטי
טכנולוגיות אינטרנט מתפתחות - אפליקציות ווב, מובייל, והווב הסמנטיטכנולוגיות אינטרנט מתפתחות - אפליקציות ווב, מובייל, והווב הסמנטי
טכנולוגיות אינטרנט מתפתחות - אפליקציות ווב, מובייל, והווב הסמנטיIsraeli Internet Association technology committee
 
WWW2014 Overview of W3C Linked Data Platform 20140410
WWW2014 Overview of W3C Linked Data Platform 20140410WWW2014 Overview of W3C Linked Data Platform 20140410
WWW2014 Overview of W3C Linked Data Platform 20140410Arnaud Le Hors
 
What is hot on the web right now - A W3C perspective
What is hot on the web right now - A W3C perspectiveWhat is hot on the web right now - A W3C perspective
What is hot on the web right now - A W3C perspectiveArmin Haller
 
טכנולוגיות אינטרנט מתפתחות אפליקציות ווב, מובייל, והווב הסמנטי
טכנולוגיות אינטרנט מתפתחות אפליקציות ווב, מובייל, והווב הסמנטיטכנולוגיות אינטרנט מתפתחות אפליקציות ווב, מובייל, והווב הסמנטי
טכנולוגיות אינטרנט מתפתחות אפליקציות ווב, מובייל, והווב הסמנטיIsraeli Internet Association technology committee
 
Releasing the People's Data
Releasing the People's DataReleasing the People's Data
Releasing the People's DataOpen Data @ CTIC
 
שיטות לפיתוח אפליקציות ווב למכשירים ניידים - מובייל מונדי 28 ביוני 2010
שיטות לפיתוח אפליקציות ווב למכשירים ניידים - מובייל מונדי 28 ביוני 2010שיטות לפיתוח אפליקציות ווב למכשירים ניידים - מובייל מונדי 28 ביוני 2010
שיטות לפיתוח אפליקציות ווב למכשירים ניידים - מובייל מונדי 28 ביוני 2010Israeli Internet Association technology committee
 
¿Por qué los servicios electrónicos se usan tan poco? cómo hacer frente a la ...
¿Por qué los servicios electrónicos se usan tan poco? cómo hacer frente a la ...¿Por qué los servicios electrónicos se usan tan poco? cómo hacer frente a la ...
¿Por qué los servicios electrónicos se usan tan poco? cómo hacer frente a la ...Open Data @ CTIC
 
Accessible Design with HTML5 - HTML5DevConf.com May 21st San Francisco, 2012 ...
Accessible Design with HTML5 - HTML5DevConf.com May 21st San Francisco, 2012 ...Accessible Design with HTML5 - HTML5DevConf.com May 21st San Francisco, 2012 ...
Accessible Design with HTML5 - HTML5DevConf.com May 21st San Francisco, 2012 ...Raj Lal
 
Hacia una Nube de Datos Públicos Enlazados
Hacia una Nube de Datos Públicos EnlazadosHacia una Nube de Datos Públicos Enlazados
Hacia una Nube de Datos Públicos EnlazadosOpen Data @ CTIC
 
Web Semântica: uma introdução
Web Semântica: uma introdução Web Semântica: uma introdução
Web Semântica: uma introdução Yasodara Cordova
 
Semantic Web Landscape 2009
Semantic Web Landscape 2009Semantic Web Landscape 2009
Semantic Web Landscape 2009LeeFeigenbaum
 

Destacado (20)

כלים ושיטות לבניית אתרים תקניים, נגישים ועשירים יותר
כלים ושיטות לבניית אתרים תקניים, נגישים ועשירים יותרכלים ושיטות לבניית אתרים תקניים, נגישים ועשירים יותר
כלים ושיטות לבניית אתרים תקניים, נגישים ועשירים יותר
 
Impact of climate change policy on the National Electricity Market
Impact of climate change policy on the National Electricity MarketImpact of climate change policy on the National Electricity Market
Impact of climate change policy on the National Electricity Market
 
Sharon Dawes (CTG Albany) Open data quality: a practical view
Sharon Dawes (CTG Albany) Open data quality: a practical viewSharon Dawes (CTG Albany) Open data quality: a practical view
Sharon Dawes (CTG Albany) Open data quality: a practical view
 
Get on the Linked Data Web!
Get on the Linked Data Web!Get on the Linked Data Web!
Get on the Linked Data Web!
 
טכנולוגיות אינטרנט מתפתחות - אפליקציות ווב, מובייל, והווב הסמנטי
טכנולוגיות אינטרנט מתפתחות - אפליקציות ווב, מובייל, והווב הסמנטיטכנולוגיות אינטרנט מתפתחות - אפליקציות ווב, מובייל, והווב הסמנטי
טכנולוגיות אינטרנט מתפתחות - אפליקציות ווב, מובייל, והווב הסמנטי
 
מכשירים חדשים - עתיד הווב הנייד
מכשירים חדשים - עתיד הווב הנייד מכשירים חדשים - עתיד הווב הנייד
מכשירים חדשים - עתיד הווב הנייד
 
WWW2014 Overview of W3C Linked Data Platform 20140410
WWW2014 Overview of W3C Linked Data Platform 20140410WWW2014 Overview of W3C Linked Data Platform 20140410
WWW2014 Overview of W3C Linked Data Platform 20140410
 
What is hot on the web right now - A W3C perspective
What is hot on the web right now - A W3C perspectiveWhat is hot on the web right now - A W3C perspective
What is hot on the web right now - A W3C perspective
 
פרסום נתונים ממשלתיים לציבור
פרסום נתונים ממשלתיים לציבורפרסום נתונים ממשלתיים לציבור
פרסום נתונים ממשלתיים לציבור
 
כלים ושיטות להנגשת אתרי אינטרנט
כלים ושיטות להנגשת אתרי אינטרנטכלים ושיטות להנגשת אתרי אינטרנט
כלים ושיטות להנגשת אתרי אינטרנט
 
טכנולוגיות אינטרנט מתפתחות אפליקציות ווב, מובייל, והווב הסמנטי
טכנולוגיות אינטרנט מתפתחות אפליקציות ווב, מובייל, והווב הסמנטיטכנולוגיות אינטרנט מתפתחות אפליקציות ווב, מובייל, והווב הסמנטי
טכנולוגיות אינטרנט מתפתחות אפליקציות ווב, מובייל, והווב הסמנטי
 
Semntic Web Intro Eyal Sela
Semntic Web Intro  Eyal SelaSemntic Web Intro  Eyal Sela
Semntic Web Intro Eyal Sela
 
Releasing the People's Data
Releasing the People's DataReleasing the People's Data
Releasing the People's Data
 
שיטות לפיתוח אפליקציות ווב למכשירים ניידים - מובייל מונדי 28 ביוני 2010
שיטות לפיתוח אפליקציות ווב למכשירים ניידים - מובייל מונדי 28 ביוני 2010שיטות לפיתוח אפליקציות ווב למכשירים ניידים - מובייל מונדי 28 ביוני 2010
שיטות לפיתוח אפליקציות ווב למכשירים ניידים - מובייל מונדי 28 ביוני 2010
 
¿Por qué los servicios electrónicos se usan tan poco? cómo hacer frente a la ...
¿Por qué los servicios electrónicos se usan tan poco? cómo hacer frente a la ...¿Por qué los servicios electrónicos se usan tan poco? cómo hacer frente a la ...
¿Por qué los servicios electrónicos se usan tan poco? cómo hacer frente a la ...
 
Accessible Design with HTML5 - HTML5DevConf.com May 21st San Francisco, 2012 ...
Accessible Design with HTML5 - HTML5DevConf.com May 21st San Francisco, 2012 ...Accessible Design with HTML5 - HTML5DevConf.com May 21st San Francisco, 2012 ...
Accessible Design with HTML5 - HTML5DevConf.com May 21st San Francisco, 2012 ...
 
Hacia una Nube de Datos Públicos Enlazados
Hacia una Nube de Datos Públicos EnlazadosHacia una Nube de Datos Públicos Enlazados
Hacia una Nube de Datos Públicos Enlazados
 
Web Semântica: uma introdução
Web Semântica: uma introdução Web Semântica: uma introdução
Web Semântica: uma introdução
 
Open data quality
Open data qualityOpen data quality
Open data quality
 
Semantic Web Landscape 2009
Semantic Web Landscape 2009Semantic Web Landscape 2009
Semantic Web Landscape 2009
 

Similar a Metadata for Research Objects

RO Advisory Kickoff Slides
RO Advisory Kickoff SlidesRO Advisory Kickoff Slides
RO Advisory Kickoff Slidesseanb
 
Research Objects @ HARMONY 2014
Research Objects @ HARMONY 2014Research Objects @ HARMONY 2014
Research Objects @ HARMONY 2014seanb
 
Data sharing as part of the research ecosystem
Data sharing as part of the research ecosystemData sharing as part of the research ecosystem
Data sharing as part of the research ecosystemVarsha Khodiyar
 
Wilson-npg-scientific data-nfdp13
Wilson-npg-scientific data-nfdp13Wilson-npg-scientific data-nfdp13
Wilson-npg-scientific data-nfdp13DataDryad
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...Sarah Anna Stewart
 
The Research Object Initiative: Frameworks and Use Cases
The Research Object Initiative:Frameworks and Use CasesThe Research Object Initiative:Frameworks and Use Cases
The Research Object Initiative: Frameworks and Use CasesCarole Goble
 
Scientific Data and peer review session at Dryad event, May 2015
Scientific Data and peer review session at Dryad event, May 2015 Scientific Data and peer review session at Dryad event, May 2015
Scientific Data and peer review session at Dryad event, May 2015 Susanna-Assunta Sansone
 
Research Objects Tutorial (TPDL)
Research Objects Tutorial (TPDL)Research Objects Tutorial (TPDL)
Research Objects Tutorial (TPDL)dgarijo
 
Effective research data management
Effective research data managementEffective research data management
Effective research data managementCatherine Gold
 
The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research ObjectsCarole Goble
 
Ten habits of highly effective data
Ten habits of highly effective dataTen habits of highly effective data
Ten habits of highly effective dataAnita de Waard
 
SEEK for Science: A Data and Model Management Platform to support Open and Re...
SEEK for Science: A Data and Model Management Platform to support Open and Re...SEEK for Science: A Data and Model Management Platform to support Open and Re...
SEEK for Science: A Data and Model Management Platform to support Open and Re...Carole Goble
 
How to expose research data in EOSC
How to expose research data in EOSCHow to expose research data in EOSC
How to expose research data in EOSCEUDAT
 
The habits of highly successful data:
The habits of highly successful data: The habits of highly successful data:
The habits of highly successful data: Anita de Waard
 
re3data.org – a Registry of Research Data Repositories
re3data.org – a Registry of Research Data Repositoriesre3data.org – a Registry of Research Data Repositories
re3data.org – a Registry of Research Data RepositoriesHeinz Pampel
 

Similar a Metadata for Research Objects (20)

RO Advisory Kickoff Slides
RO Advisory Kickoff SlidesRO Advisory Kickoff Slides
RO Advisory Kickoff Slides
 
Research Objects @ HARMONY 2014
Research Objects @ HARMONY 2014Research Objects @ HARMONY 2014
Research Objects @ HARMONY 2014
 
A Clean Slate?
A Clean Slate?A Clean Slate?
A Clean Slate?
 
FAIRer Research
FAIRer ResearchFAIRer Research
FAIRer Research
 
Enhance your rese​arch impact through open science
Enhance your rese​arch impact through open scienceEnhance your rese​arch impact through open science
Enhance your rese​arch impact through open science
 
Data sharing as part of the research ecosystem
Data sharing as part of the research ecosystemData sharing as part of the research ecosystem
Data sharing as part of the research ecosystem
 
Wilson-npg-scientific data-nfdp13
Wilson-npg-scientific data-nfdp13Wilson-npg-scientific data-nfdp13
Wilson-npg-scientific data-nfdp13
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
 
The Research Object Initiative: Frameworks and Use Cases
The Research Object Initiative:Frameworks and Use CasesThe Research Object Initiative:Frameworks and Use Cases
The Research Object Initiative: Frameworks and Use Cases
 
Credible workshop
Credible workshopCredible workshop
Credible workshop
 
Scientific Data and peer review session at Dryad event, May 2015
Scientific Data and peer review session at Dryad event, May 2015 Scientific Data and peer review session at Dryad event, May 2015
Scientific Data and peer review session at Dryad event, May 2015
 
Research Objects Tutorial (TPDL)
Research Objects Tutorial (TPDL)Research Objects Tutorial (TPDL)
Research Objects Tutorial (TPDL)
 
Effective research data management
Effective research data managementEffective research data management
Effective research data management
 
The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research Objects
 
Ten habits of highly effective data
Ten habits of highly effective dataTen habits of highly effective data
Ten habits of highly effective data
 
SEEK for Science: A Data and Model Management Platform to support Open and Re...
SEEK for Science: A Data and Model Management Platform to support Open and Re...SEEK for Science: A Data and Model Management Platform to support Open and Re...
SEEK for Science: A Data and Model Management Platform to support Open and Re...
 
How to expose research data in EOSC
How to expose research data in EOSCHow to expose research data in EOSC
How to expose research data in EOSC
 
The habits of highly successful data:
The habits of highly successful data: The habits of highly successful data:
The habits of highly successful data:
 
re3data.org – a Registry of Research Data Repositories
re3data.org – a Registry of Research Data Repositoriesre3data.org – a Registry of Research Data Repositories
re3data.org – a Registry of Research Data Repositories
 
Scholze imcw 2014-11-25
Scholze imcw 2014-11-25Scholze imcw 2014-11-25
Scholze imcw 2014-11-25
 

Más de seanb

Linked Data Publication of Live Music Archives and Analyses
Linked Data Publication of Live Music Archives and AnalysesLinked Data Publication of Live Music Archives and Analyses
Linked Data Publication of Live Music Archives and Analysesseanb
 
Animation 14: Computer Science and Music
Animation 14: Computer Science and MusicAnimation 14: Computer Science and Music
Animation 14: Computer Science and Musicseanb
 
Linked Data Publication of Live Music Archives
Linked Data Publication of Live Music ArchivesLinked Data Publication of Live Music Archives
Linked Data Publication of Live Music Archivesseanb
 
Ontologies and Vocabularies
Ontologies and VocabulariesOntologies and Vocabularies
Ontologies and Vocabulariesseanb
 
OeRC Seminar
OeRC SeminarOeRC Seminar
OeRC Seminarseanb
 
Scientific Social Objects
Scientific Social ObjectsScientific Social Objects
Scientific Social Objectsseanb
 
OAI7 Research Objects
OAI7 Research ObjectsOAI7 Research Objects
OAI7 Research Objectsseanb
 
FISHLink Presentation at JISC MRD Workshop
FISHLink Presentation at JISC MRD WorkshopFISHLink Presentation at JISC MRD Workshop
FISHLink Presentation at JISC MRD Workshopseanb
 
SKOS, Past, Present and Future
SKOS, Past, Present and FutureSKOS, Past, Present and Future
SKOS, Past, Present and Futureseanb
 
Semantic Web for Multimedia
Semantic Web for MultimediaSemantic Web for Multimedia
Semantic Web for Multimediaseanb
 

Más de seanb (10)

Linked Data Publication of Live Music Archives and Analyses
Linked Data Publication of Live Music Archives and AnalysesLinked Data Publication of Live Music Archives and Analyses
Linked Data Publication of Live Music Archives and Analyses
 
Animation 14: Computer Science and Music
Animation 14: Computer Science and MusicAnimation 14: Computer Science and Music
Animation 14: Computer Science and Music
 
Linked Data Publication of Live Music Archives
Linked Data Publication of Live Music ArchivesLinked Data Publication of Live Music Archives
Linked Data Publication of Live Music Archives
 
Ontologies and Vocabularies
Ontologies and VocabulariesOntologies and Vocabularies
Ontologies and Vocabularies
 
OeRC Seminar
OeRC SeminarOeRC Seminar
OeRC Seminar
 
Scientific Social Objects
Scientific Social ObjectsScientific Social Objects
Scientific Social Objects
 
OAI7 Research Objects
OAI7 Research ObjectsOAI7 Research Objects
OAI7 Research Objects
 
FISHLink Presentation at JISC MRD Workshop
FISHLink Presentation at JISC MRD WorkshopFISHLink Presentation at JISC MRD Workshop
FISHLink Presentation at JISC MRD Workshop
 
SKOS, Past, Present and Future
SKOS, Past, Present and FutureSKOS, Past, Present and Future
SKOS, Past, Present and Future
 
Semantic Web for Multimedia
Semantic Web for MultimediaSemantic Web for Multimedia
Semantic Web for Multimedia
 

Último

Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxBroad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxjana861314
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 

Último (20)

Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxBroad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 

Metadata for Research Objects

  • 1. Sean Bechhofer sean.bechhofer@manchester.ac.uk @seanbechhofer Making Metadata Work, ISKO London, 23rd June 2014 Metadata for Research Objects 1
  • 2. Publication • Publications are about argumentation: Convince the reader of the validity of a position – Reproducible Results System: facilitates enactment and publication of reproducible research. • Results are reinforced by reproducability – Explicit representation of method. • Verifiability as a key factor in scientific discovery. J. Mesirov Accessible Reproducible Research Science 327(5964), p.415-416, 2010 doi:10.1126/science.1179653 Stodden et. al. Reproducible Research: Addressing the Need for Data and Code Sharing in Computational Science Computing in Science and Engineering 12(5), p.8-13, 2010 doi:10.1109/MCSE.2010.113 C.Goble et. al. Accelerating Scientists’ Knowledge Turns Communications in Computer and Information Science Volume 348, 2013, pp 3-25 doi:10.1007/978-3-642-37186-8_1
  • 3. Reproducible Science 3 Goble: SSI Collaborations Workshop 2014
  • 4. Scientific Workflows 4 » Scientific workflows are at the heart of experimental science › Enable automation of scientific methods › Support experimental reproducibility › Encourage best practices » There is then a need to preserve these workflows › Scientific development based on method reuse and repurpose › Conservation is key » Workflow preservation is a multidimensional challenge › Representation of complex objects › Decay analysis, diagnosis, and prevention › Social Objects that can be inspected, reused, repurposed Preservation of scientific workflows in data-intensive science
  • 5. Preservation Technical Multi-step computational process Repeatable and comparative Explicate computation Social Virtual Witnessing Transparent, precise, citable documentation Accurate provenance logs Reusable protocols, know-how, best practice Can I review / repeat your method? Can I defend my method? Can I reuse / reproduce this method?
  • 6. Context: Semantic Web and Linked Data • SW: Explicit machine-readable representation of information • LD: A set of best practices for publishing and connecting data on the Web 1. Use URIs to name things 2. Use dereferencable HTTP URIs 3. Provide useful content on lookup using standards 4. Include links to other stuff 6
  • 7. • An aggregation object that bundles together experimental resources that are essential to a computational scientific study or investigation. – data used – results produced in an experiment study; – (computational) methods employed to produce and analyse that data; – people involved in the investigation. • Plus annotation information that provides additional information about both the bundle itself and the resources of the bundle – descriptions – provenance Research Objects 7
  • 8. ROs as a Currency 8 Creator Contributor Collaborator Comparator Re-User Evaluator Reviewer Trainee Trainer Reader Publisher Curator Librarian Repository Manager
  • 9. • Three principles underlie the approach: • Identity – Referring to resources (and the aggregation itself) • Aggregation – Describing the aggregation structure and its constituent parts • Annotation – Associating information with aggregated resources. Research Objects 9
  • 10. Identity • Mechanisms for referring to the resources that are aggregated within a Research Object • URIs – Web Resources • DOIs – Documents/papers/datasets • ORCID IDs – Researchers 10
  • 11. Identifier Issues • HTTP URIs provide both access and identification • PIDs: Persistent Identifiers (e.g.DOIs) tend to resolve to human-readable landing pages – With embedded links to further (possibly machine- readable) resources • ROs seen as non-information resources with descriptive (RDF) metadata – Redirection/negotiation – Standard patterns for Linked Data resources • Bidirectional mappings between URIs and PIDs • Versioning through, e.g. Memento 11 H. Van de Sompel et. al. Persistent Identifiers for Scholarly Assets and the Web: The Need for an Unambiguous Mapping 9th International Digital Curation Conference
  • 12. Aggregation • Open Archives Initiation Object Reuse and Exchange (OAI ORE) is a standard for describing aggregations of web resources – http://www.openarchives.org/ore/ • Uses a Resource Map to describe the aggregated resources • Proxies allow for statements about the resources within the aggregation – Capturing context and viewpoints • Several concrete serialisations – RDF/XML, Atom, RDFa 12 Graceful Degradation
  • 13. Annotation • Open Annotation specification is a community developed data model for annotation of web resources – http://www.openannotation.org/spec/core/ • Developed by the W3C Open Annotation Community Group • Allows for “stand-off” annotations – Annotation as a first class citizen • Developed to fit with Web Architecture 13 Graceful Degradation
  • 14. Annotation Content • Essential to the understanding and interpretation of the scientific outcomes captured by a Research Object as well as the reuse of the resources within it. – Provenance information about the experiments, the study or any other experimental resources – Evolution information about the Research Object and its resources, – Descriptions of computational methods or processes – Dependency information or settings about the experiment executions 14
  • 15. Core & Extensions • Core model provides support for aggregation and annotation • Extensions provide additional vocabularies for domain specific tasks • Workflow Provenance – Information capturing workflow executions • Workflow Description – Abstractions describing Processes, inputs and outputs • Research Object Evolution – Information describing change and “snapshots” 15
  • 17. Provenance • W3C’s PROV model allows for capture of information relating to – Attribution  Who did it? – Derivation  Data sources used – Activities  What happened (and when) • Significant eco-system (generators, viewers, consumers) has grown up around PROV – IPAW & TAPP 17 Copyright © 2013 W3C® (MIT, ERCIM, Keio, Beihang), All Rights Reserved.
  • 19. preservation and access to preserved ROs as depicted in Figure 6. Optionally, an external repository may used to support the frequently evolving research objects. The repositories may be housed in a single multiple physical repositories, and use the same or differing technologies (e.g. a repository may use a dig preservation solution for the Preservation Repository and specialized digital library solution for the Acce Repository). Additionally, as the Preservation Repository does not have the same interactive u requirements as the access and live repositories, it could be implemented with slower (or offline) stora alternatives. Figure 6. Conceptual Archival System Storage Architecture. ROs and OAIS • ROs as Information Packages in OAIS • myExperiment as live/access repository • ROHUB as archival repository 19
  • 20. SCAPE: Planning and Watch 20 Watch OperationsPlanning Env & Users Repository plan deploy monitor monitor monitor access ingest, harvest execution http://www.scape-project.eu/ • SCAPE project concerned with Digital Preservation. • Planning and Watch infrastructure to helpmmonitor the state of a repository and co-ordinate appropriate actions • Driven by policies.
  • 21. myExperiment and RODL Decay, Service Deprecation, Data source monitoring, Checklists, Minimal Models Wf4Ever: Monitoring and Watch 21 Watch OperationsPlanning Env & Users Repository plan deploy monitor monitor monitor access ingest, harvest execution • Ideas applied to workflow preservation
  • 22. Decay • Survey of 92 Taverna workflows from myExperiment • Volatile Third-Party Resources • Missing Data • Missing Execution Environments • Poor descriptions 22 Belhajjame et. al. Why workflows break — Understanding and combating decay in Taverna workflows e-Science 2012 doi:10.1109/eScience.2012.6404482 (a) An overview of the decay causes. (b) Workflow decay due to third party resources. Fig. 3. Summary of workflow decay causes.
  • 23. Checklists and Validation • Checklists widely used to support safety, quality and consistency • Common in experimental science – Expressing minimum information required – Supporting “health” monitoring of workflow-centric ROs. • Checklists can be defined in terms of the RO model and its annotations – Generic checklist service then executes against that model and the given annotations – Provenance 23
  • 24. Minim Data Model pliant” or “ minimally compliant” with a checklist if it satisfies all of its MAY, SHOULD or MUST items respectively. Fig. 1. An overview of the Minim model schema. Checklist Requirement QueryTestRule SparqlQuery Result modifier (string) Query pattern (string)Rule CardinalityTest Min cardinality (integer) AggregationTest URI template (string) Max cardinality (integer) min max affirmRuleaggregatesTemplate hasRequirement: hasMustRequirement hasShouldRequirement hasMayRequirement isLiveTemplate sparql_query result_mod toModel Notation key: Explicit entity Implicit (super)class Literal value (type) property query graph QueryResultTest RuleTest exists 0..1 0..1 1 1 0..1 0..1 1 1 1 1..* SoftwareEnvRule URI template (string) Query AccessibilityTest URI template (string) ExistsTest Rule max 1 1 Query Model isDerivedBy 1..1 Our Minim data model (see Figure 1) provides 4 core constructs to express a quality requirement: 24 Zhao et. al. A Checklist-Based Approach for Quality Assessment of Scientific Information 3rd In. Workshop on Linked Science, 2013
  • 27. RO Bundle • A single, transferable object encapsulating the description and resources of an RO – Download, transfer, publish • ZIP-based format (resources) plus a manifest describing aggregation and annotations (description) – Unpack with standard tooling • JSON-LD as a representation for manifest – Lightweight linked-data format – Compatible with existing JSON tooling and services – PROV-O and OAC for annotations 27 http://wf4ever.github.io/ro/bundle/
  • 28. Bundling via git/Zenodo/figshare • Scientist works with local folder structure. – Version management via github. – Local tooling produces metadata description – Metadata about the aggregation (and its resources) provided by “hidden folder” • Zenodo/figshare pull snapshot from github – Providing DOIs for the aggregrations – Additional release cycles can prompt new DOIs 28
  • 33. Code as a Research Object 33
  • 37. Wrap Up • Aggregation objects bundling together experimental resources that are essential to a computational scientific study or investigation – Intended to support greater transparency and reproducability • Annotations provide additional information about the bundle and its contents – Metadata is key here • Use of existing standards, vocabularies and infrastructure • Nascent tooling to support creation, management and publication 37
  • 38. Thanks! • All the members of the Wf4Ever team – iSOCO: Intelligent Software Components S.A., Spain – University of Manchester, School of Computer Science, Manchester, United Kingdom – University of Oxford, Department of Zoology, Oxford, UK – Poznan Supercomputing and Networking Center. Poznan, Poland – IAA: Instituto de Astrofísica de Andalucía, Granada, Spain – Leiden University Medical Centre, Centre for Human and Clinical Genetics, The Netherlands • Colleagues in Manchester’s Information Management Group • RO Advisory Board Members 38 http://www.researchobject.org http://www.wf4ever-project.org

Notas del editor

  1. Metadata to support reproducibility. What does that mean? What do we need to do? How do we do it? Will run through the approach that was taken, and some of the vocabs and standards that are being used to do it.
  2. What’s the purpose of publication? Publications intended to present results/positions, along with arguments that reinforce those positions. Reproducability reinforces the validity of our positions. May require us to include much more information than can be included in a paper:in particular, data sets and methods.
  3. Understanding the different roles that are involved in supporting the scientific lifecycle and experimental process.
  4. One of the key issue is that HTTP URIs serve multiple purposes. They are identifiers, but also serve as a mechanism for locating or accessing the content. PIDs, on the other hand tend to involve a resolution or redirection process which guides us to the content. Commonly that resolution ends up on a landing page though – for example DOIs usually resolve to a web page, that may then provide embedded links to further resources. We can consider ROs as non-information resources (things who’s distinguishing characteristics can’t be conveyed in a message). On resolving the ID for such a thing we get descriptive metadata about it (but not the thing itself). This is a common pattern used for Linked Data resources. Herbert proposes a bidirectional mapping between PIDs and the HTTP URIs that provide access to the informaiton about them. So we can go from PID to stuff, and from stuff to the PID that it is about. Approaches like Memento could then be applied to support versioning. I don’t think there are necessarily any deep problems lurking here – it’s more about the way in which services are set up and establishing convention and practice.
  5. Lose this?
  6. Local folder/file structures – experiences with our astronomy users. Use github for version management. Local tooling produces metadata descriptions.
  7. Example RO in zenodo
  8. Example RO in figshare. Cf Code as a research object.
  9. Work by Dani Garijo of UPM. Web page generated from metadata about papers. RO includes information about the materials provided.
  10. Systems biology bundling. Experiments in mapping between COMBINE archives and ROs. http://co.mbine.org/documents/archive