SlideShare una empresa de Scribd logo
1 de 1
Descargar para leer sin conexión
myExperiment Research Objects: Beyond Workflows and Packs
Stian Soiland-Reyes1, Don Cruickshank2, Finn Bacall1, Jun Zhao2, Khalid Belhajjame1, David De Roure3, Carole A. Goble1
1 School of Computer Science, University of Manchester, UK
2 Department of Zoology, University of Oxford, UK
3 Oxford e-Research Centre, University of Oxford, UK
ABSTRACT
We have evolved Research Objects as a mechanism to preserve digital resources
related to research, by providing mechanisms, formats and architecture for
describing aggregated resources (hypothesis, workflow, datasets, scripts, services),
their relations (is input for, explains, used by), provenance (graph was derived from
dataset A, B and C) and attribution (who contributed what, and when?).
The website myExperiment is already popular for collaborating on, publishing and
sharing scientific workflows, however we have found that for understanding and
preserving a workflow over time, its definition is not enough, specially faced with
workflow decay, services and tools that change over time. We have therefore
adapted the research object model as a foundation for the myExperiment packs,
allowing uploading of workflow runs, inputs, outputs and other files relevant to the
workflow, relating them with annotations and integrated the Wf4Ever architecture
for performing decay analysis and tracking a research object’s evolution as it and its
constituent resources change over time.
MAKING RESEARCH OBJECTS
myExperiment is a website for collaboration and sharing of experiments, in
particular scientific workflows. We are enhancing myExperiment’s packs to be based
on the Research Object model, allowing users to form collections of workflows,
example input data, results, presentation slides, hypothesis, workflow runs and
documentation, effectively building a Research Object (RO). These uploaded
resources can then be further related, typed, described and given their own
attribution provenance record. ROs are versioned and shareable.
WF4EVER ARCHITECTURE
The architecture for Research Objects is realized as a Linked Data platform of
RESTful web services that support preservation aspects such as decay monitoring
and evolution tracking, presented to the user through a regular web interface on
myExperiment.
RESEARCH OBJECT MODEL
A research object (RO) is described in an RDF manifest which lists the aggregated
resources and their annotations as separate RDF graphs containing user
annotations (title, description, example value), typing information (hypothesis,
workflow, input data, etc) and automatically extracted metadata (provenance,
workflow structure).
The ontologies for the RO Model is based on standards for aggregations (OAI-ORE)
and annotations (Annotation Ontology, W3C Open Annotation Core OAC).
RESEARCH OBJECTS AS FILES
A RO Bundle is a JSON-LD-based serialization of a research object as a ZIP file
(Adobe UCF, ePub), allowing a hybrid of embedded resources and external
references (URIs). This allows a self-contained RO to be downloaded, transferred,
modified and inspected without requiring a dedicated web server, well suited for
desktop environments such as scientific workflow systems.
Taverna uses RO Bundle to make a workflow run bundle, a single file that contains
the input and output values, the workflow definition, complete with provenance
of the run and intermediate values. This allows sharing of a workflow run, e.g.
uploading to myExperiment, and later reloaded in a different Taverna installation.
This work was enabled by the Wf4Ever project funded by the European Commission’s 7th FWP
(FP7-ICT-2007-6 270192), and the myGrid platform grant by the EPSRC (EP/G026238/1)
Project sites http://www.myexperiment.org/ http://www.wf4ever-project.org/
Source code http://myexperiment.rubyforge.org/svn/ https://github.com/wf4ever/
License BSD 3-Clause License MIT license
http://www.researchobject.org/
Research
Object
Resource
Resource
Resource
Annotation
Annotation
Annotation
oa:hasTarget
oa:hasBody
ore:aggregates
Manifest
Annotation graph
http://purl.org/wf4ever/model
outputA.txt
outputC.jpg
outputB/
intermediates/
1.txt
2.txt
3.txt
de/def2e58b-50e2-4949-9980-fd310166621a.txt
inputA.txt
workflow
URI
references
attribution
execution
environment
Aggregating in Research Object
ZIP folder structure (RO Bundle)
mimetype
application/vnd.wf4ever.robundle+zip
workflowrun.prov.ttl
(provenance)
.ro/manifest.json
http://alpha.myexperiment.org/packs/387
http://sandbox.wf4ever-project.org/portal/
The Research Object is stored and manipulated in a Research Object Digital Library
using REST APIs, allowing any tools to view and modify the RO, like the RO portal:
https://w3id.org/bundle
REST API
RDF triple store
(RO structure,
Annotations)
RO index
Uploaded files
RO Portal
Checklist
service
...

Más contenido relacionado

Más de Stian Soiland-Reyes

2013-07-19 myExperiment research objects, beyond workflows and packs (PPTX)
2013-07-19 myExperiment research objects, beyond workflows and packs (PPTX)2013-07-19 myExperiment research objects, beyond workflows and packs (PPTX)
2013-07-19 myExperiment research objects, beyond workflows and packs (PPTX)
Stian Soiland-Reyes
 

Más de Stian Soiland-Reyes (15)

2014-10-31 Taverna 3 architecture
2014-10-31 Taverna 3 architecture2014-10-31 Taverna 3 architecture
2014-10-31 Taverna 3 architecture
 
2014-10-30 Taverna 3 status
2014-10-30 Taverna 3 status2014-10-30 Taverna 3 status
2014-10-30 Taverna 3 status
 
2014-10-30 Taverna as an Apache Incubator project
2014-10-30 Taverna as an Apache Incubator project2014-10-30 Taverna as an Apache Incubator project
2014-10-30 Taverna as an Apache Incubator project
 
2013-07-19 myExperiment research objects, beyond workflows and packs (PPTX)
2013-07-19 myExperiment research objects, beyond workflows and packs (PPTX)2013-07-19 myExperiment research objects, beyond workflows and packs (PPTX)
2013-07-19 myExperiment research objects, beyond workflows and packs (PPTX)
 
2013 06-24 Wf4Ever: Annotating research objects (PDF)
2013 06-24 Wf4Ever: Annotating research objects (PDF)2013 06-24 Wf4Ever: Annotating research objects (PDF)
2013 06-24 Wf4Ever: Annotating research objects (PDF)
 
2013 06-24 Wf4Ever: Annotating research objects (PPTX)
2013 06-24 Wf4Ever: Annotating research objects (PPTX)2013 06-24 Wf4Ever: Annotating research objects (PPTX)
2013 06-24 Wf4Ever: Annotating research objects (PPTX)
 
2013-05-29 Taverna Provenance
2013-05-29 Taverna Provenance2013-05-29 Taverna Provenance
2013-05-29 Taverna Provenance
 
2013-03-21 What can provenance do for me?
2013-03-21 What can provenance do for me?2013-03-21 What can provenance do for me?
2013-03-21 What can provenance do for me?
 
2013-01-17 Research Object
2013-01-17 Research Object2013-01-17 Research Object
2013-01-17 Research Object
 
2012 03-28 Wf4ever, preserving workflows as digital research objects
2012 03-28 Wf4ever, preserving workflows as digital research objects2012 03-28 Wf4ever, preserving workflows as digital research objects
2012 03-28 Wf4ever, preserving workflows as digital research objects
 
2011 07-06 SCUFL2 Poster - because a workflow is more than its definition (BO...
2011 07-06 SCUFL2 Poster - because a workflow is more than its definition (BO...2011 07-06 SCUFL2 Poster - because a workflow is more than its definition (BO...
2011 07-06 SCUFL2 Poster - because a workflow is more than its definition (BO...
 
2011-06-08 Taverna workflow system
2011-06-08 Taverna workflow system2011-06-08 Taverna workflow system
2011-06-08 Taverna workflow system
 
Taverna workflow management system (2010 11-30 Bath Workflow Tools) PPTX
Taverna workflow management system (2010 11-30 Bath Workflow Tools) PPTXTaverna workflow management system (2010 11-30 Bath Workflow Tools) PPTX
Taverna workflow management system (2010 11-30 Bath Workflow Tools) PPTX
 
Taverna workflow management system (2010 11-30 Bath Workflow Tools)
Taverna workflow management system (2010 11-30 Bath Workflow Tools)Taverna workflow management system (2010 11-30 Bath Workflow Tools)
Taverna workflow management system (2010 11-30 Bath Workflow Tools)
 
Bringing caBIG services together using Taverna
Bringing caBIG services together using TavernaBringing caBIG services together using Taverna
Bringing caBIG services together using Taverna
 

Último

Breaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdfBreaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdf
UK Journal
 

Último (20)

IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and Planning
 
Intro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджераIntro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджера
 
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
 
Google I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGoogle I/O Extended 2024 Warsaw
Google I/O Extended 2024 Warsaw
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdf
 
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
 
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxUnpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
 
Using IESVE for Room Loads Analysis - UK & Ireland
Using IESVE for Room Loads Analysis - UK & IrelandUsing IESVE for Room Loads Analysis - UK & Ireland
Using IESVE for Room Loads Analysis - UK & Ireland
 
Overview of Hyperledger Foundation
Overview of Hyperledger FoundationOverview of Hyperledger Foundation
Overview of Hyperledger Foundation
 
AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101
 
Breaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdfBreaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdf
 
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John Staveley
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
 
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptxWSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
 
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomSalesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
 
ECS 2024 Teams Premium - Pretty Secure
ECS 2024   Teams Premium - Pretty SecureECS 2024   Teams Premium - Pretty Secure
ECS 2024 Teams Premium - Pretty Secure
 
WebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceWebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM Performance
 
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfLinux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
 
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdfThe Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
 

2013 07-18 myExperiment research objects poster (PDF)

  • 1. myExperiment Research Objects: Beyond Workflows and Packs Stian Soiland-Reyes1, Don Cruickshank2, Finn Bacall1, Jun Zhao2, Khalid Belhajjame1, David De Roure3, Carole A. Goble1 1 School of Computer Science, University of Manchester, UK 2 Department of Zoology, University of Oxford, UK 3 Oxford e-Research Centre, University of Oxford, UK ABSTRACT We have evolved Research Objects as a mechanism to preserve digital resources related to research, by providing mechanisms, formats and architecture for describing aggregated resources (hypothesis, workflow, datasets, scripts, services), their relations (is input for, explains, used by), provenance (graph was derived from dataset A, B and C) and attribution (who contributed what, and when?). The website myExperiment is already popular for collaborating on, publishing and sharing scientific workflows, however we have found that for understanding and preserving a workflow over time, its definition is not enough, specially faced with workflow decay, services and tools that change over time. We have therefore adapted the research object model as a foundation for the myExperiment packs, allowing uploading of workflow runs, inputs, outputs and other files relevant to the workflow, relating them with annotations and integrated the Wf4Ever architecture for performing decay analysis and tracking a research object’s evolution as it and its constituent resources change over time. MAKING RESEARCH OBJECTS myExperiment is a website for collaboration and sharing of experiments, in particular scientific workflows. We are enhancing myExperiment’s packs to be based on the Research Object model, allowing users to form collections of workflows, example input data, results, presentation slides, hypothesis, workflow runs and documentation, effectively building a Research Object (RO). These uploaded resources can then be further related, typed, described and given their own attribution provenance record. ROs are versioned and shareable. WF4EVER ARCHITECTURE The architecture for Research Objects is realized as a Linked Data platform of RESTful web services that support preservation aspects such as decay monitoring and evolution tracking, presented to the user through a regular web interface on myExperiment. RESEARCH OBJECT MODEL A research object (RO) is described in an RDF manifest which lists the aggregated resources and their annotations as separate RDF graphs containing user annotations (title, description, example value), typing information (hypothesis, workflow, input data, etc) and automatically extracted metadata (provenance, workflow structure). The ontologies for the RO Model is based on standards for aggregations (OAI-ORE) and annotations (Annotation Ontology, W3C Open Annotation Core OAC). RESEARCH OBJECTS AS FILES A RO Bundle is a JSON-LD-based serialization of a research object as a ZIP file (Adobe UCF, ePub), allowing a hybrid of embedded resources and external references (URIs). This allows a self-contained RO to be downloaded, transferred, modified and inspected without requiring a dedicated web server, well suited for desktop environments such as scientific workflow systems. Taverna uses RO Bundle to make a workflow run bundle, a single file that contains the input and output values, the workflow definition, complete with provenance of the run and intermediate values. This allows sharing of a workflow run, e.g. uploading to myExperiment, and later reloaded in a different Taverna installation. This work was enabled by the Wf4Ever project funded by the European Commission’s 7th FWP (FP7-ICT-2007-6 270192), and the myGrid platform grant by the EPSRC (EP/G026238/1) Project sites http://www.myexperiment.org/ http://www.wf4ever-project.org/ Source code http://myexperiment.rubyforge.org/svn/ https://github.com/wf4ever/ License BSD 3-Clause License MIT license http://www.researchobject.org/ Research Object Resource Resource Resource Annotation Annotation Annotation oa:hasTarget oa:hasBody ore:aggregates Manifest Annotation graph http://purl.org/wf4ever/model outputA.txt outputC.jpg outputB/ intermediates/ 1.txt 2.txt 3.txt de/def2e58b-50e2-4949-9980-fd310166621a.txt inputA.txt workflow URI references attribution execution environment Aggregating in Research Object ZIP folder structure (RO Bundle) mimetype application/vnd.wf4ever.robundle+zip workflowrun.prov.ttl (provenance) .ro/manifest.json http://alpha.myexperiment.org/packs/387 http://sandbox.wf4ever-project.org/portal/ The Research Object is stored and manipulated in a Research Object Digital Library using REST APIs, allowing any tools to view and modify the RO, like the RO portal: https://w3id.org/bundle REST API RDF triple store (RO structure, Annotations) RO index Uploaded files RO Portal Checklist service ...