Who am I?	





 Sean Bechhofer	

 University of Manchester	

 sean.bechhofer@manchester.ac.uk	

 @seanbechhofer	

 http://humblyreport.wordpress.com	





                                        1
Research Objects: Towards Exchange and Reuse of Digital Knowledge

                  Sean Bechhofer 	

              University of Manchester   	

        sean.bechhofer@manchester.ac.uk      	

                  @seanbechhofer     	

        http://humblyreport.wordpress.com      	





                                                     10
Publication	

•  Argumentation: Convince the reader of the 
   validity of a position [Mesirov]	

    –  Reproducible Results System: facilitates enactment
       and publication of reproducible research.	

 J. Mesirov, Accessible Reproducible Research, Science 327(5964), p.415-416, 2010
 http://dx.doi.org/10.1126/science.1179653



•  Results are reinforced by reproducibility [De Roure]

    –  Explicit representation of method. 	

 D. De Roure and C. Goble, Anchors in Shifting Sand: the Primacy of Method in the Web of Data,
 Web Science Conference 2010, Raleigh NC, 2010 http://eprints.ecs.soton.ac.uk/20817/


•  Verifiability as a key factor in scientific discovery.	

 Stodden et al., Reproducible Research: Addressing the Need for Data and
 Code Sharing in Computational Science, Computing in Science and Engineering 12(5),
 p.8-13, 2010 http://dx.doi.org/10.1109/MCSE.2010.113
Publication	


   •  Nano-publications. Explicit representation at the statement
      level. 	

 Groth et al., The Anatomy of a Nano-publication, Information Services and Use 30(1),
 p.51-56, 2010 http://iospress.metapress.com/index/FTKH21Q50T521WM2.pdf



   •  Executable Papers	

       –  Collage	

       –  SHARE	

       –  Verifiable Computational Results	

  Nowakowski et al., The Collage Authoring Environment, ICCS 2011, 2011
  http://dx.doi.org/10.1016/j.procs.2011.04.064

  Van Gorp et al., SHARE: a web portal for creating and sharing executable
  research papers, ICCS 2011, 2011 http://dx.doi.org/10.1016/j.procs.2011.04.062

  Gavish et al., A Universal Identifier for Computational Results, ICCS 2011, 2011
  http://dx.doi.org/10.1016/j.procs.2011.04.067


                                                                                                          12
Knowledge Burying in paper publication	


  [Diagram: cycle connecting Experiment, Knowledge, Publication, Paper and Text Mining]



  •  Publishing/mining cycle results in loss of knowledge	

      –  ≥ 40% of information lost	

  •  RIP – Rest in Paper	

  •  Need for mechanisms for publication of knowledge, preserving
     information about the process.	

  B. Mons, Which Gene Did You Mean?, BMC Bioinformatics 6, p.142, 2005
  http://dx.doi.org/10.1186/1471-2105-6-142
The Problem	


   •  Moving to digital environments	

       –  Workflows, protocols, algorithms	

       –  Consuming and producing data	

       –  Electronic publication methods	

   •  From (linear) paper publications to…. 	


                            ???
                              	

   •  Need for frameworks for facilitating reuse and
      exchange of digital knowledge	

                                                       14
Workflows	

A Scientific Workflow can be seen as the combination of data and processes into a
configurable, structured set of steps that implement semi-automated computational
solutions in scientific problem-solving

[Figure: BioAID_DiseaseDiscovery v3 workflow]

   •  Central in experimental science

       –  Enable automation

       –  Make science repeatable (and sometimes reproducible)

       –  Encourage best practices

   •  Scientist-friendly

       –  Aimed at (some types of) scientists, possibly even without strong
          computational skills

   •  Communities: Need for scientific data preservation

       –  Enhance scientific development by building on, sharing, and extending
          previous results within scientific communities

   •  However, workflow preservation is especially complex

       –  Workflows not only specified statically at design time but also
          interpreted through their execution

       –  Complex models are required to describe workflows and related
          resources, including documents, data and services

       –  Resources often beyond control of scientists
myExperiment	


   •  A repository of research methods

   •  A community social network of people and things

   •  A Social Virtual Research Environment

   •  Web 2.0 “boutique” site

   •  A probe into researcher behaviour

   •  Open source (BSD) Ruby on Rails app

   •  REST and SPARQL interfaces, supports Linked Data

   •  Part of product family including BioCatalogue, MethodBox and SysmoDB

   5550 members, 300 groups, 2300 workflows, 220 packs
                                                                             	
  



                                                                                    16
Motivating Projects	


 •  myExperiment	

     –  Workflow sharing 	

 •  Sysmo-DB	

     –  Assets catalogue supporting exchange of data,
        models, SOPs	

 •  Obesity e-Lab/MethodBox	

     –  Sharing survey data/analysis scripts	

 •  myExperiment packs	

     –  Packs supporting (simple)
        aggregations.	

     –  Links not just references	

     –  Packs as nascent ROs	

                                                        17
Wf4Ever	

    …technological infrastructure for the preservation and
    efficient retrieval and reuse of scientific workflows in a range
    of disciplines.	

   •  Architecture/implementation for workflow preservation,
      sharing and reuse	

   •  Research Object models	

   •  Workflow Decay, Integrity and Authenticity	

   •  Workflow Evolution and Recommendation	

   •  Provenance	

   •  Driven by Use Cases	

        FP7 Digital Libraries and Digital Preservation	

        iSOCO, University of Manchester, Universidad Politécnica de
        Madrid, University of Oxford, Poznan Supercomputing and
        Networking Centre, Instituto de Astrofísica de Andalucía,
        Leiden University Medical Centre	

                           18
Research Objects	


       Semantically rich aggregations of resources,
            supporting a research objective 	



                        Linking	





                                                      19
Bio Scenario	





                  20
Astronomers Questions	

 When accessing a workflow

 •  Can I use it for my purposes (in my words)?

 •  If I can expect it to run, when was it last run, and by whom?

 •  What it does, quickly, by one of:

     –    example input / output (and trying it)

     –    a description

     –    ‘reading’ its key parts

     –    what it was used for

     –    related workflows, its creator

     –    contacting the creator or last user

 •  How do I need to cite the author and workflow?

 When sharing a workflow

 •  What rights do others have?

 •  What makes a good workflow, to get a good score?

     –  Make my workflow findable, reusable, and ready for review

     –  Instructions to authors

     –  Two types of contributions: serious science, preliminary/playing around

 •  If my workflow may have issues

     –  What the system or other users think it does

 •  How it relates to other things

 •  Share freely or anonymously upon request?




                                                                                                                   22	

                    http://www.flickr.com/photos/-bast-/349497988/
User Requirements	

 Roles (diagram): Reader, Re-User, Trainee, Contributor, Finder/Searcher, Creator,
 Publisher, Comparator, Curator, Evaluator/Reviewer

 •    Study of user scenarios

 •    Isolation of User Requirements

 •    User review

 •    Project Technical requirements

 •    Classify Technical Requirements

 Example user stories:

 As a Creator of ROs, I want to aggregate existing resources so that I can
 conveniently access related resources from a single place.

 As a Reader of ROs, I want to compare an RO with others so that I can determine
 whether the investigation is novel.

 As a Comparator of ROs, I want to follow the steps taken so that I can understand
 the investigative process or method.

                                                                               23
User Roles	

 Creator. Collecting together resources as an RO for reuse or
 repurpose. May be for personal use.	

 Contributor. Providing materials to be used within an RO	

 Collaborator. Providing materials to be used without
 necessarily being aware of the RO	

 Reader. Looking for related works, state of the art.	

 Comparator. Looking for similar or previous work to a task in
 hand	

 Re-User. Understands the underlying methods encapsulated
 (e.g. workflow) and how to extract/replace components. 	

 Publisher. Disseminating results or methods. Upload to
 repository, publish via myExp, embed in blog post. 	

 Evaluator/Reviewer. Evaluating/validating or reviewing content.
 Confirmation of results or validation of process.	

               24
Workflow Reproducibility
                    Stability, Completeness, Integrity, Authenticity, Quality




Workflow Decay
•    Component level
      •  flux/decay/unavailability
•    Data level
      •  formats/ids/standards
•    Infrastructure level
      •  platform/resources


Experiment Decay
•    Methodological changes
•    New technologies
•    New resources/components
•    New data
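
The three workflow decay levels above lend themselves to a simple availability report. The following is only a minimal sketch under stated assumptions: the `Dependency` class, the `decay_report` function and the example dependency names are hypothetical illustrations, not part of any Wf4Ever service, and the availability probes are stand-in lambdas rather than real checks.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Dependency:
    """Something a workflow relies on (hypothetical model, not a Wf4Ever type)."""
    name: str
    level: str  # "component", "data" or "infrastructure"
    is_available: Callable[[], bool]  # probe supplied by the caller


def decay_report(dependencies: List[Dependency]) -> Dict[str, List[str]]:
    """Group the names of unavailable dependencies by decay level."""
    report: Dict[str, List[str]] = {"component": [], "data": [], "infrastructure": []}
    for dep in dependencies:
        if not dep.is_available():
            # setdefault tolerates levels outside the three named on the slide
            report.setdefault(dep.level, []).append(dep.name)
    return report


# Stand-in probes: a real check might ping a service or validate a file format.
deps = [
    Dependency("alignment web service", "component", lambda: False),
    Dependency("input file format", "data", lambda: True),
    Dependency("execution platform", "infrastructure", lambda: True),
]
report = decay_report(deps)
```

Here `report["component"]` lists the one dependency whose probe failed, while the other two levels stay empty.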



                                                                            25
Wf4Ever functionalities



Access & Usage Functionalities: Edit, Use, Annotate, …

Data Management & Analysis Functionalities: Stability Evaluation, Completeness
Evaluation, Recommendations, Visualization, Collaboration, …

Storage Functionalities: Storage, Retrieval, Maintenance, …

Lifecycle Functionalities: Execution, Publication, Archival, …


                                                                                                         26
Wf4Ever Reference Implementation
(By the end of 1st Year)

Access & Usage Clients: RO Portal, RO Manager Tool, Dropbox Client (ROBox)

Data Management & Analysis Services: Stability Evaluation, Completeness
Evaluation, Recommender

Storage Services: RO Digital Library

Lifecycle Services: Taverna Workflow Mgmt System


                                                                                          27
Linked Data	


   •  A set of best practices for publishing 
      and connecting data on the Web	

       1.  Use URIs to name things	

       2.  Use dereferenceable HTTP URIs

       3.  Provide useful content on lookup using standards	

       4.  Include links to other stuff	
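
As a rough illustration of principles 2 and 3, a client can look up a thing's URI and ask for a standard RDF representation through HTTP content negotiation. This is a sketch only: the URI is a made-up placeholder on example.org, `build_lookup_request` is a hypothetical helper, and the request is built but deliberately not sent.

```python
from urllib.request import Request

# Hypothetical URI naming a "thing" (principles 1 and 2); example.org is a placeholder.
THING_URI = "http://example.org/id/workflow/42"


def build_lookup_request(uri: str) -> Request:
    """Build (but do not send) an HTTP GET asking for RDF about `uri`."""
    # Principle 3: request a standard representation via the Accept header.
    accept = "text/turtle, application/rdf+xml;q=0.9"
    return Request(uri, headers={"Accept": accept}, method="GET")


request = build_lookup_request(THING_URI)
```

Passing `request` to `urllib.request.urlopen` would perform the lookup; a server following the principles would answer with RDF that itself links to other URIs (principle 4).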





                                                                 28
Linked Data is not Enough!

   Note: The answer is not not Linked Data!* (*Logician joke)

   •  A set of best practices for publishing and connecting data on the Web

       1.  Use URIs to name things

       2.  Use dereferenceable HTTP URIs

       3.  Provide useful content on lookup using standards

       4.  Include links to other stuff

   •  All very nice, lots of publishing going on, but no common
      models for lifecycle, aggregation, ownership, etc	

   •  A platform for sharing and publishing, but more is needed	


                  Bechhofer et al Linked Data is not Enough for Scientists Future Generation
                  Computer Systems, 2011 http://dx.doi.org/10.1016/j.future.2011.08.004	


                                                                                               30
ROs and Linked Data	


  •  Linked Data: Collection of best practices for publishing
     and connecting structured data on the web. 	

  •  ROs should be independent of mechanisms for
     representation and delivery	

  •  ROs as non-information resources

      –  “Named Graphs” for LD

  [Diagram: RO and the LD Cloud]



                                                                    31
WP2 - Workflow Lifecycle Management: Research Object Model

»  Research Object Model

    ›  Focus of work in M6-12

»  Version 0.1 released to project in November 2011

»  Use within developed RO services (RODL)

»  A suite of linked ontologies

    ›  Research Object Core - ro (aggregation and annotation)

         •  Research Object

    ›  Workflow Description - wfdesc (content)

         •  Abstract workflow

    ›  Workflow Provenance - wfprov (provenance)

         •  Workflow provenance

[Diagram labels: Container Structure; Emphasis on Workflow-centric Research Objects;
Minimal place holder]




                                                                                                        32
WP2 - Workflow Lifecycle Management: Research Object Core (ro)


»  Aggregation (OAI-ORE)	

    ›  Use of OAI-ORE to support the description of collections of
       resources.	

    ›  Established vocabulary	

    ›  Usage in existing work (myExperiment)	

    ›  Fit with Linked Data publication	

»  Annotation (AO)	

    ›  Survey of existing annotation vocabularies, Annotation Ontology (Clark et al) and Open
       Annotation Collaboration (Van de Sompel et al). 	

    ›  Liaison and discussion with both groups	

         •  Little to choose in technical terms	

	

         •  A catalyst and focus for collaboration between AO and OAC	

    ›  Choice of AO 	

         •  Existing collaboration/relationship (UNIMAN and AO)	

»  Formation of W3C Open Annotation Community Group	

    ›  Participation from Wf4Ever staff	

    ›  Potential for impact/collaborations	

»  Defines the core data model used by the RO Digital Library service and the
   Command Line Tool developed in WP1. 	
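
The aggregation side of the core model can be pictured as a handful of OAI-ORE triples. The sketch below is illustrative only: it uses the published ORE terms `ore:Aggregation` and `ore:aggregates`, but the example.org URIs and the `aggregate` and `to_ntriples` helpers are hypothetical, and the annotation (AO) side of the model is left out.

```python
# OAI-ORE vocabulary namespace and rdf:type, written out as full URIs.
ORE = "http://www.openarchives.org/ore/terms/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def aggregate(ro_uri, resource_uris):
    """Return (subject, predicate, object) triples describing an ORE aggregation."""
    triples = [(ro_uri, RDF_TYPE, ORE + "Aggregation")]
    for resource in resource_uris:
        triples.append((ro_uri, ORE + "aggregates", resource))
    return triples


def to_ntriples(triples):
    """Serialise URI-only triples as N-Triples lines."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in triples)


# Hypothetical Research Object aggregating a workflow and a dataset.
ro_triples = aggregate(
    "http://example.org/ro/1/",
    ["http://example.org/ro/1/workflow.t2flow", "http://example.org/ro/1/data.csv"],
)
ntriples = to_ntriples(ro_triples)
```

Keeping the aggregation as plain triples reflects the point made earlier in the talk that ROs should stay independent of any one mechanism for representation and delivery.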

                                                      33
WP2 - Workflow Lifecycle Management: Workflow Description (wfdesc)



»  Model providing initial descriptions of workflows 	

     ›  Process instances 	

     ›  Linked via input/output/parameters. 	

     ›  Support for the tasks of workflow abstraction, indexing, classifications, and general
        workflow analysis. 	

     ›  Generic technologies, adaptable to different domains using specific catalogues, e.g. SADI
        framework. 	

     ›  Reflects explicit focus on workflow-centric ROs	

»  Evolved from the OPMW ontology by Wf4Ever staff member Daniel Garijo and
   Yolanda Gil.	

»  Tooling generating wfdesc descriptions from aggregated Taverna workflows has
   been developed. 	

     ›  Descriptions already used by the Workflow Recommendation Service for inspecting
        workflow structures and service interconnections. WP3	
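
The idea of processes linked via inputs and outputs can be sketched with two small data classes. This is an assumption-laden illustration, not the wfdesc ontology itself: the `Process` and `Workflow` classes, the `downstream_of` method and the example process names are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Process:
    """A step in a workflow description (hypothetical stand-in for a wfdesc process)."""
    name: str
    inputs: List[str] = field(default_factory=list)
    outputs: List[str] = field(default_factory=list)


@dataclass
class Workflow:
    name: str
    processes: List[Process]
    # Each link: (producer process, output name, consumer process, input name).
    links: List[Tuple[str, str, str, str]] = field(default_factory=list)

    def downstream_of(self, process_name: str) -> List[str]:
        """Names of processes that directly consume an output of `process_name`."""
        return sorted({dst for src, _out, dst, _in in self.links if src == process_name})


wf = Workflow(
    name="example workflow",
    processes=[
        Process("fetch", outputs=["sequences"]),
        Process("align", inputs=["sequences"], outputs=["alignment"]),
        Process("report", inputs=["alignment"]),
    ],
    links=[
        ("fetch", "sequences", "align", "sequences"),
        ("align", "alignment", "report", "alignment"),
    ],
)
consumers = wf.downstream_of("fetch")
```

A structure of this kind is the sort of thing a recommendation service could inspect when it looks at workflow structure and service interconnections.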





                                                                                                   34
WP2 - Workflow Lifecycle Management: Workflow Provenance (wfprov)



»  A provenance convergence layer	

    ›  Potential for links to OPM-V or PROV-O. 	

    ›  Mappings to OPM-V and PROV-O are under development 	

    ›  A placeholder for the v0.1 ontology suite	

»  A Taverna plugin exporting Taverna provenance in PROV-O format has been
   developed in WP4

»  A prototype conversion agent that generates wfprov descriptions from PROV-O has
   been developed. wfprov data will primarily be used by Integrity and Authenticity
   (WP4) in order to inspect workflow executions.

»  More extended modeling and descriptions of provenance information will be
   reported in WP4.	
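
One way to picture how provenance supports inspecting workflow executions is to check recorded process runs against the workflow description. The sketch below is hypothetical throughout: `ProcessRun`, `undescribed_runs` and the example data are illustrative names, not wfprov terms or Wf4Ever code.

```python
from dataclasses import dataclass
from typing import Iterable, List, Set


@dataclass(frozen=True)
class ProcessRun:
    """A recorded execution of one workflow step (hypothetical provenance record)."""
    run_id: str
    described_by: str  # name of the process in the workflow description
    succeeded: bool


def undescribed_runs(runs: Iterable[ProcessRun], described: Set[str]) -> List[str]:
    """Return ids of runs whose process is missing from the workflow description."""
    return [run.run_id for run in runs if run.described_by not in described]


described_processes = {"fetch", "align", "report"}
runs = [
    ProcessRun("run-1", "fetch", True),
    ProcessRun("run-2", "align", True),
    ProcessRun("run-3", "plot", False),  # not part of the description
]
suspect = undescribed_runs(runs, described_processes)
```

A run that matches no described process, such as `run-3` here, is the kind of mismatch an integrity check might flag for a closer look.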





                                                                                      35
ROs are Technical and Social	


   •  An artefact to support preservation of the method, data
      etc.	

   •  Technical details of platform, services etc.	





   •  A record of an investigation or experiment	

   •  A mechanism for communication, packaging, sharing,
      publishing, finding	

   •  An object that connects people together 	

                De Roure et al., Social Scientific Objects, 1st International Workshop on Social
                Object Networks, Boston, 2011
                http://users.ox.ac.uk/~oerc0033/preprints/myExpSocialObjects.pdf
Where Next/Challenges	

   •  Prototype development	

   •  Models for Research Objects	

       –  Vocabularies	

   •  Refinement of lifecycle states	

       –  Versioning and Evolution	

   •  Provenance	

       –  RO components	

       –  The RO itself	

   •  Trust	





                                         http://www.flickr.com/photos/marsdd/2986989396/	

   37
Music	





           38
Music	


    •  Music IR and Linked Data	

       –  Publication of collections	

              eTree	

              Million Song Dataset	

              Benefits?	

    •  Music IR and ROs	

        –  What are the Research 
           Objects of Music IR?	

        –  Intermediate results/feature 
           sets	

    •  Ontologies and vocabularies for describing results/feature
       sets	

                                                                    39
Thanks!	


   •  Manchester Information Management Group	

      –  http://img.cs.manchester.ac.uk 	

   •  myGrid Team	

      –  http://www.mygrid.org.uk/	

   •  Wf4Ever Team	

      –  http://www.wf4ever-project.org/ 	





                                                   40
Where Next?	





                 41

Más contenido relacionado

La actualidad más candente

Claudia Bauzer Medeiros Digital preservation – caring for our data to foster...
Claudia Bauzer Medeiros  Digital preservation – caring for our data to foster...Claudia Bauzer Medeiros  Digital preservation – caring for our data to foster...
Claudia Bauzer Medeiros Digital preservation – caring for our data to foster...Beniamino Murgante
 
LODUM talk at ifgi's Spatial @ WWU series
LODUM talk at ifgi's Spatial @ WWU seriesLODUM talk at ifgi's Spatial @ WWU series
LODUM talk at ifgi's Spatial @ WWU seriesCarsten Keßler
 
Introduction to Research Data Management for postgraduate students
Introduction to Research Data Management for postgraduate studentsIntroduction to Research Data Management for postgraduate students
Introduction to Research Data Management for postgraduate studentsMarieke Guy
 
Bridging research and collections
Bridging research and collectionsBridging research and collections
Bridging research and collectionsvty
 
An Overview of the OAI Object Reuse and Exchange Interoperability Framework
An Overview of the OAI Object Reuse and Exchange Interoperability FrameworkAn Overview of the OAI Object Reuse and Exchange Interoperability Framework
An Overview of the OAI Object Reuse and Exchange Interoperability FrameworkHerbert Van de Sompel
 
Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017
Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017
Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017Deborah McGuinness
 
Augmenting interoperability across scholarly repositories
Augmenting interoperability across scholarly repositoriesAugmenting interoperability across scholarly repositories
Augmenting interoperability across scholarly repositoriesHerbert Van de Sompel
 
Challenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceChallenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceGarethKnight
 
Reaching the researcher
Reaching the researcherReaching the researcher
Reaching the researcherLIBER Europe
 
University of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchersUniversity of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchersJez Cope
 
Creation of LSE Digital Library
Creation of LSE Digital LibraryCreation of LSE Digital Library
Creation of LSE Digital LibraryEd Fay
 
Research Data Management at the University of Edinburgh
Research Data Management at the University of EdinburghResearch Data Management at the University of Edinburgh
Research Data Management at the University of EdinburghEDINA, University of Edinburgh
 
OAC Presentation at CNI 09 Fall Forum
OAC Presentation at CNI 09 Fall ForumOAC Presentation at CNI 09 Fall Forum
OAC Presentation at CNI 09 Fall ForumRobert Sanderson
 
e-Science, Research Data and Libaries
e-Science, Research Data and Libariese-Science, Research Data and Libaries
e-Science, Research Data and LibariesRob Grim
 
The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...
The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...
The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...Herbert Van de Sompel
 

La actualidad más candente (20)

Claudia Bauzer Medeiros Digital preservation – caring for our data to foster...
Claudia Bauzer Medeiros  Digital preservation – caring for our data to foster...Claudia Bauzer Medeiros  Digital preservation – caring for our data to foster...
Claudia Bauzer Medeiros Digital preservation – caring for our data to foster...
 
A Clean Slate?
A Clean Slate?A Clean Slate?
A Clean Slate?
 
LODUM talk at ifgi's Spatial @ WWU series
LODUM talk at ifgi's Spatial @ WWU seriesLODUM talk at ifgi's Spatial @ WWU series
LODUM talk at ifgi's Spatial @ WWU series
 
NISO Forum, Denver, Sept. 24, 2012: DataCite and Campus Data Services
NISO Forum, Denver, Sept. 24, 2012: DataCite and Campus Data ServicesNISO Forum, Denver, Sept. 24, 2012: DataCite and Campus Data Services
NISO Forum, Denver, Sept. 24, 2012: DataCite and Campus Data Services
 
Introduction to Research Data Management for postgraduate students
Introduction to Research Data Management for postgraduate studentsIntroduction to Research Data Management for postgraduate students
Introduction to Research Data Management for postgraduate students
 
Bridging research and collections
Bridging research and collectionsBridging research and collections
Bridging research and collections
 
An Overview of the OAI Object Reuse and Exchange Interoperability Framework
An Overview of the OAI Object Reuse and Exchange Interoperability FrameworkAn Overview of the OAI Object Reuse and Exchange Interoperability Framework
An Overview of the OAI Object Reuse and Exchange Interoperability Framework
 
Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017
Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017
Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017
 
Augmenting interoperability across scholarly repositories
Augmenting interoperability across scholarly repositoriesAugmenting interoperability across scholarly repositories
Augmenting interoperability across scholarly repositories
 
Challenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceChallenges in setting up an RDM Support Service
Challenges in setting up an RDM Support Service
 
Reaching the researcher
Reaching the researcherReaching the researcher
Reaching the researcher
 
University of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchersUniversity of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchers
 
Creation of LSE Digital Library
Creation of LSE Digital LibraryCreation of LSE Digital Library
Creation of LSE Digital Library
 
Research Data Management at the University of Edinburgh
Research Data Management at the University of EdinburghResearch Data Management at the University of Edinburgh
Research Data Management at the University of Edinburgh
 
OAC Presentation at CNI 09 Fall Forum
OAC Presentation at CNI 09 Fall ForumOAC Presentation at CNI 09 Fall Forum
OAC Presentation at CNI 09 Fall Forum
 
Sünje Dallmeier-Tiessen: Research data "publishing": models, roles and respon...
Sünje Dallmeier-Tiessen: Research data "publishing": models, roles and respon...Sünje Dallmeier-Tiessen: Research data "publishing": models, roles and respon...
Sünje Dallmeier-Tiessen: Research data "publishing": models, roles and respon...
 
e-Science, Research Data and Libaries
e-Science, Research Data and Libariese-Science, Research Data and Libaries
e-Science, Research Data and Libaries
 
The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...
The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...
The OAI-ORE Interoperability Framework in the Context of the Current Scholarl...
 
圖書館趨勢觀察
圖書館趨勢觀察圖書館趨勢觀察
圖書館趨勢觀察
 
Escaping Datageddon
Escaping DatageddonEscaping Datageddon
Escaping Datageddon
 

OeRC Seminar

  • 1. Who am I? Sean Bechhofer, University of Manchester, sean.bechhofer@manchester.ac.uk, @seanbechhofer, http://humblyreport.wordpress.com
  • 10. Research Objects: Towards Exchange and Reuse of Digital Knowledge. Sean Bechhofer, University of Manchester, sean.bechhofer@manchester.ac.uk, @seanbechhofer, http://humblyreport.wordpress.com
  • 11. Publication •  Argumentation: Convince the reader of the validity of a position [Mesirov] –  Reproducible Results System: facilitates enactment and publication of reproducible research. J. Mesirov, Accessible Reproducible Research, Science 327(5964), p.415-416, 2010 http://dx.doi.org/10.1126/science.1179653 •  Results are reinforced by reproducibility [De Roure] –  Explicit representation of method. D. De Roure and C. Goble, Anchors in Shifting Sand: the Primacy of Method in the Web of Data, Web Science Conference 2010, Raleigh NC, 2010 http://eprints.ecs.soton.ac.uk/20817/ •  Verifiability as a key factor in scientific discovery. Stodden et al., Reproducible Research: Addressing the Need for Data and Code Sharing in Computational Science, Computing in Science and Engineering 12(5), p.8-13, 2010 http://dx.doi.org/10.1109/MCSE.2010.113
  • 12. Publication •  Nano-publications. Explicit representation at the statement level. Groth et al., The Anatomy of a Nano-publication, Information Services and Use 30(1), p.51-56, 2010 http://iospress.metapress.com/index/FTKH21Q50T521WM2.pdf •  Executable Papers –  Collage –  SHARE –  Verifiable Computational Results. Nowakowski et al., The Collage Authoring Environment, ICCS 2011, 2011 http://dx.doi.org/10.1016/j.procs.2011.04.064. Van Gorp et al., SHARE: a web portal for creating and sharing executable research papers, ICCS 2011, 2011 http://dx.doi.org/10.1016/j.procs.2011.04.062. Gavish et al., A Universal Identifier for Computational Results, ICCS 2011, 2011 http://dx.doi.org/10.1016/j.procs.2011.04.067
  • 13. Knowledge Burying in paper publication (Diagram labels: Experiment, Knowledge, Publication, Paper, Text Mining) •  Publishing/mining cycle results in loss of knowledge –  ≥ 40% of information lost •  RIP – Rest in Paper •  Need for mechanisms for publication of knowledge, preserving information about the process. B. Mons, Which Gene Did You Mean?, BMC Bioinformatics 6, p.142, 2005 http://dx.doi.org/10.1186/1471-2105-6-142
  • 14. The Problem •  Moving to digital environments –  Workflows, protocols, algorithms –  Consuming and producing data –  Electronic publication methods •  From (linear) paper publications to… ??? •  Need for frameworks for facilitating reuse and exchange of digital knowledge
  • 15. Workflows
      A Scientific Workflow can be seen as the combination of data and processes into a configurable, structured set of steps that implement semi-automated computational solutions in scientific problem-solving.
      •  Central in experimental science
      •  Enable automation
      •  Make science repeatable (and sometimes reproducible)
      •  Encourage best practices
      •  Scientist-friendly
      •  Aimed at (some types of) scientists, possibly even without strong computational skills
      •  Communities: Need for scientific data preservation
      •  Enhance scientific development by building on, sharing, and extending previous results within scientific communities
      •  However, workflow preservation is especially complex
      •  Workflows not only specified statically at design time but also interpreted through their execution
      •  Complex models are required to describe workflows and related resources, including documents, data and services
      •  Resources often beyond control of scientists
      (Figure: BioAID_DiseaseDiscovery v3)
  • 16. myExperiment
      •  A repository of research methods
      •  A community social network of people and things
      •  A Social Virtual Research Environment
      •  Part of product family including BioCatalogue, MethodBox and SysmoDB
      •  A probe into researcher behaviour
      •  Open source (BSD) Ruby on Rails app
      •  REST and SPARQL interfaces, supports Linked Data
      •  Web 2.0 “boutique” site
      5550 members, 300 groups, 2300 workflows, 220 packs
  • 17. Motivating Projects •  myExperiment –  Workflow sharing •  Sysmo-DB –  Assets catalogue supporting exchange of data, models, SOPs •  Obesity e-Lab/MethodBox –  Sharing survey data/analysis scripts •  myExperiment packs –  Packs supporting (simple) aggregations. –  Links not just references –  Packs as nascent ROs
  • 18. Wf4Ever …technological infrastructure for the preservation and efficient retrieval and reuse of scientific workflows in a range of disciplines. •  Architecture/implementation for workflow preservation, sharing and reuse •  Research Object models •  Workflow Decay, Integrity and Authenticity •  Workflow Evolution and Recommendation •  Provenance •  Driven by Use Cases FP7 Digital Libraries and Digital Preservation iSOCO, University of Manchester, Universidad Politécnica de Madrid, University of Oxford, Poznan Supercomputing and Networking Centre, Instituto de Astrofísica de Andalucía, Leiden University Medical Centre
  • 19. Research Objects Semantically rich aggregations of resources, supporting a research objective Linking
  • 22. Astronomers' Questions
      When accessing a workflow
      •  Can I use it for my purposes (in my words)?
      •  If I can expect it to run, when was it last run, by whom?
      •  What it does quickly, by one of
         –  example input / output (and trying it)
         –  a description
         –  ‘reading’ its key parts
         –  what it was used for
         –  related workflows its creator
         –  contacting the creator or last user
      •  How it relates to other things
      •  How I need to cite the author and workflow?
      When sharing a workflow
      •  What rights do others have?
      •  What a good workflow is to get a good score?
         –  Make my workflow findable, reusable, and ready for review
         –  Instructions to authors
         –  Two types of contributions: serious science, preliminary/playing around
      •  If my workflow may have issues
         –  What the system or other users think it does
      •  Share freely or anonymously upon request?
      http://www.flickr.com/photos/-bast-/349497988/
  • 23. User Requirements
      Roles: Reader, Re-User, Trainee, Contributor, Finder/Searcher, Creator, Contributor, Publisher, Comparator, Curator, Evaluator/Reviewer
      •  Study of user scenarios
      •  Isolation of User Requirements
      •  User review
      •  Project Technical requirements
      •  Classify Technical Requirements
      As a Creator of ROs, I want to aggregate existing resources so that I can conveniently access related resources from a single place.
      As a Reader of ROs, I want to compare an RO with others so that I can determine whether the investigation is novel.
      As a Comparator of ROs, I want to follow the steps taken so that I can understand the investigative process or method.
  • 24. User Roles Creator. Collecting together resources as an RO for reuse or repurpose. May be for personal use. Contributor. Providing materials to be used within an RO. Collaborator. Providing materials to be used without necessarily being aware of the RO. Reader. Looking for related works, state of the art. Comparator. Looking for similar or previous work to a task in hand. Re-User. Understands the underlying methods encapsulated (e.g. workflow) and how to extract/replace components. Publisher. Disseminating results or methods. Upload to repository, publish via myExp, embed in blog post. Evaluator/Reviewer. Evaluating/validating or reviewing content. Confirmation of results or validation of process.
  • 25. Workflow Reproducibility: Stability, Completeness, Integrity, Authenticity, Quality. Workflow Decay •  Component level: flux/decay/unavailability •  Data level: formats/ids/standards •  Infrastructure level: platform/resources. Experiment Decay •  Methodological changes •  New technologies •  New resources/components •  New data
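
The three decay levels on this slide can be illustrated with a small check. What follows is a minimal Python sketch and not part of the Wf4Ever tooling: the function names `decay_report` and `is_stable`, the example identifiers, and the idea of passing in an availability probe are all assumptions made for illustration.

```python
from typing import Callable, Dict, Iterable, List, Tuple

# Decay levels named on the slide.
LEVELS = ("component", "data", "infrastructure")


def decay_report(
    dependencies: Iterable[Tuple[str, str]],
    is_available: Callable[[str], bool],
) -> Dict[str, List[str]]:
    """Group unavailable dependencies by decay level.

    `dependencies` holds (level, identifier) pairs; `is_available` is a
    caller-supplied probe (hypothetical here), e.g. an HTTP check.
    """
    report: Dict[str, List[str]] = {level: [] for level in LEVELS}
    for level, identifier in dependencies:
        if level not in report:
            raise ValueError("unknown decay level: " + level)
        if not is_available(identifier):
            report[level].append(identifier)
    return report


def is_stable(report: Dict[str, List[str]]) -> bool:
    """A workflow counts as stable here only if nothing is unavailable."""
    return not any(report.values())


# Illustrative identifiers only; none of these refer to real services.
deps = [
    ("component", "http://example.org/service/align"),
    ("data", "http://example.org/data/genes.csv"),
    ("infrastructure", "workflow-engine-2.x"),
]
reachable = {"http://example.org/data/genes.csv", "workflow-engine-2.x"}
report = decay_report(deps, lambda identifier: identifier in reachable)
```

Injecting the probe keeps the sketch testable without network access; a real stability service would also need to track results over time rather than take a single snapshot.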
  • 26. Wf4Ever functionalities
      •  Access / Usage Functionalities: Edit, Use, Annotate, …
      •  Data Management / Analysis Functionalities: Stability Evaluation, Completeness Evaluation, Recommendations, Visualization, Collaboration, …
      •  Storage Functionalities: Storage, Retrieval, Maintenance, …
      •  Lifecycle Functionalities: Execution, Publication, Archival, …
  • 27. Wf4Ever Reference Implementation (By the end of 1st Year)
      •  Access / Usage Clients: Dropbox Client (ROBox), RO Manager Tool, RO Portal
      •  Data Management / Analysis Services: Stability Evaluation, Completeness Evaluation, Recommender
      •  Storage Services: RO Digital Library
      •  Lifecycle Services: Taverna Workflow Mgmt System
  • 28. Linked Data •  A set of best practices for publishing and connecting data on the Web 1.  Use URIs to name things 2.  Use dereferenceable HTTP URIs 3.  Provide useful content on lookup using standards 4.  Include links to other stuff
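
Principles 2 and 3 come down to an HTTP lookup that returns a standard representation. The following is a minimal Python sketch, not taken from the slides: it builds, but does not send, a request that asks for RDF through content negotiation. The function name `build_lookup_request`, the media-type preference order and the example URI are assumptions for illustration.

```python
from urllib.request import Request

# Media types for RDF serialisations; the preference order is an assumption.
RDF_ACCEPT = "text/turtle, application/rdf+xml;q=0.9"


def build_lookup_request(uri: str) -> Request:
    """Build (but do not send) a GET request asking for RDF about `uri`."""
    if not uri.startswith(("http://", "https://")):
        # Principle 2: names should be HTTP URIs so they can be looked up.
        raise ValueError("Linked Data names should be HTTP URIs: " + uri)
    return Request(uri, headers={"Accept": RDF_ACCEPT})


# Placeholder URI, for illustration only.
request = build_lookup_request("http://example.org/workflows/42")
```

Passing `request` to `urllib.request.urlopen` would perform the lookup; whether RDF comes back depends entirely on the server honouring the `Accept` header.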
  • 30. Linked Data is not Enough! Note: The answer is not not Linked Data!* (*Logician joke) •  A set of best practices for publishing and connecting data on the Web 1.  Use URIs to name things 2.  Use dereferenceable HTTP URIs 3.  Provide useful content on lookup using standards 4.  Include links to other stuff •  All very nice, lots of publishing going on, but no common models for lifecycle, aggregation, ownership, etc. •  A platform for sharing and publishing, but more is needed. Bechhofer et al., Linked Data is not Enough for Scientists, Future Generation Computer Systems, 2011 http://dx.doi.org/10.1016/j.future.2011.08.004
  • 31. ROs and Linked Data •  Linked Data: Collection of best practices for publishing and connecting structured data on the web. •  ROs should be independent of mechanisms for representation and delivery •  ROs as non-information resources –  “Named Graphs for LD” (Diagram labels: LD Cloud, RO)
  • 32. WP2 - Workflow Lifecycle Management: Research Object Model
      »  Research Object Model
         ›  Focus of work in M6-12
      »  Version 0.1 released to project in November 2011
      »  Use within developed RO services (RODL)
      »  A suite of linked ontologies
         ›  Research Object Core - ro (aggregation and annotation) •  Research Object (Container Structure)
         ›  Workflow Description - wfdesc (content) •  Abstract workflow (Emphasis on Workflow-centric Research Objects)
         ›  Workflow Provenance - wfprov (provenance) •  Workflow provenance (Minimal place holder)
  • 33. WP2 - Workflow Lifecycle Management Research Object Core (ro) »  Aggregation (OAI-ORE) ›  Use of OAI-ORE to support the description of collections of resources. ›  Established vocabulary ›  Usage in existing work (myExperiment) ›  Fit with Linked Data publication »  Annotation (AO) ›  Survey of existing annotation vocabularies, Annotation Ontology (Clark et al) and Open Annotation Collaboration (Van de Sompel et al). ›  Liaison and discussion with both groups •  Little to choose in technical terms •  A catalyst and focus for collaboration between AO and OAC ›  Choice of AO •  Existing collaboration/relationship (UNIMAN and AO) »  Formation of W3C Open Annotation Community Group ›  Participation from Wf4Ever staff ›  Potential for impact/collaborations »  Defines the core data model used by the RO Digital Library service and the Command Line Tool developed in WP1.
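
The aggregation-plus-annotation structure described on this slide might be serialised roughly as follows. This is a hedged Python sketch that emits N-Triples using only the standard library. The OAI-ORE terms (`ore:Aggregation`, `ore:aggregates`) come from the published ORE vocabulary; the `ro` and `ao` namespace URIs and the terms `ResearchObject`, `Annotation`, `annotatesResource` and `body` are assumptions that should be checked against the ontologies themselves, and `describe_research_object` and all `example.org` URIs are hypothetical.

```python
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
ORE = "http://www.openarchives.org/ore/terms/"
# Assumed namespaces for the RO core and the Annotation Ontology.
RO = "http://purl.org/wf4ever/ro#"
AO = "http://purl.org/ao/"


def triple(subject: str, predicate: str, obj: str) -> str:
    """Format one N-Triples statement whose terms are all URIs."""
    return f"<{subject}> <{predicate}> <{obj}> ."


def describe_research_object(ro_uri, resources, annotations):
    """Return N-Triples lines for an RO that aggregates `resources`.

    `annotations` is a list of (annotation_uri, target_uri, body_uri).
    """
    lines = [
        triple(ro_uri, RDF_TYPE, RO + "ResearchObject"),
        triple(ro_uri, RDF_TYPE, ORE + "Aggregation"),
    ]
    for resource in resources:
        lines.append(triple(ro_uri, ORE + "aggregates", resource))
    for annotation, target, body in annotations:
        # The annotation is itself aggregated, then linked to target and body.
        lines.append(triple(ro_uri, ORE + "aggregates", annotation))
        lines.append(triple(annotation, RDF_TYPE, AO + "Annotation"))
        lines.append(triple(annotation, AO + "annotatesResource", target))
        lines.append(triple(annotation, AO + "body", body))
    return lines


manifest = describe_research_object(
    "http://example.org/ro/1/",
    resources=[
        "http://example.org/ro/1/workflow.t2flow",
        "http://example.org/ro/1/input.csv",
    ],
    annotations=[
        (
            "http://example.org/ro/1/ann1",
            "http://example.org/ro/1/workflow.t2flow",
            "http://example.org/ro/1/ann1-body.ttl",
        )
    ],
)
print("\n".join(manifest))
```

Plain string formatting is used here only to keep the sketch dependency-free; an RDF library would be the more robust choice for anything beyond illustration.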
  • 34. WP2 - Workflow Lifecycle Management Workflow Description (wfdesc) »  Model providing initial descriptions of workflows ›  Process instances ›  Linked via input/output/parameters. ›  Support for the tasks of workflow abstraction, indexing, classifications, and general workflow analysis. ›  Generic technologies, adaptable to different domains using specific catalogues, e.g. SADI framework. ›  Reflects explicit focus on workflow-centric ROs »  Evolved from the OPMW ontology by Wf4Ever staff member Daniel Garijo and Yolanda Gil. »  Tooling generating wfdesc descriptions from aggregated Taverna workflows has been developed. ›  Descriptions already used by the Workflow Recommendation Service for inspecting workflow structures and service interconnections (WP3).
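
The idea of processes linked via inputs and outputs can be sketched as a small data structure. The Python below is illustrative only and does not reproduce the wfdesc ontology: the class names `Process` and `Workflow`, the methods `connect` and `downstream`, and the example process names are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass(frozen=True)
class Process:
    """A workflow step with named input and output ports."""
    name: str
    inputs: Tuple[str, ...] = ()
    outputs: Tuple[str, ...] = ()


@dataclass
class Workflow:
    name: str
    processes: List[Process] = field(default_factory=list)
    # Each link is ((source process, output port), (sink process, input port)).
    links: List[tuple] = field(default_factory=list)

    def connect(self, source: str, out_port: str, sink: str, in_port: str) -> None:
        """Link an output port of one process to an input port of another."""
        by_name = {p.name: p for p in self.processes}
        if out_port not in by_name[source].outputs:
            raise ValueError(f"{source} has no output port {out_port}")
        if in_port not in by_name[sink].inputs:
            raise ValueError(f"{sink} has no input port {in_port}")
        self.links.append(((source, out_port), (sink, in_port)))

    def downstream(self, source: str) -> List[str]:
        """Names of processes that directly consume output from `source`."""
        return sorted({sink for (src, _), (sink, _) in self.links if src == source})


wf = Workflow(
    "example",
    processes=[
        Process("fetch", outputs=("records",)),
        Process("filter", inputs=("records",), outputs=("selected",)),
        Process("plot", inputs=("selected",)),
    ],
)
wf.connect("fetch", "records", "filter", "records")
wf.connect("filter", "selected", "plot", "selected")
```

A structure of this kind is enough to answer simple interconnection questions, such as which steps depend on a given service, which is the sort of inspection the slide attributes to the recommendation service.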
  • 35. WP2 - Workflow Lifecycle Management Workflow Provenance (wfprov) »  A provenance convergence layer ›  Potential for links to OPM-V or PROV-O. ›  Mappings to OPM-V and PROV-O are under development ›  A placeholder for the v0.1 ontology suite »  A Taverna plugin has been developed in WP4 that exports Taverna provenance in PROV-O format »  A prototype conversion agent that generates wfprov descriptions from PROV-O has been developed. wfprov data will primarily be used by Integrity and Authenticity (WP4) to inspect workflow executions. »  More extended modeling and descriptions of provenance information will be reported in WP4.
  • 36. ROs are Technical and Social •  An artefact to support preservation of the method, data etc. •  Technical details of platform, services etc. •  A record of an investigation or experiment •  A mechanism for communication, packaging, sharing, publishing, finding •  An object that connects people together. De Roure et al., Social Scientific Objects, 1st International Workshop on Social Object Networks, Boston, 2011 http://users.ox.ac.uk/~oerc0033/preprints/myExpSocialObjects.pdf
  • 37. Where Next/Challenges •  Prototype development •  Models for Research Objects –  Vocabularies •  Refinement of lifecycle states –  Versioning and Evolution •  Provenance –  RO components –  The RO itself •  Trust http://www.flickr.com/photos/marsdd/2986989396/
  • 38. Music
  • 39. Music •  Music IR and Linked Data –  Publication of collections   eTree   Million Song Dataset   Benefits? •  Music IR and ROs –  What are the Research Objects of Music IR? –  Intermediate results/feature sets •  Ontologies and vocabularies for describing results/feature sets
  • 40. Thanks! •  Manchester Information Management Group –  http://img.cs.manchester.ac.uk •  myGrid Team –  http://www.mygrid.org.uk/ •  Wf4Ever Team –  http://www.wf4ever-project.org/