SEEK for Science 
Alessandro Borsoi 
11.09.2014, EDUCAFE, METID – Politecnico di Milano
what it is (1) 
SEEK is a storage platform designed to 
facilitate heterogeneous data and model 
storage and sharing, across multi-group 
scientific projects.
what it is (2) 
SEEK is an open-source, web-based platform 
and suite of software tools for the 
management, linking, exploration and 
exchange of Systems Biology data, models 
and Standard Operating Procedures (SOPs). 
SEEK is designed to facilitate data sharing 
and collaborations between scientists.
who 
Developed by: 
- a team at the University of Manchester in the 
United Kingdom 
- the Heidelberg Institute for Theoretical Studies in 
Germany 
Funded by: 
- the BBSRC in the UK 
- the BMBF in Germany 
as part of the SysMO-DB project
story 
SEEK was conceived as part of SysMO, a pan-European 
initiative to record and describe dynamic molecular 
processes in unicellular organisms: from laboratory to 
mathematical model. 
SEEK grew organically with the project's needs, informed by 
a core user-focus group known as the SysMO PALs. 
SEEK is now the central hub for the SysMO community to 
store and share a wide variety of data, from collection to 
publication, for both laboratory and computational 
experiments.
data (1) 
SEEK ‘data’ types: 
- data generated by high-throughput experiments. 
- data arising from low-throughput, cumulative experiments in the form of: 
  - raw data, i.e. single pieces of data belonging to a larger data series, non-replicated 
    data, non-quantified data. 
  - experimental results, i.e. reliable, quantified and repeated data series, including 
    high-throughput data. 
  - calculated data, i.e. involving further analysis of raw data. 
  - image data. 
- data arising from biological modelling. 
- models generated by a systems biology approach. 
- parameterisations of models. 
- validation data for models. 
- metadata, i.e. data providing information about one or more pieces of data. 
- processes used to design the experiments, generate the data, and generate the models, 
  i.e. standard operating procedures (SOPs), spreadsheets, workflows.
data (2) 
Data Catalogue 
The data catalogue in SEEK includes raw Datasets, Standard Operating Procedures 
(SOPs), Models, Publications and Presentations. All data are grouped by projects, and 
associated with the researchers who produced them. In order to encourage sharing of data 
we allow researchers flexibility in the formats they upload and share their data in. This 
means data formats in the SEEK catalogue can vary. We do offer a set of “best practice” 
guidelines for researchers who want to make their data available and usable to the widest 
possible audience. 
Most common formats allow viewing within the browser, without a download, with additional 
enhanced features for spreadsheets and SBML models. 
As a dynamic service, SEEK aims to expand functionality provided for data types and 
formats as the needs arise. Where SEEK does not appear to support a data type or format, 
a request can be placed to extend SEEK for this data. 
All data and information added to SEEK are searchable by keyword.
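To illustrate what keyword search over a catalogue means, here is a minimal sketch in Python. It is not SEEK's implementation: the `keyword_search` function and the dictionary-based catalogue are hypothetical, and the matching rule (every keyword must appear in the title or description, ignoring case) is an assumption made for the example.

```python
from typing import Dict, List


def keyword_search(catalogue: Dict[str, str], query: str) -> List[str]:
    """Return titles whose title or description contains every keyword.

    `catalogue` maps an asset title to a free-text description.
    Matching is case-insensitive substring matching (an assumption).
    """
    terms = query.lower().split()
    return sorted(
        title
        for title, description in catalogue.items()
        if all(term in f"{title} {description}".lower() for term in terms)
    )


# Hypothetical catalogue entries, for illustration only.
catalogue = {
    "Glucose pulse time course": "Raw dataset, metabolite concentrations",
    "Yeast glycolysis model": "SBML model with glucose uptake kinetics",
    "RNA extraction SOP": "Standard operating procedure for RNA extraction",
}
```

Calling `keyword_search(catalogue, "glucose model")` would return only the glycolysis model, because both keywords must match.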
data (3) 
Organise and store your 
data 
SEEK has adopted 
an ISA-TAB-style structure for 
organising experiments and data.
data (4) 
ISA and Interlinking 
Data in SEEK gain increased value and usability when they are described within the 
context of an experimental process. Multiple experiments will be carried out as part of a 
single Study, and that Study may be part of a wider overall funded Investigation. In SEEK 
we adopt the ISA-TAB structure (Investigation, Study, Assay), which is a community 
standard for describing links between Omics experiments. We believe that many aspects of 
the ISA framework are equally appropriate for describing experiments beyond Omics and 
Biology, so we allow this framework to be applied to all data. 
Beyond the ISA framework, SEEK allows data to be interlinked within the site itself in order 
to describe their relationship. 
If research resulted in a publication, this can also be registered with SEEK (including 
attribution to the relevant people) using a PubMed identifier or DOI, and linked to the 
assets involved in that research, so that other researchers can use, examine, or 
validate data that would otherwise be unavailable through the publication alone.
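The Investigation, Study, Assay nesting described above can be sketched as a small data structure. This is an illustrative Python model only, not SEEK's schema: the class and field names are assumptions chosen to mirror the ISA terms, and the example titles and file names are made up.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Assay:
    """A single experiment, with the data files it produced."""
    title: str
    data_files: List[str] = field(default_factory=list)


@dataclass
class Study:
    """A group of related assays."""
    title: str
    assays: List[Assay] = field(default_factory=list)


@dataclass
class Investigation:
    """The overall (funded) piece of work, made up of studies."""
    title: str
    studies: List[Study] = field(default_factory=list)

    def all_data_files(self) -> List[str]:
        """Collect every data file reachable from this investigation."""
        return [
            data_file
            for study in self.studies
            for assay in study.assays
            for data_file in assay.data_files
        ]


# Hypothetical example content.
investigation = Investigation(
    "Example funded investigation",
    studies=[
        Study(
            "Growth under stress",
            assays=[
                Assay("Microarray run", ["expression.xlsx"]),
                Assay("Metabolite time course", ["metabolites.csv"]),
            ],
        )
    ],
)
```

Because each data file hangs off an Assay, which hangs off a Study, the experimental context of any file can be recovered by walking up the hierarchy.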
data (5) 
Explore and annotate data 
Excel spreadsheets can be explored and annotated without the need to 
download.
data (6) 
Semantic spreadsheet templates 
Using RightField we have produced a wide collection of template files.
data (7) 
Versioning 
All data are stored using versioning, selectable privacy, and static URLs. Versioning and 
privacy settings ensure that you can share your most recent data with whom you choose. 
Static URLs ensure that you can be credited directly for all shared work.
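One way to picture how versioning and static URLs fit together is the sketch below. It is a toy model under stated assumptions, not SEEK's code: the `VersionedAsset` class, the `seek.example.org` host, and the `?version=` query parameter are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class VersionedAsset:
    """An asset that keeps every uploaded version under one stable identifier."""
    asset_id: int
    versions: List[bytes] = field(default_factory=list)

    def add_version(self, content: bytes) -> int:
        """Store new content and return its 1-based version number."""
        self.versions.append(content)
        return len(self.versions)

    def latest(self) -> bytes:
        """Return the most recently added content."""
        if not self.versions:
            raise LookupError("asset has no versions yet")
        return self.versions[-1]

    def url(self, version: Optional[int] = None) -> str:
        """Return a URL that never changes as versions are added.

        The URL layout is a made-up placeholder, not SEEK's real scheme.
        """
        base = f"https://seek.example.org/data_files/{self.asset_id}"
        return base if version is None else f"{base}?version={version}"
```

The point of the design is that the unversioned URL stays the same while its content moves forward, so a citation of the URL keeps working, and a citation of a specific version keeps pointing at exactly what was cited.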
data (8) 
Flexible sharing controls 
There is a lot of 
flexibility and control 
over who can see, 
download or edit your 
items.
data (9) 
Access Control 
Data will go through a research lifecycle between collection and publication. In a 
competitive academic environment it is important that the data can be shared with 
collaborators, and then the wider community at appropriate points within the lifecycle. 
SEEK allows users to keep their uploaded data entirely private, to share between 
individuals, then across entire projects, until eventually making it public upon publication.
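The progression from private, to named individuals, to a whole project, to public can be sketched as an ordered set of sharing scopes. The Python below is an illustration under assumptions, not SEEK's permission model: the `Scope` and `SharingPolicy` names and the rule that individually shared users keep access at project scope are choices made for this example.

```python
from dataclasses import dataclass, field
from enum import IntEnum
from typing import Set


class Scope(IntEnum):
    """Sharing scopes, ordered from most to least restrictive."""
    PRIVATE = 0
    INDIVIDUALS = 1
    PROJECT = 2
    PUBLIC = 3


@dataclass
class SharingPolicy:
    owner: str
    scope: Scope = Scope.PRIVATE
    shared_with: Set[str] = field(default_factory=set)      # named individuals
    project_members: Set[str] = field(default_factory=set)  # whole project

    def can_view(self, user: str) -> bool:
        """Decide whether `user` may view the item at the current scope."""
        if user == self.owner or self.scope == Scope.PUBLIC:
            return True
        if self.scope == Scope.PROJECT:
            return user in self.project_members or user in self.shared_with
        if self.scope == Scope.INDIVIDUALS:
            return user in self.shared_with
        return False  # PRIVATE: owner only
```

Widening access over the research lifecycle then amounts to raising `scope` one step at a time, without having to re-upload or move the data.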
SBML models (1) 
Simulate SBML models 
Most models that conform to the SBML format can be simulated within 
SEEK.
SBML models (2) 
Model simulation and annotation 
If models follow the SBML standard, they can be simulated, or annotated and re-added as a 
new version, all within SEEK. 
The JWS Online model simulator presents a schematic diagram of the model, and allows 
parameters and reactions to be modified for the simulation. 
Models can also be edited using JWS Online OneStop, semantically annotated with 
MIRIAM annotations, and then saved back to SEEK as a new version.
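To give a feel for what modifying parameters for a simulation involves, the sketch below reads parameter values from a tiny SBML document and applies overrides to a copy. It uses only the Python standard library and does not use or imitate JWS Online or SEEK: the toy model, the function names, and the restriction to SBML Level 3 Version 1 are assumptions made for this example.

```python
import xml.etree.ElementTree as ET
from typing import Dict

# Namespace of SBML Level 3 Version 1 core; other levels use other namespaces.
SBML_NS = "http://www.sbml.org/sbml/level3/version1/core"

# A made-up, minimal model used only for illustration.
EXAMPLE_SBML = f"""
<sbml xmlns="{SBML_NS}" level="3" version="1">
  <model id="toy">
    <listOfParameters>
      <parameter id="k1" value="0.5" constant="true"/>
      <parameter id="k2" value="2.0" constant="true"/>
    </listOfParameters>
  </model>
</sbml>
""".strip()


def read_parameters(sbml_text: str) -> Dict[str, float]:
    """Return the global parameters of an SBML L3V1 document as id -> value."""
    root = ET.fromstring(sbml_text)
    if root.tag != f"{{{SBML_NS}}}sbml":
        raise ValueError("not an SBML Level 3 Version 1 document")
    return {
        param.get("id"): float(param.get("value"))
        for param in root.iter(f"{{{SBML_NS}}}parameter")
    }


def with_overrides(params: Dict[str, float], overrides: Dict[str, float]) -> Dict[str, float]:
    """Return a copy of `params` with some values replaced for one simulation run."""
    unknown = set(overrides) - set(params)
    if unknown:
        raise KeyError(f"unknown parameter ids: {sorted(unknown)}")
    return {**params, **overrides}
```

Keeping the overrides separate from the stored model mirrors the idea that a changed model is saved back as a new version rather than overwriting the original.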
people (1) 
Who's doing what, where? 
You can find out what 
people are using and 
have expertise in, and 
how to get in contact 
with them.
people (2) 
People index 
SEEK contains an index of people where users can browse, or keyword-search, profiles of 
the projects, groups and people that contribute to the data on the site. People can describe 
their areas of expertise, which allows other users to quickly identify the right people to 
approach regarding specialist enquiries and collaboration proposals.
people (3) 
PALs 
SEEK has a varied network of scientists, known as SysMO 
PALs, who represent a wide but typical user base. Through 
regular meetings with these PALs we have developed, and continue 
to develop, a platform that is tailored in functionality and 
usability to you, the scientist.
platform 
PLATFORM
end 
END
