SlideShare una empresa de Scribd logo
1 de 13
Descargar para leer sin conexión
Model management tools for improved
reproducibility in systems biology
Dagmar Waltemath,
on behalf of the SEMS team
University of Rostock, Germany
10th
International CellML Workshop
Auckland, June 2016
2
On models and simulations
Model Simulation
Figs: BioModels (top) and DOI: 10.1073/pnas.88.16.7328 (bottom)
3
Most scientific discoveries rely on previous findings.
Model
Fig.: Tyson 2001 (BIOM195)
Fig.: Tyson 1991 (BIOM005)
Successor
Fig.: History of Cell Cycle models in BioModels
4
Can we rely on findings that we ourselves
cannot evaluate? (Probably not!)
“only in ~20–25% of the projects were the relevant published data completely
in line with our in-house findings (Fig. 1c). In almost two-thirds of the projects,
there were inconsistencies [..] that either considerably prolonged the duration
of the target validation process or, in most cases, resulted in termination of the
projects because the evidence [..] was insufficient to justify further investments
into these projects.” Prinz et al (2011)
5
We identified key challenges of reproducibility in
systems biology and systems medicine.
Lack of data standards – Lack of data quality and quantity – Lack of data availability – Lack of transparency
6
A lack of data availability makes it impossible
for researchers to reproduce results.
● Model code in BioModels, including
supplemental with a how-to reproduce
the figures given in the original paper
● Online tool makes data available
and browseable
TriplexRNA
Recon 2Recon 2
● Publication backed up with a website
containing the supplemental material
● Model code in (non-curated) BioModels
● Visualisation of the model can easily
be explored
● References to original works
How can we support scientists
who wish to share model-based results?
Issues
– Simulation studies comprise
of several files
– Data is heterogeneous,
distributed, complex
– Data changes over time
– Documentation of the how
the study was performed
often missing
7
A lack of data availability makes it impossible
for researchers to reproduce results.
How can we support scientists
who wish to share model-based results?
Issues
– Simulation studies comprise
of several files
– Data is heterogeneous,
distributed, complex
– Data changes over time
– Documentation of the how
the study was performed
often missing
Our solutions
– Tool support for the
COMBINE Archive –
lowering the effort to share
reproducible models
– Graph-based storage of
model-related files –
integrated & searchable
virtual experiments
– Model version control –
towards a provenance of
models
8
The COMBINE archive bundles all files necessary
to reproduce a simulation study.
COMBINE archive toolkit
● manage COMBINE archives
– Explore
– Edit
– Share
– Publish
● Used in: PMR 2, JWS Online,
SED-ML Web Tools, OpenCor …
WebCAT, Scharm et al 2014
9
STON, SED-ML DB & MASYMOS
Integrated storage & retrieval
system (MASYMOS)
doi: 10.1093/database/bau130
doi: 10.1186/s13326-015-0014-4
Search across heterogeneous data,
ontologies, and structures→poster
Tailor-made storage systems
(STON, SED-ML DB)
Using graph databases to integrate
standardised model-based data
https://dx.doi.org/10.6084/m9.figshare.3382993.v1
SED-ML DB in JWS Online
BioModelsPhysiome Model repository
10
BiVeS & COMODI
Model version control
(BiVeS, COMODI)
Provenance-to-be (COMODI)
Tracking the evolution of a
CellML/SBML model over time
doi: 10.1093/bioinformatics/btv484
Tracking the evolution of simulation
studies and biological systems.
https://dx.doi.org/10.6084/m9.figshare.2543059.v5
Physiome Model repository
doi: 10.1093/bioinformatics/btv484
11
What's next? Models for the clinic, or: Bridging the gap
between standards for systems biology & systems medicine
Fig. courtesy Atalag et al (2015) http://hdl.handle.net/2292/27911
Thank you for your attention.
m n @SemsProject
Martin Scharm
BiVeS, COMODI,
COMBINE Archive
Video master
Fabienne Lambusch
Pattern & structure search
in SBML models
Mariam Nassar
Rank aggregation
Tom Gebhardt
SBGN-compliant diffs
Martin Peters
M2CAT, COMBINE Archive,
SED-ML database
Vasundra Toure
STON, SBGN-ED,
SBGN symbol of the month
Ron Henkel
MASYMOS, MORRE
www.sems.uni-rostock.de
References
Atalag et al (2015) http://hdl.handle.net/2292/27911
Bergmann et al. (2014) F.T. Bergmann, R. Adams, S. Moodie, J. Cooper, M. Glont et al.: COMBINE archive
and OMEX format: one file to share all information to reproduce a modeling project. BMC Bioinformatics
(2014)
Prinz et al. (2011) Prinz, Florian, Thomas Schlange, and Khusru Asadullah. "Believe it or not: how much
can we rely on published data on potential drug targets?." Nature reviews Drug discovery 10.9 (2011): 712-
712.
Schmitz et al. (2014) Schmitz, Ulf, et al. "Cooperative gene regulation by microRNA pairs and their
identification using a computational workflow." Nucleic acids research (2014): gku465.
Thiele et al. (2013) Thiele, Ines, et al. "A community-driven global reconstruction of human metabolism."
Nature biotechnology 31.5 (2013): 419-425.
Waltemath & Scharm (2014) D. Waltemath and M. Scharm: Extracting reproducible simulation studies
from model repositories using the CombineArchive Toolkit. Workshop on Data Management for the Life
Sciences (2014), Hamburg, BTW 2014.
Waltemath & Wolkenhauer (2016) D. Waltemath and O. Wolkenhauer: How modeling standards, software,
and initiatives support reproducibility in systems biology and systems medicine. IEEE Transactions on
Biomedical Engineering (2016) in the press.

Más contenido relacionado

La actualidad más candente

FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)
Carole Goble
 
bio data
bio databio data
bio data
007dcp
 
BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...
Alejandra Gonzalez-Beltran
 

La actualidad más candente (20)

The FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems BiologyThe FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems Biology
 
Model management for systems biology projects
Model management for systems biology projectsModel management for systems biology projects
Model management for systems biology projects
 
Reproducible and citable data and models: an introduction.
Reproducible and citable data and models: an introduction.Reproducible and citable data and models: an introduction.
Reproducible and citable data and models: an introduction.
 
Model repositories and standard formats for model reusability
Model repositories and standard formats for model reusabilityModel repositories and standard formats for model reusability
Model repositories and standard formats for model reusability
 
Improving the Management of Computational Models -- Invited talk at the EBI
Improving the Management of Computational Models -- Invited talk at the EBIImproving the Management of Computational Models -- Invited talk at the EBI
Improving the Management of Computational Models -- Invited talk at the EBI
 
A global integrative ecosystem for digital pathology: how can we get there?
A global integrative ecosystem for digital pathology: how can we get there?A global integrative ecosystem for digital pathology: how can we get there?
A global integrative ecosystem for digital pathology: how can we get there?
 
Reproducibility of model-based results: standards, infrastructure, and recogn...
Reproducibility of model-based results: standards, infrastructure, and recogn...Reproducibility of model-based results: standards, infrastructure, and recogn...
Reproducibility of model-based results: standards, infrastructure, and recogn...
 
FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)
 
FuGE Update
FuGE UpdateFuGE Update
FuGE Update
 
is there life between standards? Data interoperability for AI.
is there life between standards? Data interoperability for AI.is there life between standards? Data interoperability for AI.
is there life between standards? Data interoperability for AI.
 
FAIR data and model management for systems biology.
FAIR data and model management for systems biology.FAIR data and model management for systems biology.
FAIR data and model management for systems biology.
 
Making your data good enough for sharing.
Making your data good enough for sharing.Making your data good enough for sharing.
Making your data good enough for sharing.
 
bio data
bio databio data
bio data
 
Investigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisInvestigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysis
 
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
 
Report of the second FAIRDOM foundry
Report of the second FAIRDOM foundryReport of the second FAIRDOM foundry
Report of the second FAIRDOM foundry
 
Citing data in research articles: principles, implementation, challenges - an...
Citing data in research articles: principles, implementation, challenges - an...Citing data in research articles: principles, implementation, challenges - an...
Citing data in research articles: principles, implementation, challenges - an...
 
UKON 2014
UKON 2014UKON 2014
UKON 2014
 
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
 
BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...BioSharing.org - mapping the landscape of community standards, databases, dat...
BioSharing.org - mapping the landscape of community standards, databases, dat...
 

Destacado

Central dogma
Central dogmaCentral dogma
Central dogma
AIT
 

Destacado (7)

Science in the Open - Science Commons Pacific Northwest
Science in the Open - Science Commons Pacific NorthwestScience in the Open - Science Commons Pacific Northwest
Science in the Open - Science Commons Pacific Northwest
 
What is Open Science and what role does it play in Development?
What is Open Science and what role does it play in Development?What is Open Science and what role does it play in Development?
What is Open Science and what role does it play in Development?
 
Presentation on Open Science and its 'Impacts';
Presentation on Open Science and its 'Impacts'; Presentation on Open Science and its 'Impacts';
Presentation on Open Science and its 'Impacts';
 
Open science
Open scienceOpen science
Open science
 
Unit 1, Lesson 1.8 - The Scientific Method (Part Two)
Unit 1, Lesson 1.8 - The Scientific Method (Part Two)Unit 1, Lesson 1.8 - The Scientific Method (Part Two)
Unit 1, Lesson 1.8 - The Scientific Method (Part Two)
 
Open Science at the European Commission
Open Science at the European CommissionOpen Science at the European Commission
Open Science at the European Commission
 
Central dogma
Central dogmaCentral dogma
Central dogma
 

Similar a Model management tools for improved reproducibility in systems biology

Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Carole Goble
 
Sarah Callaghan Research Data Overview
Sarah Callaghan Research Data OverviewSarah Callaghan Research Data Overview
Sarah Callaghan Research Data Overview
OpenAIRE
 
Let’s go on a FAIR safari!
Let’s go on a FAIR safari!Let’s go on a FAIR safari!
Let’s go on a FAIR safari!
Carole Goble
 

Similar a Model management tools for improved reproducibility in systems biology (20)

Model Management in Systems Biology: Challenges – Approaches – Solutions
Model Management in Systems Biology: Challenges – Approaches – SolutionsModel Management in Systems Biology: Challenges – Approaches – Solutions
Model Management in Systems Biology: Challenges – Approaches – Solutions
 
SEEKing our way to better presentation of data and models from scientific inv...
SEEKing our way to better presentation of data and models from scientific inv...SEEKing our way to better presentation of data and models from scientific inv...
SEEKing our way to better presentation of data and models from scientific inv...
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017
 
FAIR data management in biomedicine
FAIR data management  in biomedicineFAIR data management  in biomedicine
FAIR data management in biomedicine
 
Group meeting in Manchester.
Group meeting in Manchester.Group meeting in Manchester.
Group meeting in Manchester.
 
Metid Match 2014 - SEEK for Science
Metid Match 2014 - SEEK for ScienceMetid Match 2014 - SEEK for Science
Metid Match 2014 - SEEK for Science
 
Management of simulation studies in computational biology
Management of simulation studies in computational biologyManagement of simulation studies in computational biology
Management of simulation studies in computational biology
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)
 
Case Study Life Sciences Data: Central for Integrative Systems Biology and Bi...
Case Study Life Sciences Data: Central for Integrative Systems Biology and Bi...Case Study Life Sciences Data: Central for Integrative Systems Biology and Bi...
Case Study Life Sciences Data: Central for Integrative Systems Biology and Bi...
 
Minimal viable data reuse
Minimal viable data reuseMinimal viable data reuse
Minimal viable data reuse
 
The Research Object Initiative: Frameworks and Use Cases
The Research Object Initiative:Frameworks and Use CasesThe Research Object Initiative:Frameworks and Use Cases
The Research Object Initiative: Frameworks and Use Cases
 
e:Bio Kick-Off Meeting, SEMS
e:Bio Kick-Off Meeting, SEMSe:Bio Kick-Off Meeting, SEMS
e:Bio Kick-Off Meeting, SEMS
 
2_ResearchDataOverview_SarahCallaghan
2_ResearchDataOverview_SarahCallaghan2_ResearchDataOverview_SarahCallaghan
2_ResearchDataOverview_SarahCallaghan
 
Sarah Callaghan Research Data Overview
Sarah Callaghan Research Data OverviewSarah Callaghan Research Data Overview
Sarah Callaghan Research Data Overview
 
Introduction to FAIRDOM
Introduction to FAIRDOMIntroduction to FAIRDOM
Introduction to FAIRDOM
 
Let’s go on a FAIR safari!
Let’s go on a FAIR safari!Let’s go on a FAIR safari!
Let’s go on a FAIR safari!
 
FAIR BioData Management
FAIR BioData ManagementFAIR BioData Management
FAIR BioData Management
 
Sems project overview
Sems project overviewSems project overview
Sems project overview
 
SoundSoftware: Software Sustainability for audio and Music Researchers
SoundSoftware: Software Sustainability for audio and Music Researchers SoundSoftware: Software Sustainability for audio and Music Researchers
SoundSoftware: Software Sustainability for audio and Music Researchers
 

Más de University Medicine Greifswald

Possibilities for integrating model-related data in computational biology (DI...
Possibilities for integrating model-related data in computational biology (DI...Possibilities for integrating model-related data in computational biology (DI...
Possibilities for integrating model-related data in computational biology (DI...
University Medicine Greifswald
 

Más de University Medicine Greifswald (16)

A guide to the COMBINE: Navigating through specifications, mailing lists and ...
A guide to the COMBINE: Navigating through specifications, mailing lists and ...A guide to the COMBINE: Navigating through specifications, mailing lists and ...
A guide to the COMBINE: Navigating through specifications, mailing lists and ...
 
2019 07-04-model reuse-bonn
2019 07-04-model reuse-bonn2019 07-04-model reuse-bonn
2019 07-04-model reuse-bonn
 
Mehr Medizininformatik am Meer
Mehr Medizininformatik am MeerMehr Medizininformatik am Meer
Mehr Medizininformatik am Meer
 
Implementierung Graph-basierter Ansätze für das Management systembiologischer...
Implementierung Graph-basierter Ansätze für das Management systembiologischer...Implementierung Graph-basierter Ansätze für das Management systembiologischer...
Implementierung Graph-basierter Ansätze für das Management systembiologischer...
 
Using Neo4j technologies for the management of systems biology models
Using Neo4j technologies for the management of systems biology modelsUsing Neo4j technologies for the management of systems biology models
Using Neo4j technologies for the management of systems biology models
 
Identifying pattern in reaction networks of computational models
Identifying pattern in reaction networks of computational modelsIdentifying pattern in reaction networks of computational models
Identifying pattern in reaction networks of computational models
 
Extended support for standard graphical notations of biological networks in s...
Extended support for standard graphical notations of biological networks in s...Extended support for standard graphical notations of biological networks in s...
Extended support for standard graphical notations of biological networks in s...
 
Modelling sample at SEMS from a graph perspective
Modelling sample at SEMS from a graph perspectiveModelling sample at SEMS from a graph perspective
Modelling sample at SEMS from a graph perspective
 
Coming Soon: de.NBI and SBGN-ED @ SEMS
Coming Soon: de.NBI and SBGN-ED @ SEMSComing Soon: de.NBI and SBGN-ED @ SEMS
Coming Soon: de.NBI and SBGN-ED @ SEMS
 
Masymos: Finding hidden treasures in model repositories
Masymos: Finding hidden treasures in model repositoriesMasymos: Finding hidden treasures in model repositories
Masymos: Finding hidden treasures in model repositories
 
Reproducibility, dissemination, and management of modeling results
Reproducibility, dissemination,  and management of modeling resultsReproducibility, dissemination,  and management of modeling results
Reproducibility, dissemination, and management of modeling results
 
Possibilities for integrating model-related data in computational biology (DI...
Possibilities for integrating model-related data in computational biology (DI...Possibilities for integrating model-related data in computational biology (DI...
Possibilities for integrating model-related data in computational biology (DI...
 
SEMS: Model search and ranked Retrieval (Ron Henkel)
SEMS: Model search and ranked Retrieval (Ron Henkel)SEMS: Model search and ranked Retrieval (Ron Henkel)
SEMS: Model search and ranked Retrieval (Ron Henkel)
 
Simulation experiment descriptions and management
Simulation experiment descriptions and managementSimulation experiment descriptions and management
Simulation experiment descriptions and management
 
Bio-Model Meta-Information and SED-ML
Bio-Model Meta-Information and SED-MLBio-Model Meta-Information and SED-ML
Bio-Model Meta-Information and SED-ML
 
Meta-Information for Bio-Models
Meta-Information for Bio-ModelsMeta-Information for Bio-Models
Meta-Information for Bio-Models
 

Último

Heat Units in plant physiology and the importance of Growing Degree days
Heat Units in plant physiology and the importance of Growing Degree daysHeat Units in plant physiology and the importance of Growing Degree days
Heat Units in plant physiology and the importance of Growing Degree days
Brahmesh Reddy B R
 
Tuberculosis (TB)-Notes.pdf microbiology notes
Tuberculosis (TB)-Notes.pdf microbiology notesTuberculosis (TB)-Notes.pdf microbiology notes
Tuberculosis (TB)-Notes.pdf microbiology notes
jyothisaisri
 
Continuum emission from within the plunging region of black hole discs
Continuum emission from within the plunging region of black hole discsContinuum emission from within the plunging region of black hole discs
Continuum emission from within the plunging region of black hole discs
Sérgio Sacani
 

Último (20)

MODERN PHYSICS_REPORTING_QUANTA_.....pdf
MODERN PHYSICS_REPORTING_QUANTA_.....pdfMODERN PHYSICS_REPORTING_QUANTA_.....pdf
MODERN PHYSICS_REPORTING_QUANTA_.....pdf
 
Heads-Up Multitasker: CHI 2024 Presentation.pdf
Heads-Up Multitasker: CHI 2024 Presentation.pdfHeads-Up Multitasker: CHI 2024 Presentation.pdf
Heads-Up Multitasker: CHI 2024 Presentation.pdf
 
Heat Units in plant physiology and the importance of Growing Degree days
Heat Units in plant physiology and the importance of Growing Degree daysHeat Units in plant physiology and the importance of Growing Degree days
Heat Units in plant physiology and the importance of Growing Degree days
 
VILLAGE ATTACHMENT For rural agriculture PPT.pptx
VILLAGE ATTACHMENT For rural agriculture  PPT.pptxVILLAGE ATTACHMENT For rural agriculture  PPT.pptx
VILLAGE ATTACHMENT For rural agriculture PPT.pptx
 
Tuberculosis (TB)-Notes.pdf microbiology notes
Tuberculosis (TB)-Notes.pdf microbiology notesTuberculosis (TB)-Notes.pdf microbiology notes
Tuberculosis (TB)-Notes.pdf microbiology notes
 
In-pond Race way systems for Aquaculture (IPRS).pptx
In-pond Race way systems for Aquaculture (IPRS).pptxIn-pond Race way systems for Aquaculture (IPRS).pptx
In-pond Race way systems for Aquaculture (IPRS).pptx
 
ANALEPTICS Mrs Namrata Sanjay Mane  Department of Pharmaceutical Chemistry...
ANALEPTICS  Mrs Namrata Sanjay  Mane   Department of Pharmaceutical Chemistry...ANALEPTICS  Mrs Namrata Sanjay  Mane   Department of Pharmaceutical Chemistry...
ANALEPTICS Mrs Namrata Sanjay Mane  Department of Pharmaceutical Chemistry...
 
WASP-69b’s Escaping Envelope Is Confined to a Tail Extending at Least 7 Rp
WASP-69b’s Escaping Envelope Is Confined to a Tail Extending at Least 7 RpWASP-69b’s Escaping Envelope Is Confined to a Tail Extending at Least 7 Rp
WASP-69b’s Escaping Envelope Is Confined to a Tail Extending at Least 7 Rp
 
THE GENERAL PROPERTIES OF PROTEOBACTERIA AND ITS TYPES
THE GENERAL PROPERTIES OF PROTEOBACTERIA AND ITS TYPESTHE GENERAL PROPERTIES OF PROTEOBACTERIA AND ITS TYPES
THE GENERAL PROPERTIES OF PROTEOBACTERIA AND ITS TYPES
 
Introduction and significance of Symbiotic algae
Introduction and significance of  Symbiotic algaeIntroduction and significance of  Symbiotic algae
Introduction and significance of Symbiotic algae
 
Continuum emission from within the plunging region of black hole discs
Continuum emission from within the plunging region of black hole discsContinuum emission from within the plunging region of black hole discs
Continuum emission from within the plunging region of black hole discs
 
SaffronCrocusGenomicsThessalonikiOnlineMay2024TalkOnline.pptx
SaffronCrocusGenomicsThessalonikiOnlineMay2024TalkOnline.pptxSaffronCrocusGenomicsThessalonikiOnlineMay2024TalkOnline.pptx
SaffronCrocusGenomicsThessalonikiOnlineMay2024TalkOnline.pptx
 
NuGOweek 2024 programme final FLYER short.pdf
NuGOweek 2024 programme final FLYER short.pdfNuGOweek 2024 programme final FLYER short.pdf
NuGOweek 2024 programme final FLYER short.pdf
 
TEST BANK for Organic Chemistry 6th Edition.pdf
TEST BANK for Organic Chemistry 6th Edition.pdfTEST BANK for Organic Chemistry 6th Edition.pdf
TEST BANK for Organic Chemistry 6th Edition.pdf
 
FORENSIC CHEMISTRY ARSON INVESTIGATION.pdf
FORENSIC CHEMISTRY ARSON INVESTIGATION.pdfFORENSIC CHEMISTRY ARSON INVESTIGATION.pdf
FORENSIC CHEMISTRY ARSON INVESTIGATION.pdf
 
GBSN - Biochemistry (Unit 8) Enzymology
GBSN - Biochemistry (Unit 8) EnzymologyGBSN - Biochemistry (Unit 8) Enzymology
GBSN - Biochemistry (Unit 8) Enzymology
 
short range interaction for protein and factors influencing or affecting the ...
short range interaction for protein and factors influencing or affecting the ...short range interaction for protein and factors influencing or affecting the ...
short range interaction for protein and factors influencing or affecting the ...
 
Emergent ribozyme behaviors in oxychlorine brines indicate a unique niche for...
Emergent ribozyme behaviors in oxychlorine brines indicate a unique niche for...Emergent ribozyme behaviors in oxychlorine brines indicate a unique niche for...
Emergent ribozyme behaviors in oxychlorine brines indicate a unique niche for...
 
Manganese‐RichSandstonesasanIndicatorofAncientOxic LakeWaterConditionsinGale...
Manganese‐RichSandstonesasanIndicatorofAncientOxic  LakeWaterConditionsinGale...Manganese‐RichSandstonesasanIndicatorofAncientOxic  LakeWaterConditionsinGale...
Manganese‐RichSandstonesasanIndicatorofAncientOxic LakeWaterConditionsinGale...
 
Lubrication System in forced feed system
Lubrication System in forced feed systemLubrication System in forced feed system
Lubrication System in forced feed system
 

Model management tools for improved reproducibility in systems biology

  • 1. Model management tools for improved reproducibility in systems biology Dagmar Waltemath, on behalf of the SEMS team University of Rostock, Germany 10th International CellML Workshop Auckland, June 2016
  • 2. 2 On models and simulations Model Simulation Figs: BioModels (top) and DOI: 10.1073/pnas.88.16.7328 (bottom)
  • 3. 3 Most scientific discoveries rely on previous findings. Model Fig.: Tyson 2001 (BIOM195) Fig.: Tyson 1991 (BIOM005) Successor Fig.: History of Cell Cycle models in BioModels
  • 4. 4 Can we rely on findings that we ourselves cannot evaluate? (Probably not!) “only in ~20–25% of the projects were the relevant published data completely in line with our in-house findings (Fig. 1c). In almost two-thirds of the projects, there were inconsistencies [..] that either considerably prolonged the duration of the target validation process or, in most cases, resulted in termination of the projects because the evidence [..] was insufficient to justify further investments into these projects.” Prinz et al (2011)
  • 5. 5 We identified key challenges of reproducibility in systems biology and systems medicine. Lack of data standards – Lack of data quality and quantity – Lack of data availability – Lack of transparency
  • 6. 6 A lack of data availability makes it impossible for researchers to reproduce results. ● Model code in BioModels, including supplemental with a how-to reproduce the figures given in the original paper ● Online tool makes data available and browseable TriplexRNA Recon 2Recon 2 ● Publication backed up with a website containing the supplemental material ● Model code in (non-curated) BioModels ● Visualisation of the model can easily be explored ● References to original works How can we support scientists who wish to share model-based results? Issues – Simulation studies comprise of several files – Data is heterogeneous, distributed, complex – Data changes over time – Documentation of the how the study was performed often missing
  • 7. 7 A lack of data availability makes it impossible for researchers to reproduce results. How can we support scientists who wish to share model-based results? Issues – Simulation studies comprise of several files – Data is heterogeneous, distributed, complex – Data changes over time – Documentation of the how the study was performed often missing Our solutions – Tool support for the COMBINE Archive – lowering the effort to share reproducible models – Graph-based storage of model-related files – integrated & searchable virtual experiments – Model version control – towards a provenance of models
  • 8. 8 The COMBINE archive bundles all files necessary to reproduce a simulation study. COMBINE archive toolkit ● manage COMBINE archives – Explore – Edit – Share – Publish ● Used in: PMR 2, JWS Online, SED-ML Web Tools, OpenCor … WebCAT, Scharm et al 2014
  • 9. 9 STON, SED-ML DB & MASYMOS Integrated storage & retrieval system (MASYMOS) doi: 10.1093/database/bau130 doi: 10.1186/s13326-015-0014-4 Search across heterogeneous data, ontologies, and structures→poster Tailor-made storage systems (STON, SED-ML DB) Using graph databases to integrate standardised model-based data https://dx.doi.org/10.6084/m9.figshare.3382993.v1 SED-ML DB in JWS Online BioModelsPhysiome Model repository
  • 10. 10 BiVeS & COMODI Model version control (BiVeS, COMODI) Provenance-to-be (COMODI) Tracking the evolution of a CellML/SBML model over time doi: 10.1093/bioinformatics/btv484 Tracking the evolution of simulation studies and biological systems. https://dx.doi.org/10.6084/m9.figshare.2543059.v5 Physiome Model repository doi: 10.1093/bioinformatics/btv484
  • 11. 11 What's next? Models for the clinic, or: Bridging the gap between standards for systems biology & systems medicine Fig. courtesy Atalag et al (2015) http://hdl.handle.net/2292/27911
  • 12. Thank you for your attention. m n @SemsProject Martin Scharm BiVeS, COMODI, COMBINE Archive Video master Fabienne Lambusch Pattern & structure search in SBML models Mariam Nassar Rank aggregation Tom Gebhardt SBGN-compliant diffs Martin Peters M2CAT, COMBINE Archive, SED-ML database Vasundra Toure STON, SBGN-ED, SBGN symbol of the month Ron Henkel MASYMOS, MORRE www.sems.uni-rostock.de
  • 13. References Atalag et al (2015) http://hdl.handle.net/2292/27911 Bergmann et al. (2014) F.T. Bergmann, R. Adams, S. Moodie, J. Cooper, M. Glont et al.: COMBINE archive and OMEX format: one file to share all information to reproduce a modeling project. BMC Bioinformatics (2014) Prinz et al. (2011) Prinz, Florian, Thomas Schlange, and Khusru Asadullah. "Believe it or not: how much can we rely on published data on potential drug targets?." Nature reviews Drug discovery 10.9 (2011): 712- 712. Schmitz et al. (2014) Schmitz, Ulf, et al. "Cooperative gene regulation by microRNA pairs and their identification using a computational workflow." Nucleic acids research (2014): gku465. Thiele et al. (2013) Thiele, Ines, et al. "A community-driven global reconstruction of human metabolism." Nature biotechnology 31.5 (2013): 419-425. Waltemath & Scharm (2014) D. Waltemath and M. Scharm: Extracting reproducible simulation studies from model repositories using the CombineArchive Toolkit. Workshop on Data Management for the Life Sciences (2014), Hamburg, BTW 2014. Waltemath & Wolkenhauer (2016) D. Waltemath and O. Wolkenhauer: How modeling standards, software, and initiatives support reproducibility in systems biology and systems medicine. IEEE Transactions on Biomedical Engineering (2016) in the press.