SlideShare una empresa de Scribd logo
1 de 19
Descargar para leer sin conexión
Possibilities for Integrating
Model-related Data in Computational
Biology
Databases in Life Sciences, Montreal, July 2013
Dagmar Waltemath, University of Rostock, Germany
Nicolas Le Novère, Babraham Institute, UK
Michel Dumontier, Carleton University, Canada
Archive
Introduction
13-07-12 Integrating model-related data 2
Fig.: DOI: 10.1038/35002125
Introduction
13-07-12 Integrating model-related data 3
Fig.: DOI: 10.1038/35002125
Introduction
13-07-12 Integrating model-related data 4
No and size
of models
time
Fig.: DOI: 10.1038/35002125
Introduction
13-07-12 Integrating model-related data 5
Fig.: DOI: 10.1038/35002125
model reuse – result reproducibility
Introduction
13-07-12 Integrating model-related data 6
Introduction
1. How can we distribute models with all
information necessary to reuse them (MIRIAM)?
2. How can we effectively manage different types
of model-related data?
3. How can we link model-related data to the rest
of the world?
13-07-12 Integrating model-related data 7
1. Distributing models
Archive
13-07-12 Integrating model-related data 8
Frank Bergmann
Nicolas Le Novère
1. Distributing models
The COMBINE archive v0.1
• single “.zip” file
• bundles models and model-related data
• single file
http://co.mbine.org/documents/archive
13-07-12 Integrating model-related data 9
1. A manifest file,
"manifest.xml“,
2. all described files,
3. a metadata file,
"metadata.*“,
4. remaining files.
• All documents necessary
for the description of a
model and all associated
data and procedures.
• In the future: also
references to documents
1. Distributing models
<?xml version="1.0" encoding="utf-8"?>
<omexManifest
xmlns="http://identifiers.org/combine.specifications/omex-manifest">
<content location="./manifest.xml"
format="http://identifiers.org/combine.specifications/omex-
manifest"/>
<content location="./model/model.xml"
format="http://identifiers.org/combine.specifications/sbml"/>
<content location="./simulation.xml"
format="http://identifiers.org/combine.specifications/sedml"/>
<content location="./article.pdf"
format="application/pdf"/>
<content location="./metadata.rdf"
format="http://identifiers.org/combine.specifications/omex-
metadata"/>
</omexManifest>
13-07-12 Integrating model-related data 10
2. Managing models
13-07-12 11Integrating model-related data
Ron Henkel
2. Managing models
• Neo4J database
• Model2graph mapping ( , )
• Rich relations
http://biomodels.net/qualifiers
• Links to annotations
13-07-12 Integrating model-related data 12
“Which models are annotated with
‚Adenosine tri-phosphate‘?“
“Which models contain reactions with ATP
as reactant and ADP as product?”
Document
Model
P E CR S
SBO:0000268 uniprot:P07101 uniprot:Q03393 GO:0005737HGNC:8582
is
isVersionOf
is
isEncodedBy
is
asProduct
asReactant
asModifier
Fig.: Henkel et al. (2012) INFORMATIK 2012, Braunschweig
Document
Model
P E CR S
SBO:0000268 uniprot:P07101 uniprot:Q03393 GO:0005737HGNC:8582
is
isVersionOf
is
isEncodedBy
is
asProduct
asReactant
asModifier
2. Managing models
• Lucene-based ranked retrieval
13-07-12 Integrating model-related data 13
“Give me the best matching model published
about the Cell Cycle and covering forms of cdc.“
Lucene query "cdc*" AND "Cell Cycle"
http://www.ebi.ac.uk/biomodels-demo/
Henkel et al. (2010), Bioinformatics
Fig.: Henkel et al. (2012) INFORMATIK 2012, Braunschweig
2. Managing models
• Representing simulation descriptions
• ... and other types of model-related data
13-07-12 Integrating model-related data 14
“Give me all possible simulations that
show the dependency of the Cell Cycle
on the concentration of cdc25.“
Fig.: Henkel et al. (2012) INFORMATIK 2012, Braunschweig
3. Integrating model data
13-07-12 Integrating model-related data 15
3. Integrating model data
13-07-12 Integrating model-related data 1616
At the heart of Linked Data for the Life Sciences
• Free and open source
• Based on Semantic Web standards
• Billions of interlinked statements from dozens
of conventional and high value datasets
• Partnerships with EBI, NCBI, DBCLS, NCBO,
OpenPHACTS, and commercial tool providers
chemicals/drugs/formulations,
genomes/genes/proteins, domains
Interactions, complexes & pathways
BioModels
animal models and phenotypes
Disease, genetic markers, treatments
Terminologies & publications
3. Integrating model data
# get all biochemical reactions in biomodels that are kinds of "protein
catabolic process“, as defined by the gene ontology (in bioportal endpoint)
SELECT ?go ?label count(distinct ?x)
WHERE {
?go rdfs:label ?label .
?go rdfs:subClassOf ?tgo OPTION (TRANSITIVE) .
?tgo rdfs:label ?tlabel .
FILTER regex(?tlabel, "^protein catabolic process")
service <http://biomodels.bio2rdf.org/sparql> {
?x <http://bio2rdf.org/biopax_vocabulary:identical-to> ?go .
?x a <http://www.biopax.org/release/biopax-level3.owl#BiochemicalReaction> .
}
13-07-12 Integrating model-related data 17
Gene Ontology Annotation Number of Reactions
protein catabolic process [go:0030163] 51
cellular protein catabolic process [go:0044257] 26
modification-dependent protein catabolic process [go:0019941] 1
beta-amyloid formation [go:0034205] 1
“Give me all reactions in BioModels
Database that represent protein
catabolic processes. “
Summary
Approach Features Purpose
COMBINE archive File bundle;
• Easy access to all model-related
data through one single file
Shipping files
Graph-DB (MORRE) Network of interrelated nodes
• IR techniques easily applicable
• No schema
• Link models and simulations
Managing
existing
model data
BIO2RDF Semantic integration of knowledge
• Automated reasoning
• No schema
• Linking into LOD
Full
integration
13-07-12 Integrating model-related data 18
Thank you.
13-07-12 Integrating model-related data 19
http://co.mbine.org/events/COMBINE_2013

Más contenido relacionado

La actualidad más candente

Managing and Analyzing Health Data (VLDB Conference)
Managing and Analyzing Health Data (VLDB Conference)Managing and Analyzing Health Data (VLDB Conference)
Managing and Analyzing Health Data (VLDB Conference)
Peter Speyer
 
Niso usage data forum 2007
Niso usage data forum 2007Niso usage data forum 2007
Niso usage data forum 2007
John McDonald
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
suganmca14
 
Connecting and synchronizing scientific knowledge
Connecting and synchronizing scientific knowledgeConnecting and synchronizing scientific knowledge
Connecting and synchronizing scientific knowledge
Prashant Gupta
 

La actualidad más candente (20)

Students academic performance using clustering technique
Students academic performance using clustering techniqueStudents academic performance using clustering technique
Students academic performance using clustering technique
 
Managing and Analyzing Health Data (VLDB Conference)
Managing and Analyzing Health Data (VLDB Conference)Managing and Analyzing Health Data (VLDB Conference)
Managing and Analyzing Health Data (VLDB Conference)
 
Machine Learning Powered Metabolomic Network Analysis
Machine Learning Powered Metabolomic Network AnalysisMachine Learning Powered Metabolomic Network Analysis
Machine Learning Powered Metabolomic Network Analysis
 
An Open Source Tool for Game Theoretic Health Data De-Identification
An Open Source Tool for Game Theoretic Health Data De-IdentificationAn Open Source Tool for Game Theoretic Health Data De-Identification
An Open Source Tool for Game Theoretic Health Data De-Identification
 
IRJET- A Review of Data Cleaning and its Current Approaches
IRJET- A Review of Data Cleaning and its Current ApproachesIRJET- A Review of Data Cleaning and its Current Approaches
IRJET- A Review of Data Cleaning and its Current Approaches
 
Niso usage data forum 2007
Niso usage data forum 2007Niso usage data forum 2007
Niso usage data forum 2007
 
Predicting students performance in final examination
Predicting students performance in final examinationPredicting students performance in final examination
Predicting students performance in final examination
 
Complex Systems Biology Informed Data Analysis and Machine Learning
Complex Systems Biology Informed Data Analysis and Machine LearningComplex Systems Biology Informed Data Analysis and Machine Learning
Complex Systems Biology Informed Data Analysis and Machine Learning
 
Bi4101343346
Bi4101343346Bi4101343346
Bi4101343346
 
Student Performance Data Mining Project Report
Student Performance Data Mining Project ReportStudent Performance Data Mining Project Report
Student Performance Data Mining Project Report
 
Analysis on Data Mining Techniques for Heart Disease Dataset
Analysis on Data Mining Techniques for Heart Disease DatasetAnalysis on Data Mining Techniques for Heart Disease Dataset
Analysis on Data Mining Techniques for Heart Disease Dataset
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
 
Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...
 
Effects of Highly Agreed Documents in Relevancy Prediction
Effects of Highly Agreed Documents in Relevancy PredictionEffects of Highly Agreed Documents in Relevancy Prediction
Effects of Highly Agreed Documents in Relevancy Prediction
 
Awash in eJournal Data: What It Is, Where It Is, and What Can Be Done With It.
Awash in eJournal Data: What It Is, Where It Is, and What Can Be Done With It.Awash in eJournal Data: What It Is, Where It Is, and What Can Be Done With It.
Awash in eJournal Data: What It Is, Where It Is, and What Can Be Done With It.
 
Smit-Scrap supplementary material-nfdp13
Smit-Scrap supplementary material-nfdp13Smit-Scrap supplementary material-nfdp13
Smit-Scrap supplementary material-nfdp13
 
Mapping to the Metabolomic Manifold
Mapping to the Metabolomic ManifoldMapping to the Metabolomic Manifold
Mapping to the Metabolomic Manifold
 
API-Centric Data Integration for Human Genomics Reference Databases: Achieve...
 API-Centric Data Integration for Human Genomics Reference Databases: Achieve... API-Centric Data Integration for Human Genomics Reference Databases: Achieve...
API-Centric Data Integration for Human Genomics Reference Databases: Achieve...
 
Connecting and synchronizing scientific knowledge
Connecting and synchronizing scientific knowledgeConnecting and synchronizing scientific knowledge
Connecting and synchronizing scientific knowledge
 
Pedro-Combining rapid data modelling and ontology services
Pedro-Combining rapid data modelling and ontology servicesPedro-Combining rapid data modelling and ontology services
Pedro-Combining rapid data modelling and ontology services
 

Destacado (8)

Meta-Information for Bio-Models
Meta-Information for Bio-ModelsMeta-Information for Bio-Models
Meta-Information for Bio-Models
 
Management of simulation studies in computational biology
Management of simulation studies in computational biologyManagement of simulation studies in computational biology
Management of simulation studies in computational biology
 
Reproducibility, dissemination, and management of modeling results
Reproducibility, dissemination,  and management of modeling resultsReproducibility, dissemination,  and management of modeling results
Reproducibility, dissemination, and management of modeling results
 
Senior Project
Senior ProjectSenior Project
Senior Project
 
Grayling wcf davos visual social media
Grayling wcf davos   visual social mediaGrayling wcf davos   visual social media
Grayling wcf davos visual social media
 
Modelling sample at SEMS from a graph perspective
Modelling sample at SEMS from a graph perspectiveModelling sample at SEMS from a graph perspective
Modelling sample at SEMS from a graph perspective
 
Web analytics for Marketing & Communications - Best Practice Methodologies fo...
Web analytics for Marketing & Communications - Best Practice Methodologies fo...Web analytics for Marketing & Communications - Best Practice Methodologies fo...
Web analytics for Marketing & Communications - Best Practice Methodologies fo...
 
e:Bio Kick-Off Meeting, SEMS
e:Bio Kick-Off Meeting, SEMSe:Bio Kick-Off Meeting, SEMS
e:Bio Kick-Off Meeting, SEMS
 

Similar a Possibilities for integrating model-related data in computational biology (DILS 2013)

PhD dissertation Luis Marco Ruiz
PhD dissertation Luis Marco RuizPhD dissertation Luis Marco Ruiz
PhD dissertation Luis Marco Ruiz
Luis Marco Ruiz
 
Poster: Very Open Data Project
Poster: Very Open Data ProjectPoster: Very Open Data Project
Poster: Very Open Data Project
Edward Blurock
 
FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)
Carole Goble
 
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Devansh16
 

Similar a Possibilities for integrating model-related data in computational biology (DILS 2013) (20)

Adding value to scientific results: COMBINE standards & guidelines for system...
Adding value to scientific results: COMBINE standards & guidelines for system...Adding value to scientific results: COMBINE standards & guidelines for system...
Adding value to scientific results: COMBINE standards & guidelines for system...
 
Model management for systems biology projects
Model management for systems biology projectsModel management for systems biology projects
Model management for systems biology projects
 
FAIR data management in biomedicine
FAIR data management  in biomedicineFAIR data management  in biomedicine
FAIR data management in biomedicine
 
COMBINE standards & tools: Getting model management right
COMBINE standards & tools: Getting model management rightCOMBINE standards & tools: Getting model management right
COMBINE standards & tools: Getting model management right
 
Large Graph Mining
Large Graph MiningLarge Graph Mining
Large Graph Mining
 
PhD dissertation Luis Marco Ruiz
PhD dissertation Luis Marco RuizPhD dissertation Luis Marco Ruiz
PhD dissertation Luis Marco Ruiz
 
Data and model management in Systems Biology
Data and model management in Systems BiologyData and model management in Systems Biology
Data and model management in Systems Biology
 
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTION
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTIONMULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTION
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTION
 
Poster: Very Open Data Project
Poster: Very Open Data ProjectPoster: Very Open Data Project
Poster: Very Open Data Project
 
Standards and tools for model management in biomedical research
Standards and tools for model management in biomedical researchStandards and tools for model management in biomedical research
Standards and tools for model management in biomedical research
 
OGD Metadata standards – The ENGAGE metadata architecture
OGD Metadata standards – The ENGAGE metadata architectureOGD Metadata standards – The ENGAGE metadata architecture
OGD Metadata standards – The ENGAGE metadata architecture
 
Minimal viable data reuse
Minimal viable data reuseMinimal viable data reuse
Minimal viable data reuse
 
Poster CBIS 2012
Poster CBIS 2012Poster CBIS 2012
Poster CBIS 2012
 
Big Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH PerspectiveBig Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH Perspective
 
Model repositories and standard formats for model reusability
Model repositories and standard formats for model reusabilityModel repositories and standard formats for model reusability
Model repositories and standard formats for model reusability
 
FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)
 
FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)
 
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
 
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUES
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUESPREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUES
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUES
 
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
 

Más de University Medicine Greifswald

Más de University Medicine Greifswald (17)

A guide to the COMBINE: Navigating through specifications, mailing lists and ...
A guide to the COMBINE: Navigating through specifications, mailing lists and ...A guide to the COMBINE: Navigating through specifications, mailing lists and ...
A guide to the COMBINE: Navigating through specifications, mailing lists and ...
 
When is a model FAIR – and why should we care?
When is a model FAIR – and why should we care?When is a model FAIR – and why should we care?
When is a model FAIR – and why should we care?
 
2019 07-04-model reuse-bonn
2019 07-04-model reuse-bonn2019 07-04-model reuse-bonn
2019 07-04-model reuse-bonn
 
Mehr Medizininformatik am Meer
Mehr Medizininformatik am MeerMehr Medizininformatik am Meer
Mehr Medizininformatik am Meer
 
Implementierung Graph-basierter Ansätze für das Management systembiologischer...
Implementierung Graph-basierter Ansätze für das Management systembiologischer...Implementierung Graph-basierter Ansätze für das Management systembiologischer...
Implementierung Graph-basierter Ansätze für das Management systembiologischer...
 
Using Neo4j technologies for the management of systems biology models
Using Neo4j technologies for the management of systems biology modelsUsing Neo4j technologies for the management of systems biology models
Using Neo4j technologies for the management of systems biology models
 
Model management tools for improved reproducibility in systems biology
Model management tools for improved reproducibility in systems biologyModel management tools for improved reproducibility in systems biology
Model management tools for improved reproducibility in systems biology
 
Short introduction to SED-ML
Short introduction to SED-MLShort introduction to SED-ML
Short introduction to SED-ML
 
Data and Model Management for Systems Biology
Data and Model Management  for Systems BiologyData and Model Management  for Systems Biology
Data and Model Management for Systems Biology
 
Identifying pattern in reaction networks of computational models
Identifying pattern in reaction networks of computational modelsIdentifying pattern in reaction networks of computational models
Identifying pattern in reaction networks of computational models
 
Extended support for standard graphical notations of biological networks in s...
Extended support for standard graphical notations of biological networks in s...Extended support for standard graphical notations of biological networks in s...
Extended support for standard graphical notations of biological networks in s...
 
Coming Soon: de.NBI and SBGN-ED @ SEMS
Coming Soon: de.NBI and SBGN-ED @ SEMSComing Soon: de.NBI and SBGN-ED @ SEMS
Coming Soon: de.NBI and SBGN-ED @ SEMS
 
Masymos: Finding hidden treasures in model repositories
Masymos: Finding hidden treasures in model repositoriesMasymos: Finding hidden treasures in model repositories
Masymos: Finding hidden treasures in model repositories
 
SEMS: Model search and ranked Retrieval (Ron Henkel)
SEMS: Model search and ranked Retrieval (Ron Henkel)SEMS: Model search and ranked Retrieval (Ron Henkel)
SEMS: Model search and ranked Retrieval (Ron Henkel)
 
Simulation experiment descriptions and management
Simulation experiment descriptions and managementSimulation experiment descriptions and management
Simulation experiment descriptions and management
 
Sems project overview
Sems project overviewSems project overview
Sems project overview
 
Bio-Model Meta-Information and SED-ML
Bio-Model Meta-Information and SED-MLBio-Model Meta-Information and SED-ML
Bio-Model Meta-Information and SED-ML
 

Último

Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 

Último (20)

SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 

Possibilities for integrating model-related data in computational biology (DILS 2013)

  • 1. Possibilities for Integrating Model-related Data in Computational Biology Databases in Life Sciences, Montreal, July 2013 Dagmar Waltemath, University of Rostock, Germany Nicolas Le Novère, Babraham Institute, UK Michel Dumontier, Carleton University, Canada Archive
  • 2. Introduction 13-07-12 Integrating model-related data 2 Fig.: DOI: 10.1038/35002125
  • 3. Introduction 13-07-12 Integrating model-related data 3 Fig.: DOI: 10.1038/35002125
  • 4. Introduction 13-07-12 Integrating model-related data 4 No and size of models time Fig.: DOI: 10.1038/35002125
  • 5. Introduction 13-07-12 Integrating model-related data 5 Fig.: DOI: 10.1038/35002125 model reuse – result reproducibility
  • 7. Introduction 1. How can we distribute models with all information necessary to reuse them (MIRIAM)? 2. How can we effectively manage different types of model-related data? 3. How can we link model-related data to the rest of the world? 13-07-12 Integrating model-related data 7
  • 8. 1. Distributing models Archive 13-07-12 Integrating model-related data 8 Frank Bergmann Nicolas Le Novère
  • 9. 1. Distributing models The COMBINE archive v0.1 • single “.zip” file • bundles models and model-related data • single file http://co.mbine.org/documents/archive 13-07-12 Integrating model-related data 9
  • 10. 1. A manifest file, "manifest.xml“, 2. all described files, 3. a metadata file, "metadata.*“, 4. remaining files. • All documents necessary for the description of a model and all associated data and procedures. • In the future: also references to documents 1. Distributing models <?xml version="1.0" encoding="utf-8"?> <omexManifest xmlns="http://identifiers.org/combine.specifications/omex-manifest"> <content location="./manifest.xml" format="http://identifiers.org/combine.specifications/omex- manifest"/> <content location="./model/model.xml" format="http://identifiers.org/combine.specifications/sbml"/> <content location="./simulation.xml" format="http://identifiers.org/combine.specifications/sedml"/> <content location="./article.pdf" format="application/pdf"/> <content location="./metadata.rdf" format="http://identifiers.org/combine.specifications/omex- metadata"/> </omexManifest> 13-07-12 Integrating model-related data 10
  • 11. 2. Managing models 13-07-12 11Integrating model-related data Ron Henkel
  • 12. 2. Managing models • Neo4J database • Model2graph mapping ( , ) • Rich relations http://biomodels.net/qualifiers • Links to annotations 13-07-12 Integrating model-related data 12 “Which models are annotated with ‚Adenosine tri-phosphate‘?“ “Which models contain reactions with ATP as reactant and ADP as product?” Document Model P E CR S SBO:0000268 uniprot:P07101 uniprot:Q03393 GO:0005737HGNC:8582 is isVersionOf is isEncodedBy is asProduct asReactant asModifier Fig.: Henkel et al. (2012) INFORMATIK 2012, Braunschweig
  • 13. Document Model P E CR S SBO:0000268 uniprot:P07101 uniprot:Q03393 GO:0005737HGNC:8582 is isVersionOf is isEncodedBy is asProduct asReactant asModifier 2. Managing models • Lucene-based ranked retrieval 13-07-12 Integrating model-related data 13 “Give me the best matching model published about the Cell Cycle and covering forms of cdc.“ Lucene query "cdc*" AND "Cell Cycle" http://www.ebi.ac.uk/biomodels-demo/ Henkel et al. (2010), Bioinformatics Fig.: Henkel et al. (2012) INFORMATIK 2012, Braunschweig
  • 14. 2. Managing models • Representing simulation descriptions • ... and other types of model-related data 13-07-12 Integrating model-related data 14 “Give me all possible simulations that show the dependency of the Cell Cycle on the concentration of cdc25.“ Fig.: Henkel et al. (2012) INFORMATIK 2012, Braunschweig
  • 15. 3. Integrating model data 13-07-12 Integrating model-related data 15
  • 16. 3. Integrating model data 13-07-12 Integrating model-related data 1616 At the heart of Linked Data for the Life Sciences • Free and open source • Based on Semantic Web standards • Billions of interlinked statements from dozens of conventional and high value datasets • Partnerships with EBI, NCBI, DBCLS, NCBO, OpenPHACTS, and commercial tool providers chemicals/drugs/formulations, genomes/genes/proteins, domains Interactions, complexes & pathways BioModels animal models and phenotypes Disease, genetic markers, treatments Terminologies & publications
  • 17. 3. Integrating model data # get all biochemical reactions in biomodels that are kinds of "protein catabolic process“, as defined by the gene ontology (in bioportal endpoint) SELECT ?go ?label count(distinct ?x) WHERE { ?go rdfs:label ?label . ?go rdfs:subClassOf ?tgo OPTION (TRANSITIVE) . ?tgo rdfs:label ?tlabel . FILTER regex(?tlabel, "^protein catabolic process") service <http://biomodels.bio2rdf.org/sparql> { ?x <http://bio2rdf.org/biopax_vocabulary:identical-to> ?go . ?x a <http://www.biopax.org/release/biopax-level3.owl#BiochemicalReaction> . } 13-07-12 Integrating model-related data 17 Gene Ontology Annotation Number of Reactions protein catabolic process [go:0030163] 51 cellular protein catabolic process [go:0044257] 26 modification-dependent protein catabolic process [go:0019941] 1 beta-amyloid formation [go:0034205] 1 “Give me all reactions in BioModels Database that represent protein catabolic processes. “
  • 18. Summary Approach Features Purpose COMBINE archive File bundle; • Easy access to all model-related data through one single file Shipping files Graph-DB (MORRE) Network of interrelated nodes • IR techniques easily applicable • No schema • Link models and simulations Managing existing model data BIO2RDF Semantic integration of knowledge • Automated reasoning • No schema • Linking into LOD Full integration 13-07-12 Integrating model-related data 18
  • 19. Thank you. 13-07-12 Integrating model-related data 19 http://co.mbine.org/events/COMBINE_2013