SlideShare una empresa de Scribd logo
1 de 91
Descargar para leer sin conexión
Toward whole-cell models for
science and engineering
Jonathan Karr
March 9, 2015
Positions available
research.mssm.edu/karr
karr@mssm.edu
Jayodita
Sanghvi
Markus
Covert
Jared
JacobsDerek
Macklin
Acknowledgements
Chemicals & fuels
Optimize yield
Minimize cost
Food
Optimize yield
Resist drought
Prevent infection
Medicine
Predict prognoses
Optimize therapy
Maximize quality of life
Central challenge: predict phenotype from genotype
Example: drug biosynthesis
Example: drug biosynthesis
Example: drug biosynthesis
Example: drug biosynthesis
Example: drug biosynthesis
Example: drug biosynthesis
Example: drug biosynthesis
Example: drug biosynthesis
Predicting phenotype from genotype requires “whole-cell” models
Integrated
Comprehensive Dynamic
Gene-complete
Whole-cell modeling principles
Biological data is readily available
?
Data Knowledge
Whole-cell model goals
Whole-cell modeling
A grand challenge of the 21st century
– Masaru Tomita
Biology urgently needs a theoretical basis to unify it
– Sydney Brenner
The ultimate test of understanding a simple cell,
more than being able to build one, would be to build
a computer model of the cell
– Clyde Hutchison
Single-cell variation
Microscopy
Transcription
RNA-seq
Protein expression
Mass-spec, Western blot
Modeling challenge: heterogeneous data
Modeling challenge: sparse data
MetabolicSignaling
Transcriptional regulatory
Modelling challenge: heterogenous networks
Time
Length
Replication
Growth
Transcription
Metabolism
Modeling challenge: multiple time and length scales
0
25
50
75
100
1970's
Coarse-grained
ODEs
1990's
FBA
2000's
Boolean
models
2008
iFBA
2012
Whole-cell
model
%annotatedgenes Whole-cell modeling progress
vv v v v
Predictive modeling methodologies
Granularity
Scope
ODE
SDE
FBA
Boolean
Bayesian
Gillespie
PDE
Whole-cell model
Uptake
FBA
Composition
Metabolism
FBA
Composition
Transcription
Stochastic binding
Gene expression
Translation
Stochastic binding
Gene expression
Replication
Chemical kinetics
DNA sequence
Solution: integrated models
0
25
50
75
100
1970's
Coarse-grained
ODEs
1990's
FBA
2000's
Boolean
models
2008
iFBA
2012
Whole-cell
model
%annotatedgenes Whole-cell modeling progress
vv
Model Validate
Engineer
Whole-cell modeling
Validate
Engineer
Model
Whole-cell modeling
Model construction
1. Define system
2. Define scope
3. Curate data
4. Choose representation
5. Identify parameters
6. Test predictions
E. coli M. genitalium
Genome 4700 kb 580 kb
Genes 4461 525
Size 2 μm × 0.5 μm 0.2-0.3 μm
1. Select a tractable model organism
Comparative genomics
Fraiser et. al, 1995
Genome-wide essentiality
Glass et. al, 1999
M. genitalium is well-characterized
Genomic-scale data
Kühner et. al, 2009
M. genitalium is well-characterized
Genomic transplantation
Lartigue et. al, 2009
Genomic synthesis
Gibson et. al, 2009
M. genitalium has unique engineering tools
2. Choose model scope
2. Choose model scope
• Explicitly represent each metabolite, gene, RNA, and protein species
• Explicitly model the function of every characterized gene product
• Account for the metabolic cost of every uncharacterized gene product
• Represent important, well-characterized molecules individually
3. Broadly curate experimental data
Karr et al., 2013
Uptake
FBA
Composition
Metabolism
FBA
Composition
Transcription
Stochastic events
Gene expression
Translation
Stochastic events
Gene expression
Replication
Chemical kinetics
DNA sequence
Sub-modelsStates
4. Select a flexible mathematical representation
Mass, shape
Metabolite, RNA,
protein counts
Mammalian host
Transcript, polypeptide
sequences
DNA polymerization,
proteins, modifications
FtsZ ring
1 s
Simulation algorithm
Uptake
Metabolism
Transcription
Translation
Replication
Cellstates
Cellstates
Uptake
Metabolism
Transcription
Translation
Replication
Cellstates
Uptake
Metabolism
Transcription
Translation
Replication
DNA RNA Protein Other
Replication
RepInitiation
Supercoiling
Condensation
Segregation
Damage
Repair
TransReg
Transcription
Processing
Modification
Aminoacylation
Degradation
Translation
ProcessingI
Translocation
ProcessingII
Folding
Modification
Complexation
Ribosome
TermOrg
Activation
Degradation
Metabolism
Shape
FtsZ
Cytokinesis
DNA
Replication
Rep Initiation
Supercoiling
Condensation
Segregation
Damage
Repair
Trans Reg
RNA
Transcription
Processing
Modification
Aminoacylation
Degradation l,
Protein
Translation
Processing I
Translocation
Processing II
Folding
Modification
Complexation
Ribosome
Term Org #
Activation
Degradation l, #
Other
Metabolism
Shape
FtsZ
Cytokinesis
Many resources are shared
Many resources are shared
1 s
Uptake
Metabolism
Transcription
Translation
Replication
Cellstates
Cellstates
Uptake
Metabolism
Transcription
Translation
Replication
Cellstates
Uptake
Metabolism
Transcription
Translation
Replication
Dividestate
Dividestate
Dividestate
Simulation algorithm
Mycoplasma model contains 28 sub-models
Karr et al., 2012
Course expertise
Modeling
• Frank Bergmann
• Marcus Krantz
• Wolfgang Liebermeister
• Pedro Mendes
• Chris Myers
• Pnar Pir
• Kieran Smallbone
Curation
• Vijayalakshmi Chelliah
Standards
• Michael Hucka
• Falk Schreiber
• Dagmar Waltemath
Karr et al., 2012
Example sub-model: Transcription
Example sub-model: Transcription
Karr et al., 2012
Free
Bound
Promoter
Bound
Active
1. Update RNA polymerase states
3. Bind RNA polymerase
2. Calculate promoter affinities
4. Elongate and terminate transcripts
AUGAUCCGUCUCUAAUGUCUAC
UTCAACGUGAGGUAAUAAAGUC
UCCACGAUGCUACUGUAUC
GCCUCAUACUGCGGAU
UUACGUAUCAGUGAUCAGUACU
Sequence
Transcript
HcrA SpxFur GntR LuxR
glpF dnaJ dnaK gntR trxB polC
Example sub-model: Transcription
•Compare the model’s predictions to data, 𝑦𝑖
•Define an error metric
∑ 𝐸 𝑓𝑖(𝑥; 𝑝) cells,time − 𝑦𝑖
2
•Numerically minimize error
• Gradient descent
• Scatter search
• Simulated annealing
• Genetic algorithms
5. Identify parameters
•Large parameter space
•Stochastic model
•Large computational cost
•Heterogeneous data
•Little dynamic, single cell data
5. Identify parameters
Model reduction enables parameter identification
3. Manually tune parameters
using full model
1. Reduce model
Time
ModelExperiment
Molecule
Molecule
2. Identify reduced model
parameters using
traditional methods
Software: wholecell.org
• ODE models
• COPASI: copasi.org
• V-Cell: nrcam.uchc.edu
• Systems biology toolbox
• Boolean models
• CellNOpt
• Flux-balance analysis
• openCOBRA: opencobra.sourceforge.net
• RAVEN
• Integrative models
• E-Cell: e-cell.org
• Whole-cell: wholecell.org
• Standards
• SBML: sbml.org
• CellML: cellml.org
Software
Cellular composition
Metabolite concentrations
mRNA, protein copy numbers
RNA synthesis rates
Karr et al., 2012
DNA binding protein collisions
Karr et al., 2012
DNA binding
Replication
Translation
60 m mol ATP / gDCW
80 a mol ATP / cell
Energy consumption
v
v
Karr et al., 2012
Energy consumption
Model Validate
Engineer
Whole-cell modeling
Validate
Matches training data
Cell mass, volume
Biomass composition
RNA, protein expression, half-lives
Superhelicity
Matches published data
Metabolite concentrations
DNA-bound protein density
Gene essentiality
Matches new data
Wild-type growth rate
Disruption strain growth rates
Matches theory
Mass conservation
Central dogma
Cell theory
Evolution
No obvious errors
Plot model predictions
Manually inspect data
Compare to known biology
Software stable
Simulation code is stable
Tests passing
Validate model against experiments and theory
Matches training data
Cell mass, volume
Biomass composition
RNA, protein expression, half-lives
Superhelicity
Matches published data
Metabolite concentrations
DNA-bound protein density
Gene essentiality
Matches new data
Wild-type growth rate
Disruption strain growth rates
Matches theory
Mass conservation
Central dogma
Cell theory
Evolution
No obvious errors
Plot model predictions
Manually inspect data
Compare to known biology
Software stable
Simulation code is stable
Tests passing
Validate model against experiments and theory
Matches training data
Cell mass, volume
Biomass composition
RNA, protein expression, half-lives
Superhelicity
Matches published data
Metabolite concentrations
DNA-bound protein density
Gene essentiality
Matches new data
Wild-type growth rate
Disruption strain growth rates
Matches theory
Mass conservation
Central dogma
Cell theory
Evolution
No obvious errors
Plot model predictions
Manually inspect data
Compare to known biology
Software stable
Simulation code is stable
Tests passing
Validate model against experiments and theory
Matches training data
Cell mass, volume
Biomass composition
RNA, protein expression, half-lives
Superhelicity
Matches published data
Metabolite concentrations
DNA-bound protein density
Gene essentiality
Matches new data
Wild-type growth rate
Disruption strain growth rates
Matches theory
Mass conservation
Central dogma
Cell theory
Evolution
No obvious errors
Plot model predictions
Manually inspect data
Compare to known biology
Software stable
Simulation code is stable
Tests passing
Validate model against experiments and theory
Model reproduces observed metabolomics
Karr et al., 2012
Matches training data
Cell mass, volume
Biomass composition
RNA, protein expression, half-lives
Superhelicity
Matches published data
Metabolite concentrations
DNA-bound protein density
Gene essentiality
Matches new data
Wild-type growth rate
Disruption strain growth rates
Matches theory
Mass conservation
Central dogma
Cell theory
Evolution
No obvious errors
Plot model predictions
Manually inspect data
Compare to known biology
Software stable
Simulation code is stable
Tests passing
Validate model against experiments and theory
Matches training data
Cell mass, volume
Biomass composition
RNA, protein expression, half-lives
Superhelicity
Matches published data
Metabolite concentrations
DNA-bound protein density
Gene essentiality
Matches new data
Wild-type growth rate
Disruption strain growth rates
Matches theory
Mass conservation
Central dogma
Cell theory
Evolution
No obvious errors
Plot model predictions
Manually inspect data
Compare to known biology
Software stable
Simulation code is stable
Tests passing
Model validated by experiments and theoryValidate model against experiments and theory
Matches training data
Cell mass, volume
Biomass composition
RNA, protein expression, half-lives
Superhelicity
Matches published data
Metabolite concentrations
DNA-bound protein density
Gene essentiality
Matches new data
Wild-type growth rate
Disruption strain growth rates
Matches theory
Mass conservation
Central dogma
Cell theory
Evolution
No obvious errors
Plot model predictions
Manually inspect data
Compare to known biology
Software stable
Simulation code is stable
Tests passing
Model validated by experiments and theoryValidate model against experiments and theory
Matches training data
Cell mass, volume
Biomass composition
RNA, protein expression, half-lives
Superhelicity
Matches published data
Metabolite concentrations
DNA-bound protein density
Gene essentiality
Matches new data
Wild-type growth rate
Disruption strain growth rates
Matches theory
Mass conservation
Central dogma
Cell theory
Evolution
No obvious errors
Plot model predictions
Manually inspect data
Compare to known biology
Software stable
Simulation code is stable
Tests passing
Validate model against experiments and theory
Colorimetric growth assay Model predictions
Model reproduces measured growth rate
Karr et al., 2012
Matches training data
Cell mass, volume
Biomass composition
RNA, protein expression, half-lives
Superhelicity
Matches published data
Metabolite concentrations
DNA-bound protein density
Gene essentiality
Matches new data
Wild-type growth rate
Disruption strain growth rates
Matches theory
Mass conservation
Central dogma
Cell theory
Evolution
No obvious errors
Plot model predictions
Manually inspect data
Compare to known biology
Software stable
Simulation code is stable
Tests passing
Model validated by experiments and theoryValidate model against experiments and theory
Matches training data
Cell mass, volume
Biomass composition
RNA, protein expression, half-lives
Superhelicity
Matches published data
Metabolite concentrations
DNA-bound protein density
Gene essentiality
Matches new data
Wild-type growth rate
Disruption strain growth rates
Matches theory
Mass conservation
Central dogma
Cell theory
Evolution
No obvious errors
Plot model predictions
Manually inspect data
Compare to known biology
Software stable
Simulation code is stable
Tests passing
Validate model against experiments and theory
Matches training data
Cell mass, volume
Biomass composition
RNA, protein expression, half-lives
Superhelicity
Matches published data
Metabolite concentrations
DNA-bound protein density
Gene essentiality
Matches new data
Wild-type growth rate
Disruption strain growth rates
Matches theory
Mass conservation
Central dogma
Cell theory
Evolution
No obvious errors
Plot model predictions
Manually inspect data
Compare to known biology
Software stable
Simulation code is stable
Tests passing
Validate model against experiments and theory
Matches training data
Cell mass, volume
Biomass composition
RNA, protein expression, half-lives
Superhelicity
Matches published data
Metabolite concentrations
DNA-bound protein density
Gene essentiality
Matches new data
Wild-type growth rate
Disruption strain growth rates
Matches theory
Mass conservation
Central dogma
Cell theory
Evolution
No obvious errors
Plot model predictions
Manually inspect data
Compare to known biology
Software stable
Simulation code is stable
Tests passing
Validate model against experiments and theory
Matches training data
Cell mass, volume
Biomass composition
RNA, protein expression, half-lives
Superhelicity
Matches published data
Metabolite concentrations
DNA-bound protein density
Gene essentiality
Matches new data
Wild-type growth rate
Disruption strain growth rates
Matches theory
Mass conservation
Central dogma
Cell theory
Evolution
No obvious errors
Plot model predictions
Manually inspect data
Compare to known biology
Software stable
Simulation code is stable
Tests passing
Validate model against experiments and theory
Matches training data
Cell mass, volume
Biomass composition
RNA, protein expression, half-lives
Superhelicity
Matches published data
Metabolite concentrations
DNA-bound protein density
Gene essentiality
Matches new data
Wild-type growth rate
Disruption strain growth rates
Matches theory
Mass conservation
Central dogma
Cell theory
Evolution
No obvious errors
Plot model predictions
Manually inspect data
Compare to known biology
Software stable
Simulation code is stable
Tests passing
Validate model against experiments and theory
Matches training data
Cell mass, volume
Biomass composition
RNA, protein expression, half-lives
Superhelicity
Matches published data
Metabolite concentrations
DNA-bound protein density
Gene essentiality
Matches new data
Wild-type growth rate
Disruption strain growth rates
Matches theory
Mass conservation
Central dogma
Cell theory
Evolution
No obvious errors
Plot model predictions
Manually inspect data
Compare to known biology
Software stable
Simulation code is stable
Tests passing
Validate model against experiments and theory
Matches training data
Cell mass, volume
Biomass composition
RNA, protein expression, half-lives
Superhelicity
Matches published data
Metabolite concentrations
DNA-bound protein density
Gene essentiality
Matches new data
Wild-type growth rate
Disruption strain growth rates
Matches theory
Mass conservation
Central dogma
Cell theory
Evolution
No obvious errors
Plot model predictions
Manually inspect data
Compare to known biology
Software stable
Simulation code is stable
Tests passing
Validate model against experiments and theory
Model
Engineer
Whole-cell modeling
Validate
Engineer
What genomic modifications maximize growth?
Time
Mass
Example: growth optimization
M. genitalium
M. mycoides
M. pneumoniae
Optimal gene expression
Optimal architecture retains robustnessOptimal gene expression retains robustness
Graphical design tool
Clotho, TinkerCell, GenoCAD
High-level language
BioCompiler
Biophysical model
Whole-cell models, SCHEMA, MD
Physical implementation
Gibson assembly, TALENs, ZFNs, CRISPR
Transplantation
Transplantation
(if (nutrients)
(grow)
(sporulate))
Directed evolutionMutate Select
Synthetic design landscape
Karr lab: expanding whole-cell models
M. pneumoniae
• Expand scope: regulation
• Improve accuracy: species-specific data
• Enable rational genome engineering
• Cell-based drug therapy
Human cancer
• Colorectal cancer
• Personalized models
• Precision medicine
Karr lab: solving important problems
Biological discovery
Synthetic networks
Biological design
Drug repositioning
Drug toxicity
Karr lab: developing modeling tools
Reconstruction: WholeCellKB
Parallelized simulator
Parameter estimation
Simulation storage: WholeCellSimDB
Visualization: WholeCellViz
wholecell.org
??
•How can we model more complex physiology?
• Transcriptional regulation
• Translational regulation
• Stochastic death, failure modes
• Higher-order meta-stable states
• Resource distribution
• Aging
• Evolution
• Populations
•How can we model more complex organisms?
• Larger bacteria
• Eukaryotes
• Multicellularity
• Humans
•How can we use models to direct engineering?
Open challenges
Whole-cell modeling course
1. Teach whole-cell modeling
• Model biological systems
• Construct dynamical models
• Integrate models
2. Improve implementation
• Reusable
• Standard
• Open
3. Improve methodology
Data
?
Whole-cell
models
Broadly predicts cell physiology
Integrates heterogeneous data and models
Guides bioengineering and medicine
Knowledge
• Karr JR et al. (2012) A Whole-Cell Computational Model Predicts Phenotype from
Genotype. Cell, 150, 389-401.
• Macklin DN, Ruggero NA, Covert MW (2014) The future of whole-cell
modeling. Curr Opin Biotechnol, 28C, 111-115.
• Shuler ML, Foley P, Atlas J (2012). Modeling a minimal cell. Methods Mol
Biol, 881, 573-610.
• Joyce AR, Palsson BØ (2007). Toward whole cell modeling and simulation:
comprehensive functional genomics through the constraint-based approach. Prog
Drug Res 64, 267-309.
• Tomita M (2001). Whole-cell simulation: a grand challenge of the 21st century.
Trends Biotechnol 6, 205-10.
• Surovtsev IV et al. (2009) Mathematical modeling of a minimal protocell with
coordinated growth and division. J Theor Biol, 260, 422-9.
Recommended reading
• Thiele I et al. (2009). Genome-scale reconstruction of Escherichia coli's
transcriptional and translational machinery: a knowledge base, its mathematical
formulation, and its functional characterization. PLoS Comput Biol. 5, e1000312.
• Orth JD, Thiele I, Palsson BØ (2010). What is flux balance analysis? Nat
Biotechnol, 28, 245-8.
• Covert MW et al (2008). Integrated Flux Balance Analysis Model of Escherichia coli.
Bioinformatics 24, 2044–50.
• Covert MW et al (2004). Integrating high-throughput and computational data
elucidates bacterial networks. Nature, 429, 92-6.
Recommended reading: FBA

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

genomic comparison
genomic comparison genomic comparison
genomic comparison
 
Genomics
GenomicsGenomics
Genomics
 
Interactomics, Integromics to Systems Biology: Next Animal Biotechnology Fron...
Interactomics, Integromics to Systems Biology: Next Animal Biotechnology Fron...Interactomics, Integromics to Systems Biology: Next Animal Biotechnology Fron...
Interactomics, Integromics to Systems Biology: Next Animal Biotechnology Fron...
 
3D cell cultures
3D cell cultures3D cell cultures
3D cell cultures
 
Presentation on marker genes
Presentation on marker genesPresentation on marker genes
Presentation on marker genes
 
Cell free protein expression system
Cell free protein expression systemCell free protein expression system
Cell free protein expression system
 
Applications of Proteomics in plant biotic stress
Applications of Proteomics in plant biotic stressApplications of Proteomics in plant biotic stress
Applications of Proteomics in plant biotic stress
 
Uses of Artificial Intelligence in Bioinformatics
Uses of Artificial Intelligence in BioinformaticsUses of Artificial Intelligence in Bioinformatics
Uses of Artificial Intelligence in Bioinformatics
 
Reverse genetics
Reverse geneticsReverse genetics
Reverse genetics
 
cohesive and blunt end ligation
cohesive and blunt end ligationcohesive and blunt end ligation
cohesive and blunt end ligation
 
An introduction to RNA-seq data analysis
An introduction to RNA-seq data analysisAn introduction to RNA-seq data analysis
An introduction to RNA-seq data analysis
 
Mutation detection
Mutation detectionMutation detection
Mutation detection
 
Crispr cas: A new tool of genome editing
Crispr cas: A new tool of genome editing Crispr cas: A new tool of genome editing
Crispr cas: A new tool of genome editing
 
SNP Genotyping Technologies
SNP Genotyping TechnologiesSNP Genotyping Technologies
SNP Genotyping Technologies
 
2D-PAGE & DIGE
2D-PAGE & DIGE2D-PAGE & DIGE
2D-PAGE & DIGE
 
CRISPR-Revolutionary Genome editing tools for Plants.....
CRISPR-Revolutionary Genome editing tools for Plants.....CRISPR-Revolutionary Genome editing tools for Plants.....
CRISPR-Revolutionary Genome editing tools for Plants.....
 
Alphafold2 - Protein Structural Bioinformatics After CASP14
Alphafold2 - Protein Structural Bioinformatics After CASP14Alphafold2 - Protein Structural Bioinformatics After CASP14
Alphafold2 - Protein Structural Bioinformatics After CASP14
 
ChIP-seq
ChIP-seqChIP-seq
ChIP-seq
 
Genome annotation 2013
Genome annotation 2013Genome annotation 2013
Genome annotation 2013
 
Molecular markr sscp
Molecular markr sscpMolecular markr sscp
Molecular markr sscp
 

Destacado (12)

Biotech2012 spring 3_metabolomics
Biotech2012 spring 3_metabolomicsBiotech2012 spring 3_metabolomics
Biotech2012 spring 3_metabolomics
 
The metabolism of glucose
The metabolism of glucoseThe metabolism of glucose
The metabolism of glucose
 
Hormonal regulation of proteins metabolism
Hormonal regulation of proteins metabolismHormonal regulation of proteins metabolism
Hormonal regulation of proteins metabolism
 
Digestion, absorption and metabolism
Digestion, absorption and metabolismDigestion, absorption and metabolism
Digestion, absorption and metabolism
 
Lipoproteins
LipoproteinsLipoproteins
Lipoproteins
 
Diabetes and Glucose Metabolism
Diabetes and Glucose MetabolismDiabetes and Glucose Metabolism
Diabetes and Glucose Metabolism
 
Metabolism protein
Metabolism proteinMetabolism protein
Metabolism protein
 
Metabolomics
MetabolomicsMetabolomics
Metabolomics
 
Lipoprotein metabolism - (transport of lipids in the Blood)
Lipoprotein metabolism - (transport of lipids in the Blood)Lipoprotein metabolism - (transport of lipids in the Blood)
Lipoprotein metabolism - (transport of lipids in the Blood)
 
Pentose Phosphate Pathway
Pentose Phosphate PathwayPentose Phosphate Pathway
Pentose Phosphate Pathway
 
Glycolysis- An over view
Glycolysis- An over viewGlycolysis- An over view
Glycolysis- An over view
 
Electron Transport Chain ETC
Electron Transport Chain ETCElectron Transport Chain ETC
Electron Transport Chain ETC
 

Similar a Introduction to whole-cell modeling lecture | Whole-cell modeling summer school | March 3-9, 2015 @ University of Rostock

4.4 genetic engineering & biotechnology
4.4 genetic engineering & biotechnology4.4 genetic engineering & biotechnology
4.4 genetic engineering & biotechnology
cartlidge
 
4.6 Biotechnology And Genetic Engineering
4.6 Biotechnology And Genetic Engineering4.6 Biotechnology And Genetic Engineering
4.6 Biotechnology And Genetic Engineering
Patricia Lopez
 
4.4 Biotechnology And Genetic Engineering
4.4 Biotechnology And Genetic Engineering4.4 Biotechnology And Genetic Engineering
4.4 Biotechnology And Genetic Engineering
Patricia Lopez
 
Waksman Student Scholars Program Poster
Waksman Student Scholars Program PosterWaksman Student Scholars Program Poster
Waksman Student Scholars Program Poster
Wesley Kwong
 

Similar a Introduction to whole-cell modeling lecture | Whole-cell modeling summer school | March 3-9, 2015 @ University of Rostock (20)

Biotechnology
BiotechnologyBiotechnology
Biotechnology
 
STRING - Prediction of a functional association network for the yeast mitocho...
STRING - Prediction of a functional association network for the yeast mitocho...STRING - Prediction of a functional association network for the yeast mitocho...
STRING - Prediction of a functional association network for the yeast mitocho...
 
4.4 genetic engineering & biotechnology
4.4 genetic engineering & biotechnology4.4 genetic engineering & biotechnology
4.4 genetic engineering & biotechnology
 
Protein Phrase
Protein PhraseProtein Phrase
Protein Phrase
 
Genetically engineered bacteria: chemical factories of the future?
Genetically engineered bacteria: chemical factories of the future?Genetically engineered bacteria: chemical factories of the future?
Genetically engineered bacteria: chemical factories of the future?
 
Proteomics and its applications in phytopathology
Proteomics and its applications in phytopathologyProteomics and its applications in phytopathology
Proteomics and its applications in phytopathology
 
JoB spike in manuscript 2014
JoB spike in manuscript 2014JoB spike in manuscript 2014
JoB spike in manuscript 2014
 
2013 10 23_dna_for_dummies_v_presented
2013 10 23_dna_for_dummies_v_presented2013 10 23_dna_for_dummies_v_presented
2013 10 23_dna_for_dummies_v_presented
 
Omics in plant breeding
Omics in plant breedingOmics in plant breeding
Omics in plant breeding
 
4.6 Biotechnology And Genetic Engineering
4.6 Biotechnology And Genetic Engineering4.6 Biotechnology And Genetic Engineering
4.6 Biotechnology And Genetic Engineering
 
4.4 Biotechnology And Genetic Engineering
4.4 Biotechnology And Genetic Engineering4.4 Biotechnology And Genetic Engineering
4.4 Biotechnology And Genetic Engineering
 
2014 11 03_bioinformatics_case_studies
2014 11 03_bioinformatics_case_studies2014 11 03_bioinformatics_case_studies
2014 11 03_bioinformatics_case_studies
 
Molecular folding
Molecular foldingMolecular folding
Molecular folding
 
Molecular folding
Molecular foldingMolecular folding
Molecular folding
 
Day 6 September 15th Chapters 3 and 5
Day 6 September 15th Chapters 3 and 5Day 6 September 15th Chapters 3 and 5
Day 6 September 15th Chapters 3 and 5
 
Waksman Student Scholars Program Poster
Waksman Student Scholars Program PosterWaksman Student Scholars Program Poster
Waksman Student Scholars Program Poster
 
Trends In Genomics
Trends In GenomicsTrends In Genomics
Trends In Genomics
 
Molecular techniques for pathology research - MDX .pdf
Molecular techniques for pathology research - MDX .pdfMolecular techniques for pathology research - MDX .pdf
Molecular techniques for pathology research - MDX .pdf
 
Epigeneticsand methylation
Epigeneticsand methylationEpigeneticsand methylation
Epigeneticsand methylation
 
Health proteins Appearance
Health proteins AppearanceHealth proteins Appearance
Health proteins Appearance
 

Último

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
MohamedFarag457087
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
Areesha Ahmad
 

Último (20)

Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
 
chemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdfchemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdf
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
Introduction to Viruses
Introduction to VirusesIntroduction to Viruses
Introduction to Viruses
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 

Introduction to whole-cell modeling lecture | Whole-cell modeling summer school | March 3-9, 2015 @ University of Rostock

  • 1. Toward whole-cell models for science and engineering Jonathan Karr March 9, 2015 Positions available research.mssm.edu/karr karr@mssm.edu
  • 3. Chemicals & fuels Optimize yield Minimize cost Food Optimize yield Resist drought Prevent infection Medicine Predict prognoses Optimize therapy Maximize quality of life Central challenge: predict phenotype from genotype
  • 11. Example: drug biosynthesis Predicting phenotype from genotype requires “whole-cell” models
  • 13. Biological data is readily available
  • 15. Whole-cell modeling A grand challenge of the 21st century – Masaru Tomita Biology urgently needs a theoretical basis to unify it – Sydney Brenner The ultimate test of understanding a simple cell, more than being able to build one, would be to build a computer model of the cell – Clyde Hutchison
  • 22. Uptake FBA Composition Metabolism FBA Composition Transcription Stochastic binding Gene expression Translation Stochastic binding Gene expression Replication Chemical kinetics DNA sequence Solution: integrated models
  • 26. Model construction 1. Define system 2. Define scope 3. Curate data 4. Choose representation 5. Identify parameters 6. Test predictions
  • 27. E. coli M. genitalium Genome 4700 kb 580 kb Genes 4461 525 Size 2 μm × 0.5 μm 0.2-0.3 μm 1. Select a tractable model organism
  • 28. Comparative genomics Fraiser et. al, 1995 Genome-wide essentiality Glass et. al, 1999 M. genitalium is well-characterized
  • 29. Genomic-scale data Kühner et. al, 2009 M. genitalium is well-characterized
  • 30. Genomic transplantation Lartigue et. al, 2009 Genomic synthesis Gibson et. al, 2009 M. genitalium has unique engineering tools
  • 32. 2. Choose model scope • Explicitly represent each metabolite, gene, RNA, and protein species • Explicitly model the function of every characterized gene product • Account for the metabolic cost of every uncharacterized gene product • Represent important, well-characterized molecules individually
  • 33. 3. Broadly curate experimental data Karr et al., 2013
  • 34. Uptake FBA Composition Metabolism FBA Composition Transcription Stochastic events Gene expression Translation Stochastic events Gene expression Replication Chemical kinetics DNA sequence Sub-modelsStates 4. Select a flexible mathematical representation Mass, shape Metabolite, RNA, protein counts Mammalian host Transcript, polypeptide sequences DNA polymerization, proteins, modifications FtsZ ring
  • 36. DNA RNA Protein Other Replication RepInitiation Supercoiling Condensation Segregation Damage Repair TransReg Transcription Processing Modification Aminoacylation Degradation Translation ProcessingI Translocation ProcessingII Folding Modification Complexation Ribosome TermOrg Activation Degradation Metabolism Shape FtsZ Cytokinesis DNA Replication Rep Initiation Supercoiling Condensation Segregation Damage Repair Trans Reg RNA Transcription Processing Modification Aminoacylation Degradation l, Protein Translation Processing I Translocation Processing II Folding Modification Complexation Ribosome Term Org # Activation Degradation l, # Other Metabolism Shape FtsZ Cytokinesis Many resources are shared
  • 39. Mycoplasma model contains 28 sub-models Karr et al., 2012
  • 40. Course expertise Modeling • Frank Bergmann • Marcus Krantz • Wolfgang Liebermeister • Pedro Mendes • Chris Myers • Pnar Pir • Kieran Smallbone Curation • Vijayalakshmi Chelliah Standards • Michael Hucka • Falk Schreiber • Dagmar Waltemath
  • 41. Karr et al., 2012 Example sub-model: Transcription
  • 43. Free Bound Promoter Bound Active 1. Update RNA polymerase states 3. Bind RNA polymerase 2. Calculate promoter affinities 4. Elongate and terminate transcripts AUGAUCCGUCUCUAAUGUCUAC UTCAACGUGAGGUAAUAAAGUC UCCACGAUGCUACUGUAUC GCCUCAUACUGCGGAU UUACGUAUCAGUGAUCAGUACU Sequence Transcript HcrA SpxFur GntR LuxR glpF dnaJ dnaK gntR trxB polC Example sub-model: Transcription
  • 44. •Compare the model’s predictions to data, 𝑦𝑖 •Define an error metric ∑ 𝐸 𝑓𝑖(𝑥; 𝑝) cells,time − 𝑦𝑖 2 •Numerically minimize error • Gradient descent • Scatter search • Simulated annealing • Genetic algorithms 5. Identify parameters
  • 45. •Large parameter space •Stochastic model •Large computational cost •Heterogeneous data •Little dynamic, single cell data 5. Identify parameters
  • 46. Model reduction enables parameter identification 3. Manually tune parameters using full model 1. Reduce model Time ModelExperiment Molecule Molecule 2. Identify reduced model parameters using traditional methods
  • 48. • ODE models • COPASI: copasi.org • V-Cell: nrcam.uchc.edu • Systems biology toolbox • Boolean models • CellNOpt • Flux-balance analysis • openCOBRA: opencobra.sourceforge.net • RAVEN • Integrative models • E-Cell: e-cell.org • Whole-cell: wholecell.org • Standards • SBML: sbml.org • CellML: cellml.org Software
  • 49.
  • 54. Karr et al., 2012 DNA binding protein collisions
  • 55. Karr et al., 2012 DNA binding
  • 58. 60 m mol ATP / gDCW 80 a mol ATP / cell Energy consumption
  • 59. v v Karr et al., 2012 Energy consumption
  • 61. Matches training data Cell mass, volume Biomass composition RNA, protein expression, half-lives Superhelicity Matches published data Metabolite concentrations DNA-bound protein density Gene essentiality Matches new data Wild-type growth rate Disruption strain growth rates Matches theory Mass conservation Central dogma Cell theory Evolution No obvious errors Plot model predictions Manually inspect data Compare to known biology Software stable Simulation code is stable Tests passing Validate model against experiments and theory
  • 62. Matches training data Cell mass, volume Biomass composition RNA, protein expression, half-lives Superhelicity Matches published data Metabolite concentrations DNA-bound protein density Gene essentiality Matches new data Wild-type growth rate Disruption strain growth rates Matches theory Mass conservation Central dogma Cell theory Evolution No obvious errors Plot model predictions Manually inspect data Compare to known biology Software stable Simulation code is stable Tests passing Validate model against experiments and theory
  • 63. Matches training data Cell mass, volume Biomass composition RNA, protein expression, half-lives Superhelicity Matches published data Metabolite concentrations DNA-bound protein density Gene essentiality Matches new data Wild-type growth rate Disruption strain growth rates Matches theory Mass conservation Central dogma Cell theory Evolution No obvious errors Plot model predictions Manually inspect data Compare to known biology Software stable Simulation code is stable Tests passing Validate model against experiments and theory
  • 64. Matches training data Cell mass, volume Biomass composition RNA, protein expression, half-lives Superhelicity Matches published data Metabolite concentrations DNA-bound protein density Gene essentiality Matches new data Wild-type growth rate Disruption strain growth rates Matches theory Mass conservation Central dogma Cell theory Evolution No obvious errors Plot model predictions Manually inspect data Compare to known biology Software stable Simulation code is stable Tests passing Validate model against experiments and theory
  • 65. Model reproduces observed metabolomics Karr et al., 2012
  • 66. Matches training data Cell mass, volume Biomass composition RNA, protein expression, half-lives Superhelicity Matches published data Metabolite concentrations DNA-bound protein density Gene essentiality Matches new data Wild-type growth rate Disruption strain growth rates Matches theory Mass conservation Central dogma Cell theory Evolution No obvious errors Plot model predictions Manually inspect data Compare to known biology Software stable Simulation code is stable Tests passing Validate model against experiments and theory
  • 67. Matches training data Cell mass, volume Biomass composition RNA, protein expression, half-lives Superhelicity Matches published data Metabolite concentrations DNA-bound protein density Gene essentiality Matches new data Wild-type growth rate Disruption strain growth rates Matches theory Mass conservation Central dogma Cell theory Evolution No obvious errors Plot model predictions Manually inspect data Compare to known biology Software stable Simulation code is stable Tests passing Model validated by experiments and theoryValidate model against experiments and theory
  • 68. Matches training data Cell mass, volume Biomass composition RNA, protein expression, half-lives Superhelicity Matches published data Metabolite concentrations DNA-bound protein density Gene essentiality Matches new data Wild-type growth rate Disruption strain growth rates Matches theory Mass conservation Central dogma Cell theory Evolution No obvious errors Plot model predictions Manually inspect data Compare to known biology Software stable Simulation code is stable Tests passing Model validated by experiments and theoryValidate model against experiments and theory
  • 69. Matches training data Cell mass, volume Biomass composition RNA, protein expression, half-lives Superhelicity Matches published data Metabolite concentrations DNA-bound protein density Gene essentiality Matches new data Wild-type growth rate Disruption strain growth rates Matches theory Mass conservation Central dogma Cell theory Evolution No obvious errors Plot model predictions Manually inspect data Compare to known biology Software stable Simulation code is stable Tests passing Validate model against experiments and theory
  • 70. Colorimetric growth assay Model predictions Model reproduces measured growth rate Karr et al., 2012
  • 71. Matches training data Cell mass, volume Biomass composition RNA, protein expression, half-lives Superhelicity Matches published data Metabolite concentrations DNA-bound protein density Gene essentiality Matches new data Wild-type growth rate Disruption strain growth rates Matches theory Mass conservation Central dogma Cell theory Evolution No obvious errors Plot model predictions Manually inspect data Compare to known biology Software stable Simulation code is stable Tests passing Model validated by experiments and theoryValidate model against experiments and theory
  • 72. Matches training data Cell mass, volume Biomass composition RNA, protein expression, half-lives Superhelicity Matches published data Metabolite concentrations DNA-bound protein density Gene essentiality Matches new data Wild-type growth rate Disruption strain growth rates Matches theory Mass conservation Central dogma Cell theory Evolution No obvious errors Plot model predictions Manually inspect data Compare to known biology Software stable Simulation code is stable Tests passing Validate model against experiments and theory
  • 73. Matches training data Cell mass, volume Biomass composition RNA, protein expression, half-lives Superhelicity Matches published data Metabolite concentrations DNA-bound protein density Gene essentiality Matches new data Wild-type growth rate Disruption strain growth rates Matches theory Mass conservation Central dogma Cell theory Evolution No obvious errors Plot model predictions Manually inspect data Compare to known biology Software stable Simulation code is stable Tests passing Validate model against experiments and theory
  • 74. Matches training data Cell mass, volume Biomass composition RNA, protein expression, half-lives Superhelicity Matches published data Metabolite concentrations DNA-bound protein density Gene essentiality Matches new data Wild-type growth rate Disruption strain growth rates Matches theory Mass conservation Central dogma Cell theory Evolution No obvious errors Plot model predictions Manually inspect data Compare to known biology Software stable Simulation code is stable Tests passing Validate model against experiments and theory
  • 75. Matches training data Cell mass, volume Biomass composition RNA, protein expression, half-lives Superhelicity Matches published data Metabolite concentrations DNA-bound protein density Gene essentiality Matches new data Wild-type growth rate Disruption strain growth rates Matches theory Mass conservation Central dogma Cell theory Evolution No obvious errors Plot model predictions Manually inspect data Compare to known biology Software stable Simulation code is stable Tests passing Validate model against experiments and theory
  • 76. Matches training data Cell mass, volume Biomass composition RNA, protein expression, half-lives Superhelicity Matches published data Metabolite concentrations DNA-bound protein density Gene essentiality Matches new data Wild-type growth rate Disruption strain growth rates Matches theory Mass conservation Central dogma Cell theory Evolution No obvious errors Plot model predictions Manually inspect data Compare to known biology Software stable Simulation code is stable Tests passing Validate model against experiments and theory
  • 77. Matches training data Cell mass, volume Biomass composition RNA, protein expression, half-lives Superhelicity Matches published data Metabolite concentrations DNA-bound protein density Gene essentiality Matches new data Wild-type growth rate Disruption strain growth rates Matches theory Mass conservation Central dogma Cell theory Evolution No obvious errors Plot model predictions Manually inspect data Compare to known biology Software stable Simulation code is stable Tests passing Validate model against experiments and theory
  • 78. Matches training data Cell mass, volume Biomass composition RNA, protein expression, half-lives Superhelicity Matches published data Metabolite concentrations DNA-bound protein density Gene essentiality Matches new data Wild-type growth rate Disruption strain growth rates Matches theory Mass conservation Central dogma Cell theory Evolution No obvious errors Plot model predictions Manually inspect data Compare to known biology Software stable Simulation code is stable Tests passing Validate model against experiments and theory
  • 80. What genomic modifications maximize growth? Time Mass Example: growth optimization
  • 81. M. genitalium M. mycoides M. pneumoniae Optimal gene expression
  • 82. Optimal architecture retains robustnessOptimal gene expression retains robustness
  • 83. Graphical design tool Clotho, TinkerCell, GenoCAD High-level language BioCompiler Biophysical model Whole-cell models, SCHEMA, MD Physical implementation Gibson assembly, TALENs, ZFNs, CRISPR Transplantation Transplantation (if (nutrients) (grow) (sporulate)) Directed evolutionMutate Select Synthetic design landscape
  • 84. Karr lab: expanding whole-cell models M. pneumoniae • Expand scope: regulation • Improve accuracy: species-specific data • Enable rational genome engineering • Cell-based drug therapy Human cancer • Colorectal cancer • Personalized models • Precision medicine
  • 85. Karr lab: solving important problems Biological discovery Synthetic networks Biological design Drug repositioning Drug toxicity
  • 86. Karr lab: developing modeling tools Reconstruction: WholeCellKB Parallelized simulator Parameter estimation Simulation storage: WholeCellSimDB Visualization: WholeCellViz wholecell.org ??
  • 87. •How can we model more complex physiology? • Transcriptional regulation • Translational regulation • Stochastic death, failure modes • Higher-order meta-stable states • Resource distribution • Aging • Evolution • Populations •How can we model more complex organisms? • Larger bacteria • Eukaryotes • Multicellularity • Humans •How can we use models to direct engineering? Open challenges
  • 88. Whole-cell modeling course 1. Teach whole-cell modeling • Model biological systems • Construct dynamical models • Integrate models 2. Improve implementation • Reusable • Standard • Open 3. Improve methodology
  • 89. Data ? Whole-cell models Broadly predicts cell physiology Integrates heterogeneous data and models Guides bioengineering and medicine Knowledge
  • 90. • Karr JR et al. (2012) A Whole-Cell Computational Model Predicts Phenotype from Genotype. Cell, 150, 389-401. • Macklin DN, Ruggero NA, Covert MW (2014) The future of whole-cell modeling. Curr Opin Biotechnol, 28C, 111-115. • Shuler ML, Foley P, Atlas J (2012). Modeling a minimal cell. Methods Mol Biol, 881, 573-610. • Joyce AR, Palsson BØ (2007). Toward whole cell modeling and simulation: comprehensive functional genomics through the constraint-based approach. Prog Drug Res 64, 267-309. • Tomita M (2001). Whole-cell simulation: a grand challenge of the 21st century. Trends Biotechnol 6, 205-10. • Surovtsev IV et al. (2009) Mathematical modeling of a minimal protocell with coordinated growth and division. J Theor Biol, 260, 422-9. Recommended reading
  • 91. • Thiele I et al. (2009). Genome-scale reconstruction of Escherichia coli's transcriptional and translational machinery: a knowledge base, its mathematical formulation, and its functional characterization. PLoS Comput Biol. 5, e1000312. • Orth JD, Thiele I, Palsson BØ (2010). What is flux balance analysis? Nat Biotechnol, 28, 245-8. • Covert MW et al (2008). Integrated Flux Balance Analysis Model of Escherichia coli. Bioinformatics 24, 2044–50. • Covert MW et al (2004). Integrating high-throughput and computational data elucidates bacterial networks. Nature, 429, 92-6. Recommended reading: FBA

Notas del editor

  1. Toward this goal we have built a gene-complete, computational model of a single bacterial cell which Integrates all cellular processes into a single computational model, providing a unified understanding of cellular physiology, which Predicts the dynamics of every molecule and process, Can be used to guide experimental design and inform data analysis Hopefully in the future, in concert with emerging genome-scale DNA synthesis techniques, can be used to guide rational engineering of biological systems
  2. Protein expression: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2778844/figure/F4/
  3. A central challenge in cell biology is to understand the molecular basis of cellular behaviors, that is: to understand how cellular behaviors arise from the interactions of biomolecules, to understand how cellular processes are controlled and coordinated at the molecular level, to understand how cells allocate limited molecular resources for growth and maintainence, and more.
  4. Lack of data Time scales Heterogeneity of experimental and computational methods across cell biology subfields Lack of correlated measurements (cell properties measured independtely Measurements are bulk (not single cells) (also obscures correlations, causation)
  5. Lack of data Time scales Heterogeneity of experimental and computational methods across cell biology subfields Lack of correlated measurements (cell properties measured independtely Measurements are bulk (not single cells) (also obscures correlations, causation)
  6. Havugimana et al., 2012; Yan et al., 2010; Sachs et al., 2005; Orth et al., 2010
  7. Lack of data Time scales Heterogeneity of experimental and computational methods across cell biology subfields Lack of correlated measurements (cell properties measured independtely Measurements are bulk (not single cells) (also obscures correlations, causation)
  8. M. genitalium is a tractable model organism Modular architecture integrates 28 processes Model broadly predicts single-cell physiology Model reproduces previously observed data Model provides insights into complex phenotypes
  9. M. genitalium is a tractable model organism Modular architecture integrates 28 processes Model broadly predicts single-cell physiology Model reproduces previously observed data Model provides insights into complex phenotypes
  10. http://www.pbs.org/wgbh/nova/sciencenow/dispatches/images/050707-mgenitalium.jpg Tractable genome 75% annotated Little overlapping function Genomic synthesis Genome-scale datasets
  11. To maximize the tractability of our model we made a few simplifying assumptions. First, we chose to model Mycoplasma genitalium, the smallest-known freely living organism which is believed to have evolved by a massive degenerative evolution from gram positive bacteria to tractably sized genome of 580 kb containing just 525 genes, 75% of which are functionally annotated.
  12. To maximize the tractability of our model we made a few simplifying assumptions. First, we chose to model Mycoplasma genitalium, the smallest-known freely living organism which is believed to have evolved by a massive degenerative evolution from gram positive bacteria to tractably sized genome of 580 kb containing just 525 genes, 75% of which are functionally annotated.
  13. Second, recognizing the modularity of biology and the separation of time scales of biological processes, we built our model by composition enabling us at short time scales to model each cellular process independently, using the most appropriate mathematical representation and experimental data for each cellular process. - Separation of time scales - Choose appropriate representations and parameterizations - Creatively decouple representations Computational reconcile and decouple parameters A module is: Independent physiologic function Independent enzyme complement Internal time scale faster than time scales of interactions with other modules Factorization of state space and transfer functions Module methods: Time evolution Interface to core simulation -- references to metabolites, enyzmes; RNAs, proteins, etc. Resource (energy) requirement during simulation Initial conditions Fit growth rate Expected resource requirements (contribution metabolism objective) Pre-processing, memory allocation Options, parameters, indices, pre-processed data, predicted time courses Plotting, printing, saving, loading
  14. Our model is then executed by first executing 28 individual models of cellular processes at a 1 s time scale, second integrating the inputs and outputs of the individual models, for example, and finally repeating this process tens of thousands of times across the length of the Mycoplasma genitalium cell cycle.
  15. Our model is then executed by first executing 28 individual models of cellular processes at a 1 s time scale, second integrating the inputs and outputs of the individual models, for example, and finally repeating this process tens of thousands of times across the length of the Mycoplasma genitalium cell cycle.
  16. Our 27 models are each based on extensive curation of the literature and are implemented separately using different mathematical representations. For example, we constructed the replication module by first considering the molecular mechanism of replication, interactions between replication and DNA, the enzymes involved in catalyzing replication, and the metabolic resources required for replication. This motivated us to build a model of replication which would account for: The formation of the replication bubble and DnaA complex disassembly at the oriC DNA unwinding, replication bubble progression, and leading strand polymerization toward the terC Discontinuous lagging strand primer and DNA polymerization Okazaki fragment ligation Interactions between DNA polymerase and other DNA-bound proteins and DNA damage Next we built data structures which enable us to represent the specific location and size of all of the replication machinery and other DNA-bound proteins and DNA modifications and damages at each point in the cell cycle. Finally we modeled the initiation, progression, and termination of replication as a set of rules governing the evolution of the state of the DNA, proteins, and metabolites.
  17. 50 million ATP / cell 80 atto mol ATP / cell 60 m mol ATP / gDCW
  18. M. genitalium is a tractable model organism Modular architecture integrates 28 processes Model broadly predicts single-cell physiology Model reproduces previously observed data Model provides insights into complex phenotypes
  19. 3151 simulations (192 wt, 2959 deletions)
  20. 3151 simulations (192 wt, 2959 deletions)
  21. 3151 simulations (192 wt, 2959 deletions)
  22. 3151 simulations (192 wt, 2959 deletions)
  23. 3151 simulations (192 wt, 2959 deletions)
  24. 3151 simulations (192 wt, 2959 deletions)
  25. 3151 simulations (192 wt, 2959 deletions)
  26. 3151 simulations (192 wt, 2959 deletions)
  27. 3151 simulations (192 wt, 2959 deletions)
  28. 3151 simulations (192 wt, 2959 deletions)
  29. 3151 simulations (192 wt, 2959 deletions)
  30. 3151 simulations (192 wt, 2959 deletions)
  31. 3151 simulations (192 wt, 2959 deletions)
  32. 3151 simulations (192 wt, 2959 deletions)
  33. 3151 simulations (192 wt, 2959 deletions)
  34. 3151 simulations (192 wt, 2959 deletions)
  35. M. genitalium is a tractable model organism Modular architecture integrates 28 processes Model broadly predicts single-cell physiology Model reproduces previously observed data Model provides insights into complex phenotypes
  36. A central challenge in cell biology is to understand the molecular basis of cellular behaviors, that is: to understand how cellular behaviors arise from the interactions of biomolecules, to understand how cellular processes are controlled and coordinated at the molecular level, to understand how cells allocate limited molecular resources for growth and maintainence, and more.
  37. A central challenge in cell biology is to understand the molecular basis of cellular behaviors, that is: to understand how cellular behaviors arise from the interactions of biomolecules, to understand how cellular processes are controlled and coordinated at the molecular level, to understand how cells allocate limited molecular resources for growth and maintainence, and more.