Introduction to whole-cell modeling lecture | Whole-cell modeling summer school | March 3-9, 2015 @ University of Rostock

Toward whole-cell models for
science and engineering
Jonathan Karr
March 9, 2015
Positions available
research.mssm.edu/karr
karr@mssm.edu

Jayodita
Sanghvi
Markus
Covert
Jared
JacobsDerek
Macklin
Acknowledgements

Chemicals & fuels
Optimize yield
Minimize cost
Food
Optimize yield
Resist drought
Prevent infection
Medicine
Predict prognoses
Optimize therapy
Maximize quality of life
Central challenge: predict phenotype from genotype

Example: drug biosynthesis
Predicting phenotype from genotype requires “whole-cell” models

Integrated
Comprehensive Dynamic
Gene-complete
Whole-cell modeling principles

Biological data is readily available

?
Data Knowledge
Whole-cell model goals

Whole-cell modeling
A grand challenge of the 21st century
– Masaru Tomita
Biology urgently needs a theoretical basis to unify it
– Sydney Brenner
The ultimate test of understanding a simple cell,
more than being able to build one, would be to build
a computer model of the cell
– Clyde Hutchison

Single-cell variation
Microscopy
Transcription
RNA-seq
Protein expression
Mass-spec, Western blot
Modeling challenge: heterogeneous data

Modeling challenge: sparse data

MetabolicSignaling
Transcriptional regulatory
Modelling challenge: heterogenous networks

Time
Length
Replication
Growth
Transcription
Metabolism
Modeling challenge: multiple time and length scales

0
25
50
75
100
1970's
Coarse-grained
ODEs
1990's
FBA
2000's
Boolean
models
2008
iFBA
2012
Whole-cell
model
%annotatedgenes Whole-cell modeling progress
vv v v v

Predictive modeling methodologies
Granularity
Scope
ODE
SDE
FBA
Boolean
Bayesian
Gillespie
PDE
Whole-cell model

Uptake
FBA
Composition
Metabolism
FBA
Composition
Transcription
Stochastic binding
Gene expression
Translation
Stochastic binding
Gene expression
Replication
Chemical kinetics
DNA sequence
Solution: integrated models

0
25
50
75
100
1970's
Coarse-grained
ODEs
1990's
FBA
2000's
Boolean
models
2008
iFBA
2012
Whole-cell
model
%annotatedgenes Whole-cell modeling progress
vv

Model Validate
Engineer
Whole-cell modeling

Validate
Engineer
Model
Whole-cell modeling

Model construction
1. Define system
2. Define scope
3. Curate data
4. Choose representation
5. Identify parameters
6. Test predictions

E. coli M. genitalium
Genome 4700 kb 580 kb
Genes 4461 525
Size 2 μm × 0.5 μm 0.2-0.3 μm
1. Select a tractable model organism

Comparative genomics
Fraiser et. al, 1995
Genome-wide essentiality
Glass et. al, 1999
M. genitalium is well-characterized

Genomic-scale data
Kühner et. al, 2009
M. genitalium is well-characterized

Genomic transplantation
Lartigue et. al, 2009
Genomic synthesis
Gibson et. al, 2009
M. genitalium has unique engineering tools

2. Choose model scope
• Explicitly represent each metabolite, gene, RNA, and protein species
• Explicitly model the function of every characterized gene product
• Account for the metabolic cost of every uncharacterized gene product
• Represent important, well-characterized molecules individually

3. Broadly curate experimental data
Karr et al., 2013

Uptake
FBA
Composition
Metabolism
FBA
Composition
Transcription
Stochastic events
Gene expression
Translation
Stochastic events
Gene expression
Replication
Chemical kinetics
DNA sequence
Sub-modelsStates
4. Select a flexible mathematical representation
Mass, shape
Metabolite, RNA,
protein counts
Mammalian host
Transcript, polypeptide
sequences
DNA polymerization,
proteins, modifications
FtsZ ring

1 s
Simulation algorithm
Uptake
Metabolism
Transcription
Translation
Replication
Cellstates
Cellstates
Uptake
Metabolism
Transcription
Translation
Replication
Cellstates
Uptake
Metabolism
Transcription
Translation
Replication

DNA RNA Protein Other
Replication
RepInitiation
Supercoiling
Condensation
Segregation
Damage
Repair
TransReg
Transcription
Processing
Modification
Aminoacylation
Degradation
Translation
ProcessingI
Translocation
ProcessingII
Folding
Modification
Complexation
Ribosome
TermOrg
Activation
Degradation
Metabolism
Shape
FtsZ
Cytokinesis
DNA
Replication
Rep Initiation
Supercoiling
Condensation
Segregation
Damage
Repair
Trans Reg
RNA
Transcription
Processing
Modification
Aminoacylation
Degradation l,
Protein
Translation
Processing I
Translocation
Processing II
Folding
Modification
Complexation
Ribosome
Term Org #
Activation
Degradation l, #
Other
Metabolism
Shape
FtsZ
Cytokinesis
Many resources are shared

1 s
Uptake
Metabolism
Transcription
Translation
Replication
Cellstates
Cellstates
Uptake
Metabolism
Transcription
Translation
Replication
Cellstates
Uptake
Metabolism
Transcription
Translation
Replication
Dividestate
Dividestate
Dividestate
Simulation algorithm

Mycoplasma model contains 28 sub-models
Karr et al., 2012

Course expertise
Modeling
• Frank Bergmann
• Marcus Krantz
• Wolfgang Liebermeister
• Pedro Mendes
• Chris Myers
• Pnar Pir
• Kieran Smallbone
Curation
• Vijayalakshmi Chelliah
Standards
• Michael Hucka
• Falk Schreiber
• Dagmar Waltemath

Karr et al., 2012
Example sub-model: Transcription

Karr et al., 2012

Free
Bound
Promoter
Bound
Active
1. Update RNA polymerase states
3. Bind RNA polymerase
2. Calculate promoter affinities
4. Elongate and terminate transcripts
AUGAUCCGUCUCUAAUGUCUAC
UTCAACGUGAGGUAAUAAAGUC
UCCACGAUGCUACUGUAUC
GCCUCAUACUGCGGAU
UUACGUAUCAGUGAUCAGUACU
Sequence
Transcript
HcrA SpxFur GntR LuxR
glpF dnaJ dnaK gntR trxB polC

•Compare the model’s predictions to data, 𝑦𝑖
•Define an error metric
∑ 𝐸 𝑓𝑖(𝑥; 𝑝) cells,time − 𝑦𝑖
2
•Numerically minimize error
• Gradient descent
• Scatter search
• Simulated annealing
• Genetic algorithms

•Large parameter space
•Stochastic model
•Large computational cost
•Heterogeneous data
•Little dynamic, single cell data

Model reduction enables parameter identification
3. Manually tune parameters
using full model
1. Reduce model
Time
ModelExperiment
Molecule
Molecule
2. Identify reduced model
parameters using
traditional methods

• ODE models
• COPASI: copasi.org
• V-Cell: nrcam.uchc.edu
• Systems biology toolbox
• Boolean models
• CellNOpt
• Flux-balance analysis
• openCOBRA: opencobra.sourceforge.net
• RAVEN
• Integrative models
• E-Cell: e-cell.org
• Whole-cell: wholecell.org
• Standards
• SBML: sbml.org
• CellML: cellml.org
Software

Karr et al., 2012
DNA binding protein collisions

60 m mol ATP / gDCW
80 a mol ATP / cell
Energy consumption

v
v
Karr et al., 2012
Energy consumption

Model Validate
Engineer
Whole-cell modeling
Validate

Matches training data
Cell mass, volume
Biomass composition
RNA, protein expression, half-lives
Superhelicity
Matches published data
Metabolite concentrations
DNA-bound protein density
Gene essentiality
Matches new data
Wild-type growth rate
Disruption strain growth rates
Matches theory
Mass conservation
Central dogma
Cell theory
Evolution
No obvious errors
Plot model predictions
Manually inspect data
Compare to known biology
Software stable
Simulation code is stable
Tests passing
Validate model against experiments and theory

Matches training data
Cell mass, volume
Biomass composition
RNA, protein expression, half-lives
Superhelicity
Matches published data
Metabolite concentrations
Matches new data
Matches theory
Central dogma
Cell theory
Evolution
Software stable
Tests passing

Model reproduces observed metabolomics
Karr et al., 2012

Superhelicity
Matches published data
Metabolite concentrations
Matches new data
Matches theory
Central dogma
Cell theory
Evolution
Software stable
Tests passing
Model validated by experiments and theoryValidate model against experiments and theory

Superhelicity
DNA-bound protein density
Gene essentiality
Matches new data
Matches theory
Central dogma
Cell theory
Evolution
Software stable
Tests passing
Model validated by experiments and theoryValidate model against experiments and theory

Superhelicity
Matches new data
Matches theory
Central dogma
Cell theory
Evolution
Software stable
Tests passing

Colorimetric growth assay Model predictions
Model reproduces measured growth rate
Karr et al., 2012

Superhelicity
Matches new data
Wild-type growth rate
Disruption strain growth rates
Matches theory
Central dogma
Cell theory
Evolution
Software stable
Tests passing

Superhelicity
Matches new data
Matches theory
Mass conservation
Central dogma
Cell theory
Evolution
Software stable
Tests passing

Superhelicity
Matches new data
Matches theory
Central dogma
Cell theory
Evolution
No obvious errors
Plot model predictions
Manually inspect data
Compare to known biology
Software stable
Tests passing

Superhelicity
Matches new data
Matches theory
Central dogma
Cell theory
Evolution
No obvious errors
Plot model predictions
Manually inspect data
Compare to known biology
Software stable
Simulation code is stable
Tests passing

Model
Engineer
Whole-cell modeling
Validate
Engineer

What genomic modifications maximize growth?
Time
Mass
Example: growth optimization

M. genitalium
M. mycoides
M. pneumoniae
Optimal gene expression

Optimal architecture retains robustnessOptimal gene expression retains robustness

Graphical design tool
Clotho, TinkerCell, GenoCAD
High-level language
BioCompiler
Biophysical model
Whole-cell models, SCHEMA, MD
Physical implementation
Gibson assembly, TALENs, ZFNs, CRISPR
Transplantation
Transplantation
(if (nutrients)
(grow)
(sporulate))
Directed evolutionMutate Select
Synthetic design landscape

Karr lab: expanding whole-cell models
M. pneumoniae
• Expand scope: regulation
• Improve accuracy: species-specific data
• Enable rational genome engineering
• Cell-based drug therapy
Human cancer
• Colorectal cancer
• Personalized models
• Precision medicine

Karr lab: solving important problems
Biological discovery
Synthetic networks
Biological design
Drug repositioning
Drug toxicity

Karr lab: developing modeling tools
Reconstruction: WholeCellKB
Parallelized simulator
Parameter estimation
Simulation storage: WholeCellSimDB
Visualization: WholeCellViz
wholecell.org
??

•How can we model more complex physiology?
• Transcriptional regulation
• Translational regulation
• Stochastic death, failure modes
• Higher-order meta-stable states
• Resource distribution
• Aging
• Evolution
• Populations
•How can we model more complex organisms?
• Larger bacteria
• Eukaryotes
• Multicellularity
• Humans
•How can we use models to direct engineering?
Open challenges

Whole-cell modeling course
1. Teach whole-cell modeling
• Model biological systems
• Construct dynamical models
• Integrate models
2. Improve implementation
• Reusable
• Standard
• Open
3. Improve methodology

Data
?
Whole-cell
models
Broadly predicts cell physiology
Integrates heterogeneous data and models
Guides bioengineering and medicine
Knowledge

• Karr JR et al. (2012) A Whole-Cell Computational Model Predicts Phenotype from
Genotype. Cell, 150, 389-401.
• Macklin DN, Ruggero NA, Covert MW (2014) The future of whole-cell
modeling. Curr Opin Biotechnol, 28C, 111-115.
• Shuler ML, Foley P, Atlas J (2012). Modeling a minimal cell. Methods Mol
Biol, 881, 573-610.
• Joyce AR, Palsson BØ (2007). Toward whole cell modeling and simulation:
comprehensive functional genomics through the constraint-based approach. Prog
Drug Res 64, 267-309.
• Tomita M (2001). Whole-cell simulation: a grand challenge of the 21st century.
Trends Biotechnol 6, 205-10.
• Surovtsev IV et al. (2009) Mathematical modeling of a minimal protocell with
coordinated growth and division. J Theor Biol, 260, 422-9.
Recommended reading

• Thiele I et al. (2009). Genome-scale reconstruction of Escherichia coli's
transcriptional and translational machinery: a knowledge base, its mathematical
formulation, and its functional characterization. PLoS Comput Biol. 5, e1000312.
• Orth JD, Thiele I, Palsson BØ (2010). What is flux balance analysis? Nat
Biotechnol, 28, 245-8.
• Covert MW et al (2008). Integrated Flux Balance Analysis Model of Escherichia coli.
Bioinformatics 24, 2044–50.
• Covert MW et al (2004). Integrating high-throughput and computational data
elucidates bacterial networks. Nature, 429, 92-6.
Recommended reading: FBA

Introduction to whole-cell modeling lecture | Whole-cell modeling summer school | March 3-9, 2015 @ University of Rostock

Recomendados

Recomendados

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Destacado

Destacado (12)

Similar a Introduction to whole-cell modeling lecture | Whole-cell modeling summer school | March 3-9, 2015 @ University of Rostock

Similar a Introduction to whole-cell modeling lecture | Whole-cell modeling summer school | March 3-9, 2015 @ University of Rostock (20)

Último

Último (20)

Introduction to whole-cell modeling lecture | Whole-cell modeling summer school | March 3-9, 2015 @ University of Rostock

Notas del editor