4.11.24 Mass Incarceration and the New Jim Crow.pptx
Phylogenomics Talk at UC Berkeley by J. A. Eisen
1. Phylogenomics and the Diversity
and Diversification of Microbes
Jonathan A. Eisen
UC Davis
UC Berkeley Talk
February 3, 2011
Monday, February 14, 2011
2. My Obsessions
Jonathan A. Eisen
UC Davis
UC Berkeley Talk
February 3, 2011
Monday, February 14, 2011
4. Social Networking in Science
HOME PAGE MY TIMES TODAY'S PAPER VIDEO MOST POPULAR TIMES TOPICS Welcome, fcollins Member Center Log Out
Sunday, April 1, 2007 Health
WORLD U.S. N.Y. / REGION BUSINESS TECHNOLOGY SCIENCE HEALTH SPORTS OPINION ARTS STYLE TRAVEL JOBS REAL ESTATE AUTOS
FITNESS & NUTRITION HEALTH CARE POLICY MENTAL HEALTH & BEHAVIOR
Scientist Reveals Secret of the Ocean: It's Him
By NICHOLAS WADE
Published: April 1, 2007
PRINT nytimes.com/sports
Maverick scientist J. Craig Venter has done it again. It was just a few years SINGLE-PAGE
ago that Dr. Venter announced that the human genome sequenced by Celera
SAVE
Genomics was in fact, mostly his own. And now, Venter has revealed a second
SHARE
twist in his genomic self-examination. Venter was discussing his Global
SHARE
Ocean Voyage, in which he used his personal yacht to collect ocean water
samples from around the world. He then used large filtration units to collect How good is your bracket? Compare your tournament picks
to choices from members of The New York Times sports
microbes from the water samples which were then brought back to his high desk and other players.
tech lab in Rockville, MD where he used the same methods that were used to Also in Sports:
The Bracket Blog - all the news leading up to the Final
sequence the human genome to study the genomes of the 1000s of ocean Four
dwelling microbes found in each sample. In discussing the sampling methods, Venter let slip his Bats Blog: Spring training updates
Play Magazine: How to build a super athlete
latest attack on the standards of science – some of the samples were in fact not from the ocean, but
were from microbial habitats in and on his body.
“The human microbiome is the next frontier,” Dr. Venter said. “The ocean voyage was just a cover.
My main goal has always been to work on the microbes that live in and on people. And now that my
genome is nearly complete, why not use myself as the model for human microbiome studies as well.
”
It is certainly true that in the last few years, the microbes that live in and on people have become a
hot research topic. So hot that the same people who were involved in the race to sequence the human
genome have been involved in this race too. Francis Collins, Venter main competitor and still the
Monday, February 14, 2011
director of the National Human Genome Research Institute (NHGRI), recently testified before
8. Phylogenomics of Novelty
Mechanisms of Variation in
Origin of New Mechanisms:
Functions Patterns, Causes
and Effects
Monday, February 14, 2011
9. Phylogenomics of Novelty
Mechanisms of Variation in
Origin of New Mechanisms:
Functions Patterns, Causes
and Effects
Species Evolution
Monday, February 14, 2011
10. Phylogenomics of Novelty
• How does novelty originate?
• Major categories of processes
• From within
• De novo invention
• Simple substitutions
• Duplication and divergence
• Domain shuffling
• Small & large rearrangements
• Regulatory changes
• From outside
• Lateral gene transfer
• Symbioses
Monday, February 14, 2011
11. Phylogenomics of Novelty
• How does novelty originate?
• Major categories of processes
Mechanisms of • From within
Origin of New • De novo invention
Functions • Simple substitutions
• Duplication and divergence
• Domain shuffling
• Small & large rearrangements
• Regulatory changes
• From outside
• Lateral gene transfer
• Symbioses
Monday, February 14, 2011
12. Phylogenomics of Novelty
• Patterns of variation
• Within species
• Between species
• Causes
• Variation in replication,
recombination and repair
• Effects
• Differences in evolvability
• Ecological niche
• Short and long term genome
evolution
Monday, February 14, 2011
13. Phylogenomics of Novelty
• Patterns of variation
• Within species Variation in
• Between species Mechanisms:
• Causes Patterns, Causes
• Variation in replication, and Effects
recombination and repair
• Effects
• Differences in evolvability
• Ecological niche
• Short and long term genome
evolution
Monday, February 14, 2011
14. Phylogenomics of Novelty
• Information needed to distinguish convergence from
homology
• Allows inference of rates and patterns of change
• Allows one to determine if something is a “one time” event
or a common theme in many lineages
Monday, February 14, 2011
15. Phylogenomics of Novelty
• Information needed to distinguish convergence from
homology
• Allows inference of rates and patterns of change
• Allows one to determine if something is a “one time” event
or a common theme in many lineages
Species Evolution
Monday, February 14, 2011
16. Phylogenomics of Novelty
Variation in
Mechanisms of
Mechanisms:
Origin of New
Patterns, Causes
Functions
and Effects
Species Evolution
Monday, February 14, 2011
17. Phylogenomics of Novelty
Variation in
Mechanisms of
Mechanisms:
Origin of New
Patterns, Causes
Functions
and Effects
Focus Today on
Using Sequence
Information for All
of This
Species Evolution
Monday, February 14, 2011
18. Why do this?
• Discover causes and effects of differences in
evolvability
• Improve predictions from genome analysis
• Guide interpretation of biological data
Monday, February 14, 2011
19. Outline
• Introduction
• Phylogenomic Stories
– Within genome invention of novelty
– Stealing novelty
– Communities of microbes
– Community service and knowing what we don’t
know
Monday, February 14, 2011
20. Introduction
Genome Sequencing
Monday, February 14, 2011
21. rRNA Tree of Life
FIgure from Barton, Eisen et al.
“Evolution”, CSHL Press.
Based on tree from Pace NR, 2003.
Monday, February 14, 2011
22. Limited Sampling of RRR Studies
FIgure from Barton, Eisen et al.
“Evolution”, CSHL Press.
Based on tree from Pace NR, 2003.
Monday, February 14, 2011
23. Limited Sampling of RRR Studies
Haloferax
Methanococcus
Chlorobium
Deinococcus
Thermotoga
FIgure from Barton, Eisen et al.
“Evolution”, CSHL Press.
Based on tree from Pace NR, 2003.
Monday, February 14, 2011
27. Limited Sampling of RRR Studies
Haloferax
Methanococcus
Chlorobium
Deinococcus
Thermotoga
FIgure from Barton, Eisen et al.
“Evolution”, CSHL Press.
Based on tree from Pace NR, 2003.
Monday, February 14, 2011
35. Phylogenomics of Novelty
• How does novelty originate?
• Major categories of processes
• From within
• De novo invention
• Simple substitutions
• Duplication and divergence
• Domain shuffling
• Small & large rearrangements
• Regulatory changes
• From outside
• Lateral gene transfer
• Symbioses
Monday, February 14, 2011
36. Phylogenomics of Novelty
• How does novelty originate?
• Major categories of processes
Mechanisms of • From within
Origin of New • De novo invention
Functions • Simple substitutions
• Duplication and divergence
• Domain shuffling
• Small & large rearrangements
• Regulatory changes
• From outside
• Lateral gene transfer
• Symbioses
Monday, February 14, 2011
37. From Eisen et al.
1997 Nature
Medicine 3:
1076-1078.
Monday, February 14, 2011
38. Blast Search of H. pylori “MutS”
• Blast search pulls up Syn. sp MutS#2 with much higher p
value than other MutS homologs
• Based on this TIGR predicted this species had mismatch
repair
• Assumes functional constancy
Based on Eisen et al. 1997 Nature Medicine 3: 1076-1078.
Monday, February 14, 2011
39. Predicting Function
• Identification of motifs
– Short regions of sequence similarity that are indicative of
general activity
– e.g., ATP binding
• Homology/similarity based methods
– Gene sequence is searched against a databases of other
sequences
– If significant similar genes are found, their functional
information is used
• Problem
– Genes frequently have similarity to hundreds of motifs
and multiple genes, not all with the same function
Monday, February 14, 2011
40. MutL??
Based on Eisen et al. 1997 Nature Medicine 3: 1076-1078.
Monday, February 14, 2011
41. Overlaying Functions onto Tree
MutS2
Aquae
MSH5 Strpy
Bacsu
Synsp
Deira Helpy
Yeast
Human Borbu Metth
Celeg
MSH6 mSaco
Yeast
Human
Mouse
Arath
Yeast MSH4
Celeg
Human
Arath
Human
MSH3 Mouse
Fly
Spombe
Yeast Xenla
Rat
Mouse
Yeast Human
MSH1 Spombe Yeast MSH2
Neucr
Arath
Aquae Trepa
Chltr
DeiraTheaq
BacsuBorbu
Thema
SynspStrpy Based on Eisen,
Ecoli
Neigo
1998 Nucl Acids
MutS1 Res 26: 4291-4300.
Monday, February 14, 2011
45. Tetrahymena Genome Processing
• Probably exists as a defense mechanism
• Analogous to RIPPING and
heterochromatin silencing
• Presence of repetitive DNA in MAC but
not TEs suggests the mechanism involves
targeting foreign DNA
• Thus unlike RIPPING ciliate processing
does not limit diversification by duplication
Eisen et al. 2006. PLoS Biology.
Monday, February 14, 2011
46. Conclusions
• Enormous variation in processes underlying
origin of novelty
• See within genomes -> between species
• Knowledge about mechanisms and variation
helps predictions of function and biology
from analysis of sequence data
Monday, February 14, 2011
47. Phylogenomics of Novelty II
Sometimes, it is easier to steal, borrow, or
coopt functions rather than evolve them
anew
Monday, February 14, 2011
49. rRNA Tree of Life
Bacteria
Archaea
Eukaryotes
FIgure from Barton, Eisen et al.
“Evolution”, CSHL Press.
Based on tree from Pace NR, 2003.
Monday, February 14, 2011
51. Network of Life
Bacteria
Archaea
Eukaryotes
Figure from Barton, Eisen et al.
“Evolution”, CSHL Press.
Based on tree from Pace NR, 2003.
Monday, February 14, 2011
52. A. thaliana T1E2.8 is a
Chloroplast Derived HSP60
Monday, February 14, 2011
54. Correlated gain/loss of genes
• Microbial genes are lost rapidly when not
maintained by selection
• Genes can be acquired by lateral transfer
• Frequently gain and loss occurs for entire
pathways/processes
• Thus might be able to use correlated
presence/absence information to identify
genes with similar functions
Monday, February 14, 2011
55. Non-Homology Predictions:
Phylogenetic Profiling
• Step 1: Search all genes in
organisms of interest against all
other genomes
• Ask: Yes or No, is each gene
found in each other species
• Cluster genes by distribution
patterns (profiles)
Monday, February 14, 2011
56. Carboxydothermus hydrogenoformans
• Isolated from a Russian hotspring
• Thermophile (grows at 80°C)
• Anaerobic
• Grows very efficiently on CO
(Carbon Monoxide)
• Produces hydrogen gas
• Low GC Gram positive
(Firmicute)
• Genome Determined (Wu et al.
2005 PLoS Genetics 1: e65. )
Monday, February 14, 2011
57. Homologs of Sporulation Genes
Wu et al. 2005
PLoS Genetics 1:
e65.
Monday, February 14, 2011
61. Mutualistic Genome Evolution
• Compare and contrast different types of
mutualistic symbioses
• Diverse hosts, symbionts, biology, ages
• Organelles, chemosymbioses,
photosynthetic symbioses, nutritional
symbioses
• What are the rules & patterns?
Monday, February 14, 2011
62. Glassy Winged Sharpshooter
• Feeds on xylem
sap
• Vector for
Pierce’s Disease
• Potential
bioterror agent
Monday, February 14, 2011
63. Sharpshooter Shotgun Sequencing
shotgun
Collaboration with Nancy
Wu et al. 2006 PLoS Biology 4: e188.
Moran’s lab
Monday, February 14, 2011
67. Higher Evolutionary Rates in
Endosymbionts
Wu et al. 2006 PLoS Biology 4: e188. Collaboration with Nancy Moran’ s Lab
Monday, February 14, 2011
68. Variation in Evolution Rates
MutS MutL
+ +
+ +
+ +
+ +
_ _
_ _
Wu et al. 2006 PLoS Biology 4: e188. Collaboration with Nancy Moran’ s Lab
Monday, February 14, 2011
69. Polymorphisms in Metapopulation
• Data from ~200 hosts
– 104 SNPs
– 2 indels
• PCR surveys show that
this is between host
variation
• Much lower ratio of
transitions:transversions
than in Blochmannia
• Consistent with absence
of MMR from
Blochmannia
Monday, February 14, 2011
70. Baumannia is a Vitamin and
Cofactor Producing Machine
Wu et al.
2006
PLoS
Biology 4:
e188.
Monday, February 14, 2011
83. How can we best use
metagenomic data?
• Many possible uses including:
– Improvements on rRNA based phylotyping and
species diversity measurements
– Adding functional information on top of
phylogenetic/species diversity information
• Most/all possible uses either require or are
improved with phylogenetic analysis
Monday, February 14, 2011
86. Weighted % of Clones
0
0.1250
0.2500
0.3750
0.5000
Al
ph
ap
ro
t eo
Be b ac
ta
pr t er
ot
e ia
G ob
am ac
m t er
ap ia
Monday, February 14, 2011
ro
Ep te
si ob
lo ac
np t er
ro ia
De t eo
lta b ac
pr te
ot ria
eo
b
C ac
ya ter
n ob ia
ac
t er
Fi ia
rm
ic
u te
Ac s
tin
ob
ac
t er
C ia
hl
or
ob
i
C
FB
Major Phylogenetic Group
Sargasso Phylotypes
C
hl
or
of
le
Sp xi
iro
ch
ae
te
Fu
so s
De ba
in ct
er
oc ia
oc
cu
s-
Eu The
ry r
ar mu
ch s
ae
C ot
re a
na
rc
ha
eo
ta
Shotgun Sequencing Allows Use of Other Markers
EFG
Venter et al., Science 304: 66-74. 2004
EFTu
rRNA
RecA
RpoB
HSP70
101. Research Topics
Variation in
Mechanisms of
Mechanisms:
Origin of New
Patterns, Causes
Functions
and Effects
Species Evolution
Monday, February 14, 2011
102. Research Topics
Variation in
Mechanisms of
Mechanisms:
Origin of New
Patterns, Causes
Functions
and Effects
Species Evolution
Monday, February 14, 2011
104. As of 2002 Proteobacteria
TM6
OS-K • At least 40
Acidobacteria
Termite Group
OP8
phyla of
Nitrospira
Bacteroides bacteria
Chlorobi
Fibrobacteres
Marine GroupA
WS3
Gemmimonas
Firmicutes
Fusobacteria
Actinobacteria
OP9
Cyanobacteria
Synergistes
Deferribacteres
Chrysiogenetes
NKB19
Verrucomicrobia
Chlamydia
OP3
Planctomycetes
Spriochaetes
Coprothmermobacter
OP10
Thermomicrobia
Chloroflexi
TM7
Deinococcus-Thermus
Dictyoglomus
Aquificae
Thermudesulfobacteria
Thermotogae
OP1 Based on
OP11 Hugenholtz, 2002
Monday, February 14, 2011
105. As of 2002 Proteobacteria
TM6
OS-K
• At least 40
Acidobacteria
Termite Group
OP8
phyla of
Nitrospira
Bacteroides bacteria
Chlorobi
Fibrobacteres
Marine GroupA • Genome
WS3
Gemmimonas
Firmicutes
sequences are
Fusobacteria
Actinobacteria
mostly from
OP9
Cyanobacteria
Synergistes
three phyla
Deferribacteres
Chrysiogenetes
NKB19
Verrucomicrobia
Chlamydia
OP3
Planctomycetes
Spriochaetes
Coprothmermobacter
OP10
Thermomicrobia
Chloroflexi
TM7
Deinococcus-Thermus
Dictyoglomus
Aquificae
Thermudesulfobacteria
Thermotogae
OP1 Based on
OP11 Hugenholtz, 2002
Monday, February 14, 2011
106. As of 2002 Proteobacteria
TM6
OS-K
• At least 40
Acidobacteria
Termite Group
OP8
phyla of
Nitrospira
Bacteroides bacteria
Chlorobi
Fibrobacteres
Marine GroupA • Genome
WS3
Gemmimonas
Firmicutes
sequences are
Fusobacteria
Actinobacteria
mostly from
OP9
Cyanobacteria
Synergistes
three phyla
Deferribacteres
Chrysiogenetes
NKB19
• Some other
Verrucomicrobia
Chlamydia
OP3
phyla are
Planctomycetes
Spriochaetes only sparsely
Coprothmermobacter
OP10
Thermomicrobia
sampled
Chloroflexi
TM7
Deinococcus-Thermus
Dictyoglomus
Aquificae
Thermudesulfobacteria
Thermotogae
OP1 Based on
OP11 Hugenholtz, 2002
Monday, February 14, 2011
107. As of 2002 Proteobacteria
TM6
OS-K
• At least 40
Acidobacteria
Termite Group
OP8
phyla of
Nitrospira
Bacteroides bacteria
Chlorobi
Fibrobacteres
Marine GroupA • Genome
WS3
Gemmimonas
Firmicutes
sequences are
Fusobacteria
Actinobacteria
mostly from
OP9
Cyanobacteria
Synergistes
three phyla
Deferribacteres
Chrysiogenetes
NKB19
• Some other
Verrucomicrobia
Chlamydia
OP3
phyla are
Planctomycetes
Spriochaetes only sparsely
Coprothmermobacter
OP10
Thermomicrobia
sampled
Chloroflexi
TM7
Deinococcus-Thermus
Dictyoglomus
Aquificae
Thermudesulfobacteria
Thermotogae
OP1 Based on
OP11 Hugenholtz, 2002
Monday, February 14, 2011
108. Proteobacteria
• NSF-funded TM6
OS-K
• At least 40
Tree of Life Acidobacteria
Termite Group phyla of
OP8
Project Nitrospira
Bacteroides bacteria
Chlorobi
• A genome Fibrobacteres
Marine GroupA • Genome
WS3
from each of Gemmimonas sequences are
Firmicutes
eight phyla Fusobacteria
mostly from
Actinobacteria
OP9
Cyanobacteria
Synergistes
three phyla
Deferribacteres
Chrysiogenetes
NKB19
• Some other
Verrucomicrobia
Chlamydia
OP3
phyla are only
Planctomycetes
Spriochaetes sparsely
Coprothmermobacter
OP10
Thermomicrobia
sampled
Chloroflexi
TM7
Deinococcus-Thermus
• Solution I:
Dictyoglomus
Eisen, Ward, Aquificae
Thermudesulfobacteria
sequence more
Robb, Nelson, et Thermotogae
phyla
OP1
al OP11
Monday, February 14, 2011
110. Proteobacteria
• NSF-funded TM6
OS-K
• At least 40
Tree of Life Acidobacteria
Termite Group phyla of bacteria
OP8
Project Nitrospira
• Genome
Bacteroides
• A genome Chlorobi
Fibrobacteres sequences are
Marine GroupA
from each of WS3
Gemmimonas mostly from
eight phyla Firmicutes
Fusobacteria three phyla
Actinobacteria
OP9
Cyanobacteria
• Some other
Synergistes
Deferribacteres
Chrysiogenetes
phyla are only
NKB19
Verrucomicrobia sparsely
Chlamydia
OP3
Planctomycetes
sampled
Spriochaetes
Coprothmermobacter • Still highly
OP10
Thermomicrobia
Chloroflexi
biased in terms
TM7
Deinococcus-Thermus
Dictyoglomus
of the tree
Aquificae
Eisen & Ward, PIs Thermudesulfobacteria
Thermotogae
OP1
OP11
Monday, February 14, 2011
111. Proteobacteria
• NSF-funded TM6
OS-K
• At least 40
Tree of Life Acidobacteria
Termite Group phyla of bacteria
OP8
Project Nitrospira
• Genome
Bacteroides
• A genome Chlorobi
Fibrobacteres sequences are
Marine GroupA
from each of WS3
Gemmimonas mostly from
eight phyla Firmicutes
Fusobacteria three phyla
Actinobacteria
OP9
Cyanobacteria
• Some other
Synergistes
Deferribacteres
Chrysiogenetes
phyla are only
NKB19
Verrucomicrobia sparsely
Chlamydia
OP3
Planctomycetes
sampled
Spriochaetes
Coprothmermobacter • Same trend in
OP10
Thermomicrobia
Chloroflexi
Archaea
TM7
Deinococcus-Thermus
Dictyoglomus
Aquificae
Eisen & Ward, PIs Thermudesulfobacteria
Thermotogae
OP1
OP11
Monday, February 14, 2011
112. Proteobacteria
• NSF-funded TM6
OS-K
• At least 40
Tree of Life Acidobacteria
Termite Group phyla of bacteria
OP8
Project Nitrospira
• Genome
Bacteroides
• A genome Chlorobi
Fibrobacteres sequences are
Marine GroupA
from each of WS3
Gemmimonas mostly from
eight phyla Firmicutes
Fusobacteria three phyla
Actinobacteria
OP9
Cyanobacteria
• Some other
Synergistes
Deferribacteres
Chrysiogenetes
phyla are only
NKB19
Verrucomicrobia sparsely
Chlamydia
OP3
Planctomycetes
sampled
Spriochaetes
Coprothmermobacter • Same trend in
OP10
Thermomicrobia
Chloroflexi
Eukaryotes
TM7
Deinococcus-Thermus
Dictyoglomus
Aquificae
Eisen & Ward, PIs Thermudesulfobacteria
Thermotogae
OP1
OP11
Monday, February 14, 2011