SlideShare una empresa de Scribd logo
1 de 39
Studying the microbiome 
Mick Watson 
Head of Bioinformatics, Edinburgh Genomics, University of Edinburgh 
Research Group Leader, The Roslin Institute, University of Edinburgh
Edinburgh Genomics 
• Genomics facility based at the University of Edinburgh 
• Available for collaborations on an academic, non-profit basis 
• Formed from merger of 
– ARK-Genomics 
– The GenePool 
• Funded by three major bio UK research councils 
• A range of technologies and expertise available 
http://genomics.ed.ac.uk
Prevailing theory of the individual 
• An individual consists of at least 10x as many bacterial cells as “host” cells * 
• Each individual is a “supra-organism” 
– a composite of host and microbial cells contribute the functions necessary for the 
individual to survive 
• The genetic landscape of any individual is a composite of the host genome 
and the genomes of the millions of microbial symbionts that live on and 
within that individual 
• It is clearly important to take a holistic view when examining any animal 
phenotype 
My focus 
• Move from discovery science to applied science 
• “What’s there?”  “What can we do with it?”
• The “ten times” figure 
comes from a paper in 
1972, and is estimated 
from 1g of human faeces 
• More modern estimates 
range from equal to 100 
times! 
• American Society for 
Microbiology 2014 report 
puts the ratio closer to 3:1 
• Panel included Peter 
Turnbaugh 
• There’s still more of them 
though…. 
http://www.bostonglobe.com/ideas/2014/09/13/your-body-mostly-microbes- 
actually-have-idea/qlcoKot4wfUXecjeVaFKFN/story.html
Microbiome research is undergoing a crisis 
Please don’t make things worse  
• Crisis 1 
– The correlation/causation fallacy. For example…. 
– Patients with type II diabetes have a different gut microbiome compared 
to healthy patients 
– Does the microbiome cause diabetes? 
– Or do they have a different microbiome because they have diabetes? 
(therefore different diet) 
• Crisis 2 
– A lot of people want to do it, but don’t know how 
– Errors, bad experimental design, incorrect conclusions
What is the microbiome? 
“the ecological community of commensal, 
symbiotic, and pathogenic microorganisms that 
literally share our body space” 
- Joshua Lederberg 
Note: includes funghi, protists, archaea, bacteria, algae, viruses etc etc etc 
(whisper it: most “microbiome” studies only look at bacteria/archaea)
How do we study the microbiome? 
• Marker gene vs shotgun metagenomics 
• Marker gene 
– 16S / 18S / ITS 
– Amplify this and compare 
• Metagenomics 
– Extract all DNA 
– Fragment, sequence, interpret 
• In theory, the latter least biased*
16S studies are not metagenomics 
http://phylogenomics.blogspot.co.uk/2012/08/referring-to-16s-surveys-as.html, http://biomickwatson.wordpress.com/2014/01/12/youre-probably-not-doing-metagenomics/
16S 
• Prokaryotic rRNA subunit 
• Present in all (?) bacterial/archaeal genomes, contains constant 
and hypervariable regions 
• Hypervariable regions may give “species specific” signatures
16S process 
• Current sequencing technologies can’t sequence whole thing 
• Design primers in constant regions and PCR 
• Amplify 1 or more hypervariable regions 
• Cluster similar sequences into OTUs 
• Compare to 16S database and assign phylogenetic group 
• Compare abundance across sample groups (QIIME, Mothur)
16S problems 
• Some genomes have multiple copies of the 16S gene 
• The constant regions aren’t constant 
– Design degenerate primers 
– Some primers pick up certain groups better than others 
– A perfect match primer will amplify better than one containing mis-matches 
• The abundances from 16S are wrong, we simply hope that 
they are consistently wrong across samples 
• Absence really difficult to prove/wrong to assume 
• Chimeras, PCR artefacts consisting of 16S gene fragments 
from two different molecules
• Ashelford KE, Chuzhanova NA, Fry JC, Jones AJ, Weightman AJ. At least 1 in 20 16S rRNA sequence records 
currently held in public repositories is estimated to contain substantial anomalies. Appl Environ Microbiol. 
2005 71(12):7724-36.
SEQUENCING TECHNOLOGIES
References
Technology Advantages Disadvantages Output per run 
Illumina Highly accurate; cheap; 
Sequencing: what’s on the market? 
industry leader; multiple 
platforms 
Slower than Ion; short 
reads; 
HiSeq X Ten: 18Tb 
HiSeq X: 1.8Tb 
2500:HO 600Gb -> 1Tb 
2500:RO: 180Gb 
NextSeq: 140Gb 
MiSeq: 25Gb 
Ion Torrent Fast; cheap machine Very poor on 
homopolymers; doesn’t 
match Illumina on 
throughput 
PGM: 2Gb 
Proton P1: 10Gb 
Proton P2: 30Gb 
PacBio Long reads; single molecule High error rate, needs 
correction; low 
throughput; expensive 
machine 
300-500Mb 
Oxford 
Nanopore 
MinION 
Long reads; single molecule; 
cheap; portable 
High error rate; unknown 
quantity 
Unknown 
Complete 
Genomics 
Highly accurate; cheap Limited to human; black 
box 
Unknown; human 
genomes can be purchased
Illumina read lengths 
• HiSeq X Ten (Human only): 100PE 
• HiSeq 2500: V4 125PE, V3R 150PE, V3H 100PE 
• NextSeq: 150PE 
• MiSeq: V2: 250PE, V3 300PE
16S sequencing strategy? 
• Platform: MiSeq 
• Theoretically: 
– 2x150bp can sequence ~180bp amplicon 
– 2x250bp can sequence ~480bp amplicon 
– 2x300bp can sequence ~580bp amplicon
Important paper 
• Amongst other 
things, sequenced 
a mock 
community with 
different 
sequencing and 
bioinformatics 
strategies 
• Kozich JJ, Westcott SL, Baxter 
NT, Highlander SK, Schloss PD. 
Development of a dual-index 
sequencing strategy and 
curation pipeline for analyzing 
amplicon sequence data on 
the MiSeq Illumina sequencing 
platform. Appl Environ 
Microbiol. 2013 S79(17):5112- 
20.
• Three 16S regions sequenced using 2x250bp 
– V4 (~250 bp), V34 (430bp), and V45 regions (~375 bp) 
– In the Mock community, there should be 20 OTUs
16S sequencing strategy? 
• The only strategy that got close to the correct result is 
complete overlap of 2x250bp MiSeq reads
SHOTGUN METAGENOMICS
Shotgun metagenomics 
• Take ecosystem, extract all DNA and sequence it 
• Should be unbiased, right?... Right? 
• (NB: issues on the next few slides are also issues for 
marker gene studies)
Extraction protocol 
“we found that each DNA 
extraction method resulted in 
unique community patterns” 
“We observed significant differences 
in distribution of bacterial taxa 
depending on the method.”
Storage 
“Samples frozen with and without glycerol as cryoprotectant 
indicated a major loss of Bacteroidetes in unprotected samples”
• In the chicken caecum, bacteroidetes dominate, followed by 
firmicutes: 
• Nordentoft S et al (2011) The influence of the cage system and colonisation of Salmonella Enteritidis on 
the microbial gut flora of laying hens studied by T-RFLP and 454 pyrosequencing. BMC Microbiol. 11:187
• In the chicken caecum, firmicutes dominate, few 
proteobacteria, no bacteroidetes 
• Danzeisen JL et al (2011). Modulations of the chicken cecal microbiome and metagenome in response to 
anticoccidial and growth promoter treatment. PLOS ONE. 6(11):e27949.
• Did I mention that microbiome research is 
undergoing a crisis?  
• It gets worse…..
Contamination 
• Sequenced a pure culture of 
Salmonella bongori 
• Extracted DNA using different kits 
• Did serial dilutions of the pure 
culture to assess impact of 
contaminating species
The kits 
• FastDNA Spin Kit For Soil (FP), MoBio UltraClean Microbial 
DNA Isolation Kit (MB), QIAmp DNA Stool Mini Kit (QIA) and 
PSP Spin Stool DNA Plus kit (PSP) 
FP had a stable kit profile dominated by Burkholderia, PSP was dominated by 
Bradyrhizobium, while the QIA kit had the most complex mix of bacterial DNA. 
Bradyrhizobiaceae, Burkholderiaceae, Chitinophagaceae, Comomonadaceae, 
Propionibacteriaceae and Pseudomonadaceae were present in at least three quarters of 
the dilutions from PSP, FP and QIA kits. However, relative abundances of taxa at the 
Family level varied according to kit: FP was marked by Burkholderiaceae and 
Enterobacteriaceae, PSP was marked by Bradyrhizobiaceae and Chitinophagaceae. The 
contamination in the QIA kit was relatively diverse in comparison to the other kits, and 
included higher proportions of Aerococcaceae, Bacillaceae, Flavobacteriaceae, 
Microbacteriaceae, Paenibacillaceae, Planctomycetaceae and Polyangiaceae than the 
other kits. Kit MB did not have a distinct contaminant profile and varied from dilution to 
dilution due to paucity of reads
“These metagenomic results therefore clearly 
show that contamination becomes the dominant 
feature of sequence data from low biomass 
samples, and that the kit used to extract DNA can 
have an impact on the observed bacterial 
diversity”
From Salter et al: 
“Tellingly, Laurence et al [1] recently 
demonstrated with an in silico 
analysis that Bradyrhizobium is a 
common contaminant of 
sequencing datasets including the 
1000 Human Genome Project” 
1. Laurence M, Hatzis C, Brash DE. 
Common contaminants in next-generation 
sequencing that hinder 
discovery of low-abundance microbes. 
PLoS One. 2014 9(5):e97876. 
Adenoids are at the back of the nasal cavity 
Bradyrhizobium is a soil bacterium
Confounding factors
ANYWAY…..
Shotgun metagenomics 
• Can assemble 
– MetaVelvet, Meta-IDBA, Ray Meta, MetAMOS 
– Different techniques for partitioning 
• Coverage, sequence composition, connectivity 
• MetaWatt, CONCOCT 
– Predict genes: Glimmer-MG, FragGenScan 
• Use reference 
– Kraken, PhyloSift, MetaPhlAn, HUMAnN
All-in-one solution 
• EBI Metagenomics 
• Hunter S, et al. EBI metagenomics--a new resource for the analysis and archiving of 
metagenomic data. Nucleic Acids Res. 2014 42(Database issue):D600-6.
CONCLUSIONS
Conclusions 
• I love microbiome research (honestly!) 
• Really, incredibly exciting… but…. 
• Every step counts 
• Be very careful, at all stages 
• 16S – cheap, biased but effective 
• WGS – expensive, information rich, less biased 
• Beware contamination, include controls

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Metagenomics newer approach in understanding Microbes
Metagenomics newer approach in understanding Microbes  Metagenomics newer approach in understanding Microbes
Metagenomics newer approach in understanding Microbes
 
Metagenomic analysis
Metagenomic analysisMetagenomic analysis
Metagenomic analysis
 
SAGE- Serial Analysis of Gene Expression
SAGE- Serial Analysis of Gene ExpressionSAGE- Serial Analysis of Gene Expression
SAGE- Serial Analysis of Gene Expression
 
Introduction to 16S Analysis with NGS - BMR Genomics
Introduction to 16S Analysis with NGS - BMR GenomicsIntroduction to 16S Analysis with NGS - BMR Genomics
Introduction to 16S Analysis with NGS - BMR Genomics
 
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
 
Metatranscriptomic sequencing service
Metatranscriptomic sequencing serviceMetatranscriptomic sequencing service
Metatranscriptomic sequencing service
 
Metabarcoding QIIME2 workshop - Denoise
Metabarcoding QIIME2 workshop - DenoiseMetabarcoding QIIME2 workshop - Denoise
Metabarcoding QIIME2 workshop - Denoise
 
Metagenomic
MetagenomicMetagenomic
Metagenomic
 
Bio153 microbial genomics 2012
Bio153 microbial genomics 2012Bio153 microbial genomics 2012
Bio153 microbial genomics 2012
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
 
Genome Assembly 2018
Genome Assembly 2018Genome Assembly 2018
Genome Assembly 2018
 
De novo genome assembly - T.Seemann - IMB winter school 2016 - brisbane, au ...
De novo genome assembly  - T.Seemann - IMB winter school 2016 - brisbane, au ...De novo genome assembly  - T.Seemann - IMB winter school 2016 - brisbane, au ...
De novo genome assembly - T.Seemann - IMB winter school 2016 - brisbane, au ...
 
Intro to illumina sequencing
Intro to illumina sequencingIntro to illumina sequencing
Intro to illumina sequencing
 
Microarray technology, biochip, DNA chip
Microarray technology, biochip, DNA chip Microarray technology, biochip, DNA chip
Microarray technology, biochip, DNA chip
 
RNA-seq Data Analysis Overview
RNA-seq Data Analysis OverviewRNA-seq Data Analysis Overview
RNA-seq Data Analysis Overview
 
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
 
The Galaxy bioinformatics workflow environment
The Galaxy bioinformatics workflow environmentThe Galaxy bioinformatics workflow environment
The Galaxy bioinformatics workflow environment
 
System biology and its tools
System biology and its toolsSystem biology and its tools
System biology and its tools
 
Yeast Genome
Yeast Genome Yeast Genome
Yeast Genome
 

Destacado

Ngs microbiome
Ngs microbiomeNgs microbiome
Ngs microbiome
jukais
 
140127 abrf interlaboratory study proposal
140127 abrf interlaboratory study proposal140127 abrf interlaboratory study proposal
140127 abrf interlaboratory study proposal
GenomeInABottle
 

Destacado (20)

Ngs microbiome
Ngs microbiomeNgs microbiome
Ngs microbiome
 
Molecular characterization of Pst isolates from Western Canada
Molecular characterization of Pst isolates from Western CanadaMolecular characterization of Pst isolates from Western Canada
Molecular characterization of Pst isolates from Western Canada
 
New High Throughput Sequencing technologies at the Norwegian Sequencing Centr...
New High Throughput Sequencing technologies at the Norwegian Sequencing Centr...New High Throughput Sequencing technologies at the Norwegian Sequencing Centr...
New High Throughput Sequencing technologies at the Norwegian Sequencing Centr...
 
140127 abrf interlaboratory study proposal
140127 abrf interlaboratory study proposal140127 abrf interlaboratory study proposal
140127 abrf interlaboratory study proposal
 
Next-Generation Sequencing Commercial Milestones Infographic
Next-Generation Sequencing Commercial Milestones InfographicNext-Generation Sequencing Commercial Milestones Infographic
Next-Generation Sequencing Commercial Milestones Infographic
 
Sequencing, Genome Assembly and the SGN Platform
Sequencing, Genome Assembly and the SGN PlatformSequencing, Genome Assembly and the SGN Platform
Sequencing, Genome Assembly and the SGN Platform
 
Phylogenomic methods for comparative evolutionary biology - University Colleg...
Phylogenomic methods for comparative evolutionary biology - University Colleg...Phylogenomic methods for comparative evolutionary biology - University Colleg...
Phylogenomic methods for comparative evolutionary biology - University Colleg...
 
Aug2014 abrf interlaboratory study plans
Aug2014 abrf interlaboratory study plansAug2014 abrf interlaboratory study plans
Aug2014 abrf interlaboratory study plans
 
Molecular QC: Interpreting your Bioinformatics Pipeline
Molecular QC: Interpreting your Bioinformatics PipelineMolecular QC: Interpreting your Bioinformatics Pipeline
Molecular QC: Interpreting your Bioinformatics Pipeline
 
Ngs part i 2013
Ngs part i 2013Ngs part i 2013
Ngs part i 2013
 
Dr. Douglas Marthaler - Use of Next Generation Sequencing for Whole Genome An...
Dr. Douglas Marthaler - Use of Next Generation Sequencing for Whole Genome An...Dr. Douglas Marthaler - Use of Next Generation Sequencing for Whole Genome An...
Dr. Douglas Marthaler - Use of Next Generation Sequencing for Whole Genome An...
 
Galaxy RNA-Seq Analysis: Tuxedo Protocol
Galaxy RNA-Seq Analysis: Tuxedo ProtocolGalaxy RNA-Seq Analysis: Tuxedo Protocol
Galaxy RNA-Seq Analysis: Tuxedo Protocol
 
2016 iHT2 San Diego Health IT Summit
2016 iHT2 San Diego Health IT Summit2016 iHT2 San Diego Health IT Summit
2016 iHT2 San Diego Health IT Summit
 
Biz model for ion proton dna sequencer
Biz model for ion proton dna sequencerBiz model for ion proton dna sequencer
Biz model for ion proton dna sequencer
 
Next Generation Sequencing and its Applications in Medical Research - Frances...
Next Generation Sequencing and its Applications in Medical Research - Frances...Next Generation Sequencing and its Applications in Medical Research - Frances...
Next Generation Sequencing and its Applications in Medical Research - Frances...
 
A Survey of NGS Data Analysis on Hadoop
A Survey of NGS Data Analysis on HadoopA Survey of NGS Data Analysis on Hadoop
A Survey of NGS Data Analysis on Hadoop
 
NGS Targeted Enrichment Technology in Cancer Research: NGS Tech Overview Webi...
NGS Targeted Enrichment Technology in Cancer Research: NGS Tech Overview Webi...NGS Targeted Enrichment Technology in Cancer Research: NGS Tech Overview Webi...
NGS Targeted Enrichment Technology in Cancer Research: NGS Tech Overview Webi...
 
A different kettle of fish entirely: bioinformatic challenges and solutions f...
A different kettle of fish entirely: bioinformatic challenges and solutions f...A different kettle of fish entirely: bioinformatic challenges and solutions f...
A different kettle of fish entirely: bioinformatic challenges and solutions f...
 
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
 
I Jornada Actualización en Genética Reproductiva y Fertilidad
I Jornada Actualización en Genética Reproductiva y Fertilidad I Jornada Actualización en Genética Reproductiva y Fertilidad
I Jornada Actualización en Genética Reproductiva y Fertilidad
 

Similar a Studying the microbiome

Comparative genomics and proteomics
Comparative genomics and proteomicsComparative genomics and proteomics
Comparative genomics and proteomics
Nikhil Aggarwal
 
617....sjuwbwjisjnslosoanwbwbdhidje.pptx
617....sjuwbwjisjnslosoanwbwbdhidje.pptx617....sjuwbwjisjnslosoanwbwbdhidje.pptx
617....sjuwbwjisjnslosoanwbwbdhidje.pptx
AroojSheikh12
 

Similar a Studying the microbiome (20)

Comparative genomics and proteomics
Comparative genomics and proteomicsComparative genomics and proteomics
Comparative genomics and proteomics
 
Third Generation Sequencing
Third Generation Sequencing Third Generation Sequencing
Third Generation Sequencing
 
Metagenomics
MetagenomicsMetagenomics
Metagenomics
 
Metagenomics by microbiology dept. panjab university2018copy
Metagenomics by microbiology dept. panjab university2018copyMetagenomics by microbiology dept. panjab university2018copy
Metagenomics by microbiology dept. panjab university2018copy
 
Metagenomics
MetagenomicsMetagenomics
Metagenomics
 
DNA recombinant technology on insulin modification
DNA recombinant technology on insulin modificationDNA recombinant technology on insulin modification
DNA recombinant technology on insulin modification
 
Metagenomics
MetagenomicsMetagenomics
Metagenomics
 
Use of DNA barcoding and its role in the plant species/varietal Identifica...
Use of DNA  barcoding  and its role in the plant species/varietal  Identifica...Use of DNA  barcoding  and its role in the plant species/varietal  Identifica...
Use of DNA barcoding and its role in the plant species/varietal Identifica...
 
Microbiome Isolation and DNA Enrichment Protocol: Pathogen Detection Webinar ...
Microbiome Isolation and DNA Enrichment Protocol: Pathogen Detection Webinar ...Microbiome Isolation and DNA Enrichment Protocol: Pathogen Detection Webinar ...
Microbiome Isolation and DNA Enrichment Protocol: Pathogen Detection Webinar ...
 
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
 
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.
Microbiome studies using 16S ribosomal DNA PCR: some cautionary tales.
 
Human genome project - Decoding the codes of life
Human genome project - Decoding the codes of lifeHuman genome project - Decoding the codes of life
Human genome project - Decoding the codes of life
 
Microbiome Identification to Characterization: Pathogen Detection Webinar Ser...
Microbiome Identification to Characterization: Pathogen Detection Webinar Ser...Microbiome Identification to Characterization: Pathogen Detection Webinar Ser...
Microbiome Identification to Characterization: Pathogen Detection Webinar Ser...
 
Molecular techniques for pathology research - MDX .pdf
Molecular techniques for pathology research - MDX .pdfMolecular techniques for pathology research - MDX .pdf
Molecular techniques for pathology research - MDX .pdf
 
Molecular pathology in microbiology and metagenomics
Molecular pathology in microbiology and metagenomicsMolecular pathology in microbiology and metagenomics
Molecular pathology in microbiology and metagenomics
 
617....sjuwbwjisjnslosoanwbwbdhidje.pptx
617....sjuwbwjisjnslosoanwbwbdhidje.pptx617....sjuwbwjisjnslosoanwbwbdhidje.pptx
617....sjuwbwjisjnslosoanwbwbdhidje.pptx
 
2014
20142014
2014
 
Next Gen Sequencing (NGS) Technology Overview
Next Gen Sequencing (NGS) Technology OverviewNext Gen Sequencing (NGS) Technology Overview
Next Gen Sequencing (NGS) Technology Overview
 
Biological technologies
Biological technologiesBiological technologies
Biological technologies
 
Molecular analysis of Microbial Community
Molecular analysis of Microbial CommunityMolecular analysis of Microbial Community
Molecular analysis of Microbial Community
 

Último

Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
PirithiRaju
 
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
ssuser79fe74
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
AlMamun560346
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
PirithiRaju
 

Último (20)

GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Unit5-Cloud.pptx for lpu course cse121 o
Unit5-Cloud.pptx for lpu course cse121 oUnit5-Cloud.pptx for lpu course cse121 o
Unit5-Cloud.pptx for lpu course cse121 o
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 

Studying the microbiome

  • 1. Studying the microbiome Mick Watson Head of Bioinformatics, Edinburgh Genomics, University of Edinburgh Research Group Leader, The Roslin Institute, University of Edinburgh
  • 2. Edinburgh Genomics • Genomics facility based at the University of Edinburgh • Available for collaborations on an academic, non-profit basis • Formed from merger of – ARK-Genomics – The GenePool • Funded by three major bio UK research councils • A range of technologies and expertise available http://genomics.ed.ac.uk
  • 3. Prevailing theory of the individual • An individual consists of at least 10x as many bacterial cells as “host” cells * • Each individual is a “supra-organism” – a composite of host and microbial cells contribute the functions necessary for the individual to survive • The genetic landscape of any individual is a composite of the host genome and the genomes of the millions of microbial symbionts that live on and within that individual • It is clearly important to take a holistic view when examining any animal phenotype My focus • Move from discovery science to applied science • “What’s there?”  “What can we do with it?”
  • 4. • The “ten times” figure comes from a paper in 1972, and is estimated from 1g of human faeces • More modern estimates range from equal to 100 times! • American Society for Microbiology 2014 report puts the ratio closer to 3:1 • Panel included Peter Turnbaugh • There’s still more of them though…. http://www.bostonglobe.com/ideas/2014/09/13/your-body-mostly-microbes- actually-have-idea/qlcoKot4wfUXecjeVaFKFN/story.html
  • 5. Microbiome research is undergoing a crisis Please don’t make things worse  • Crisis 1 – The correlation/causation fallacy. For example…. – Patients with type II diabetes have a different gut microbiome compared to healthy patients – Does the microbiome cause diabetes? – Or do they have a different microbiome because they have diabetes? (therefore different diet) • Crisis 2 – A lot of people want to do it, but don’t know how – Errors, bad experimental design, incorrect conclusions
  • 6. What is the microbiome? “the ecological community of commensal, symbiotic, and pathogenic microorganisms that literally share our body space” - Joshua Lederberg Note: includes funghi, protists, archaea, bacteria, algae, viruses etc etc etc (whisper it: most “microbiome” studies only look at bacteria/archaea)
  • 7. How do we study the microbiome? • Marker gene vs shotgun metagenomics • Marker gene – 16S / 18S / ITS – Amplify this and compare • Metagenomics – Extract all DNA – Fragment, sequence, interpret • In theory, the latter least biased*
  • 8. 16S studies are not metagenomics http://phylogenomics.blogspot.co.uk/2012/08/referring-to-16s-surveys-as.html, http://biomickwatson.wordpress.com/2014/01/12/youre-probably-not-doing-metagenomics/
  • 9. 16S • Prokaryotic rRNA subunit • Present in all (?) bacterial/archaeal genomes, contains constant and hypervariable regions • Hypervariable regions may give “species specific” signatures
  • 10. 16S process • Current sequencing technologies can’t sequence whole thing • Design primers in constant regions and PCR • Amplify 1 or more hypervariable regions • Cluster similar sequences into OTUs • Compare to 16S database and assign phylogenetic group • Compare abundance across sample groups (QIIME, Mothur)
  • 11. 16S problems • Some genomes have multiple copies of the 16S gene • The constant regions aren’t constant – Design degenerate primers – Some primers pick up certain groups better than others – A perfect match primer will amplify better than one containing mis-matches • The abundances from 16S are wrong, we simply hope that they are consistently wrong across samples • Absence really difficult to prove/wrong to assume • Chimeras, PCR artefacts consisting of 16S gene fragments from two different molecules
  • 12. • Ashelford KE, Chuzhanova NA, Fry JC, Jones AJ, Weightman AJ. At least 1 in 20 16S rRNA sequence records currently held in public repositories is estimated to contain substantial anomalies. Appl Environ Microbiol. 2005 71(12):7724-36.
  • 15. Technology Advantages Disadvantages Output per run Illumina Highly accurate; cheap; Sequencing: what’s on the market? industry leader; multiple platforms Slower than Ion; short reads; HiSeq X Ten: 18Tb HiSeq X: 1.8Tb 2500:HO 600Gb -> 1Tb 2500:RO: 180Gb NextSeq: 140Gb MiSeq: 25Gb Ion Torrent Fast; cheap machine Very poor on homopolymers; doesn’t match Illumina on throughput PGM: 2Gb Proton P1: 10Gb Proton P2: 30Gb PacBio Long reads; single molecule High error rate, needs correction; low throughput; expensive machine 300-500Mb Oxford Nanopore MinION Long reads; single molecule; cheap; portable High error rate; unknown quantity Unknown Complete Genomics Highly accurate; cheap Limited to human; black box Unknown; human genomes can be purchased
  • 16. Illumina read lengths • HiSeq X Ten (Human only): 100PE • HiSeq 2500: V4 125PE, V3R 150PE, V3H 100PE • NextSeq: 150PE • MiSeq: V2: 250PE, V3 300PE
  • 17. 16S sequencing strategy? • Platform: MiSeq • Theoretically: – 2x150bp can sequence ~180bp amplicon – 2x250bp can sequence ~480bp amplicon – 2x300bp can sequence ~580bp amplicon
  • 18. Important paper • Amongst other things, sequenced a mock community with different sequencing and bioinformatics strategies • Kozich JJ, Westcott SL, Baxter NT, Highlander SK, Schloss PD. Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform. Appl Environ Microbiol. 2013 S79(17):5112- 20.
  • 19. • Three 16S regions sequenced using 2x250bp – V4 (~250 bp), V34 (430bp), and V45 regions (~375 bp) – In the Mock community, there should be 20 OTUs
  • 20. 16S sequencing strategy? • The only strategy that got close to the correct result is complete overlap of 2x250bp MiSeq reads
  • 22. Shotgun metagenomics • Take ecosystem, extract all DNA and sequence it • Should be unbiased, right?... Right? • (NB: issues on the next few slides are also issues for marker gene studies)
  • 23. Extraction protocol “we found that each DNA extraction method resulted in unique community patterns” “We observed significant differences in distribution of bacterial taxa depending on the method.”
  • 24. Storage “Samples frozen with and without glycerol as cryoprotectant indicated a major loss of Bacteroidetes in unprotected samples”
  • 25. • In the chicken caecum, bacteroidetes dominate, followed by firmicutes: • Nordentoft S et al (2011) The influence of the cage system and colonisation of Salmonella Enteritidis on the microbial gut flora of laying hens studied by T-RFLP and 454 pyrosequencing. BMC Microbiol. 11:187
  • 26. • In the chicken caecum, firmicutes dominate, few proteobacteria, no bacteroidetes • Danzeisen JL et al (2011). Modulations of the chicken cecal microbiome and metagenome in response to anticoccidial and growth promoter treatment. PLOS ONE. 6(11):e27949.
  • 27. • Did I mention that microbiome research is undergoing a crisis?  • It gets worse…..
  • 28. Contamination • Sequenced a pure culture of Salmonella bongori • Extracted DNA using different kits • Did serial dilutions of the pure culture to assess impact of contaminating species
  • 29.
  • 30. The kits • FastDNA Spin Kit For Soil (FP), MoBio UltraClean Microbial DNA Isolation Kit (MB), QIAmp DNA Stool Mini Kit (QIA) and PSP Spin Stool DNA Plus kit (PSP) FP had a stable kit profile dominated by Burkholderia, PSP was dominated by Bradyrhizobium, while the QIA kit had the most complex mix of bacterial DNA. Bradyrhizobiaceae, Burkholderiaceae, Chitinophagaceae, Comomonadaceae, Propionibacteriaceae and Pseudomonadaceae were present in at least three quarters of the dilutions from PSP, FP and QIA kits. However, relative abundances of taxa at the Family level varied according to kit: FP was marked by Burkholderiaceae and Enterobacteriaceae, PSP was marked by Bradyrhizobiaceae and Chitinophagaceae. The contamination in the QIA kit was relatively diverse in comparison to the other kits, and included higher proportions of Aerococcaceae, Bacillaceae, Flavobacteriaceae, Microbacteriaceae, Paenibacillaceae, Planctomycetaceae and Polyangiaceae than the other kits. Kit MB did not have a distinct contaminant profile and varied from dilution to dilution due to paucity of reads
  • 31. “These metagenomic results therefore clearly show that contamination becomes the dominant feature of sequence data from low biomass samples, and that the kit used to extract DNA can have an impact on the observed bacterial diversity”
  • 32. From Salter et al: “Tellingly, Laurence et al [1] recently demonstrated with an in silico analysis that Bradyrhizobium is a common contaminant of sequencing datasets including the 1000 Human Genome Project” 1. Laurence M, Hatzis C, Brash DE. Common contaminants in next-generation sequencing that hinder discovery of low-abundance microbes. PLoS One. 2014 9(5):e97876. Adenoids are at the back of the nasal cavity Bradyrhizobium is a soil bacterium
  • 35. Shotgun metagenomics • Can assemble – MetaVelvet, Meta-IDBA, Ray Meta, MetAMOS – Different techniques for partitioning • Coverage, sequence composition, connectivity • MetaWatt, CONCOCT – Predict genes: Glimmer-MG, FragGenScan • Use reference – Kraken, PhyloSift, MetaPhlAn, HUMAnN
  • 36. All-in-one solution • EBI Metagenomics • Hunter S, et al. EBI metagenomics--a new resource for the analysis and archiving of metagenomic data. Nucleic Acids Res. 2014 42(Database issue):D600-6.
  • 38.
  • 39. Conclusions • I love microbiome research (honestly!) • Really, incredibly exciting… but…. • Every step counts • Be very careful, at all stages • 16S – cheap, biased but effective • WGS – expensive, information rich, less biased • Beware contamination, include controls