SlideShare a Scribd company logo
1 of 74
Download to read offline
Hospital Universitari Vall d’Hebron
Institut de Recerca - VHIR
Institut d’Investigació Sanitària de l’Instituto de Salud Carlos III (ISCIII)
Bioinformatics for
Biological Researchers
http://eib.stat.ub.edu/2014BBR
Ferran Briansó
ferran.brianso@vhir.org
28/05/2014
INTRODUCTION TO METAGENOMICSINTRODUCTION TO METAGENOMICS
1. Introduction
2. Applications
3. Basic Concepts
4. Approaches & Workflows
1. Whole Genome Shotgun
2. 16S/ITS Community Surveys
●
Analysis Tools
1. MEGAN
2. Mothur
3. Qiime
4. Axiome & CloVR
5. MG-RAST
1. More resources
5
1
2
3
4
5
PRESENTATION OUTLINE
6
1 INTRODUCTIONINTRODUCTION
Introduction | Metagenomics definition1
4
First use of the term metagenome, referencing the idea that a collection of
genes sequenced from the environment could be analyzed in a way analogous
to the study of a single genome.
Handelsman, J.; Rondon, M. R.; Brady, S. F.; Clardy, J.; Goodman, R. M. (1998).
"Molecular biological access to the chemistry of unknown soil microbes: A new
frontier for natural products".
Chemistry & Biology 5 (10): R245–R249. doi:10.1016/S1074-5521(98)90108-9.
PMID 9818143
1
First use of the term metagenome, referencing the idea that a collection of
genes sequenced from the environment could be analyzed in a way analogous
to the study of a single genome.
Handelsman, J.; Rondon, M. R.; Brady, S. F.; Clardy, J.; Goodman, R. M. (1998).
"Molecular biological access to the chemistry of unknown soil microbes: A new
frontier for natural products".
Chemistry & Biology 5 (10): R245–R249. doi:10.1016/S1074-5521(98)90108-9.
PMID 9818143
Chen, K.; Pachter, L. (2005).
"Bioinformatics for Whole-Genome Shotgun Sequencing of Microbial Communities".
PLoS Computational Biology 1 (2): e24. doi:10.1371/journal.pcbi.0010024
Current definition:
“The application of modern genomics techniques to the
study of communities of microbial organisms directly in
their natural environments, bypassing the need for
isolation and lab cultivation of individual species.”
5
Introduction | Metagenomics definition
1
6
Introduction | Historical context
1
Source:
7
Introduction | Historical context
1
Source:
http://howcoolismyresear.ch/#metagenomics
8
Introduction | Historical context
1
9
Introduction | Basic purpose
2 APPLICATIONSAPPLICATIONS
2
11
Applications | What metagenomics can do
● Global Impacts. The role of microbes is critical in maintaining atmospheric
balances, as they are
● the main photosynthetic agents
● responsible for the generation and consumption of greenhouse
gases
● involved at all levels in ecosystems and trophic chains
2
12
Applications | What metagenomics can do
● Global Impacts. The role of microbes is critical in maintaining atmospheric
balances, as they are
● the main photosynthetic agents
● responsible for the generation and consumption of greenhouse
gases
● involved at all levels in ecosystems and trophic chains
● Bioremediation. Cleaning up environmental contamination, such as
● the waste from water treatment facilities
● gasoline leaks on lands or oil spills in the oceans
● toxic chemicals
2
13
Applications | What metagenomics can do
● Bioenergy. We are harnessing microbial power in order to produce
● ethanol (from cellulose), hydrogen, methane, butanol...
● Smart Farming. Microbes help our crops by
● the “supressive soil” phenomenon
(buffer effect against disease-causing organisms)
● soil enrichment and regeneration
2
14
Applications | What metagenomics can do
● Bioenergy. We are harnessing microbial power in order to produce
● ethanol (from cellulose), hydrogen, methane, butanol...
● Smart Farming. Microbes help our crops by
● the “supressive soil” phenomenon
(buffer effect against disease-causing organisms)
● soil enrichment and regeneration
● The World Within. Studying the human microbiome may lead
to valuable new tools and guidelines in
● human and animal nutrition
● better understanding of complex diseases
(obesity, cancer, asthma...)
● drug discovery
● preventative medicine
Grice E.A. & Segre J.A. (2012) The Human Microbiome: Our Second Genome,
Annu. Rev. Genomics Human Genet. 13, 151-170
2
15
Applications | Mapping the Human Microbiome
3 BASIC CONCEPTSBASIC CONCEPTS
3
17
Concepts | Trimming
● Trimming: is the pre-processing step of cleaning sequence data (primers, multiplexing barcodes...) from
automated DNA sequencers prior to sequence assembly and other downstream uses.
18
● Trimming: is the pre-processing step of cleaning sequence data (primers, multiplexing barcodes...) from
automated DNA sequencers prior to sequence assembly and other downstream uses.
● Binning is the process of grouping reads or contigs and assigning them to operational taxonomic units (OTUs).
● OTU (Operational Taxonomic Unit): Taxonomic level of sampling selected by the user to be used in a study.
Typically using a percent sequence similarity threshold for classifying microbes within the same, or different,
OTUs.
3 Concepts | Binning, OTUs
http://shuixia100.weebly.com/1/post/2011/12/mothur-tutorial-1.html / Wikipedia: Biological classification
19
● Trimming: is the pre-processing step of cleaning sequence data (primers, multiplexing barcodes...) from
automated DNA sequencers prior to sequence assembly and other downstream uses.
● Binning is the process of grouping reads or contigs and assigning them to operational taxonomic units (OTUs).
● OTU (Operational Taxonomic Unit): Taxonomic level of sampling selected by the user to be used in a study.
Typically using a percent sequence similarity threshold for classifying microbes within the same, or different,
OTUs.
3 Concepts | Binning, OTUs
http://shuixia100.weebly.com/1/post/2011/12/mothur-tutorial-1.html / Wikipedia: Biological classification
20
● Trimming: is the pre-processing step of cleaning sequence data (primers, multiplexing barcodes...) from
automated DNA sequencers prior to sequence assembly and other downstream uses.
● Binning is the process of grouping reads or contigs and assigning them to operational taxonomic units (OTUs).
● OTU (Operational Taxonomic Unit): Taxonomic level of sampling selected by the user to be used in a study.
Typically using a percent sequence similarity threshold for classifying microbes within the same, or different,
OTUs.
● Chimeras: Artificial sequences formed during PCR amplification. The majority of them are believed to arise
from incomplete extension. During subsequent cycles of PCR, a partially extended strand can bind to a
template derived from a different but similar sequence. This then acts as a primer that is extended to form a
chimeric sequence (Smith et al. 2010, Thompson et al., 2002, Meyerhans et al., 1990, Judo et al., 1998,
Odelberg, 1995). A chimeric template is created during one round, then amplified by subsequent rounds to
produce chimeric amplicons that are difficult to distinguish from amplicons derived from a single biological
sequence.
3 Concepts | Chimeras
Hass B.J. et al (2011) Chimeric 16S rRNA sequence formation and detection in
Sanger and 454-pyrosequenced PCR amplicons, Genome Res. 21: 494-504.
3
21
● Trimming: is the pre-processing step of cleaning sequence data (primers, multiplexing barcodes...) from
automated DNA sequencers prior to sequence assembly and other downstream uses.
● Binning is the process of grouping reads or contigs and assigning them to operational taxonomic units (OTUs).
● OTU (Operational Taxonomic Unit): Taxonomic level of sampling selected by the user to be used in a study.
Typically using a percent sequence similarity threshold for classifying microbes within the same, or different,
OTUs.
● Chimeras: Artificial sequences formed during PCR amplification. The majority of them are believed to arise
from incomplete extension. During subsequent cycles of PCR, a partially extended strand can bind to a
template derived from a different but similar sequence. This then acts as a primer that is extended to form a
chimeric sequence (Smith et al. 2010, Thompson et al., 2002, Meyerhans et al., 1990, Judo et al., 1998,
Odelberg, 1995). A chimeric template is created during one round, then amplified by subsequent rounds to
produce chimeric amplicons that are difficult to distinguish from amplicons derived from a single biological
sequence.
● Alpha diversity: the diversity within a particular area or ecosystem; expressed by the number of species (i.e.,
species richness) in that ecosystem, or by one or more diversity indices.
● Beta diversity: a comparison of of diversity between ecosystems, usually measured as the amount of species
change between the ecosystems.
● Gamma diversity: a measure of the overall diversity within a large region. Geographic-scale species diversity
according to Hunter (2002:448).
Concepts | Diversities
Zinger L. et al. (2012) Two decades of describing the unseen majority of
aquatic microbial diversity, Molecular Ecology 21, 1878–1896.
3
22
● Trimming: is the pre-processing step of cleaning sequence data (primers, multiplexing barcodes...) from
automated DNA sequencers prior to sequence assembly and other downstream uses.
● Binning is the process of grouping reads or contigs and assigning them to operational taxonomic units (OTUs).
● OTU (Operational Taxonomic Unit): Taxonomic level of sampling selected by the user to be used in a study.
Typically using a percent sequence similarity threshold for classifying microbes within the same, or different,
OTUs.
● Chimeras: Artificial sequences formed during PCR amplification. The majority of them are believed to arise
from incomplete extension. During subsequent cycles of PCR, a partially extended strand can bind to a
template derived from a different but similar sequence. This then acts as a primer that is extended to form a
chimeric sequence (Smith et al. 2010, Thompson et al., 2002, Meyerhans et al., 1990, Judo et al., 1998,
Odelberg, 1995). A chimeric template is created during one round, then amplified by subsequent rounds to
produce chimeric amplicons that are difficult to distinguish from amplicons derived from a single biological
sequence.
● Alpha diversity: the diversity within a particular area or ecosystem; expressed by the number of species (i.e.,
species richness) in that ecosystem, or by one or more diversity indices.
● Beta diversity: a comparison of of diversity between ecosystems, usually measured as the amount of species
change between the ecosystems.
● Gamma diversity: a measure of the overall diversity within a large region. Geographic-scale species diversity
according to Hunter (2002:448).
Concepts | Diversity measurement issues
Zhou J. et al. (2010) Random Sampling Process Leads to Overestimation of β-Diversity
of Microbial Communities, mBio 4(3):e00324-13. doi:10.1128/mBio.00324-13.
Diversity can virtually never
be measured directly,
rather it must be estimated
or inferred from available
data. Our estimates are
anchored in the sample
itself.
Magurran (Ed.), Biological Diversity,
Oxford U.P. 2010. Ch. 16 Microbial
Diversity and Ecology
3
23
● Trimming: is the pre-processing step of cleaning sequence data (primers, multiplexing barcodes...) from
automated DNA sequencers prior to sequence assembly and other downstream uses.
● Binning is the process of grouping reads or contigs and assigning them to operational taxonomic units (OTUs).
● OTU (Operational Taxonomic Unit): Taxonomic level of sampling selected by the user to be used in a study.
Typically using a percent sequence similarity threshold for classifying microbes within the same, or different,
OTUs.
● Chimeras: Artificial sequences formed during PCR amplification. The majority of them are believed to arise
from incomplete extension. During subsequent cycles of PCR, a partially extended strand can bind to a
template derived from a different but similar sequence. This then acts as a primer that is extended to form a
chimeric sequence (Smith et al. 2010, Thompson et al., 2002, Meyerhans et al., 1990, Judo et al., 1998,
Odelberg, 1995). A chimeric template is created during one round, then amplified by subsequent rounds to
produce chimeric amplicons that are difficult to distinguish from amplicons derived from a single biological
sequence.
● Alpha diversity: the diversity within a particular area or ecosystem; expressed by the number of species (i.e.,
species richness) in that ecosystem, or by one or more diversity indices.
● Beta diversity: a comparison of of diversity between ecosystems, usually measured as the amount of species
change between the ecosystems.
● Gamma diversity: a measure of the overall diversity within a large region. Geographic-scale species diversity
according to Hunter (2002:448).
● Rarefaction allows the calculation of species richness for a given number of individual samples, based on the
construction of so-called rarefaction curves. This curve is a plot of the number of species as a function of the
number of samples.
Concepts | Rarefaction
most or all species
have been sampled
species rich habitat, only a small
fraction has been sampled
this habitat has not been
exhaustively sampled
Wooley J.C. et al. (2010) A Primer on Metagenomics, PLoS Computational Biology 6 (2) e1000667
3
24
Concepts | Diversity indices (α diversity)
Mozzarella project, Michele Iacono http://www.science.gov/topicpages/w/water+buffalo+mozzarella.html
Other indices:
berger_parker_d, brillouin_d, dominance,
doubles, esty_ci, fisher_alpha, gini_index,
goods_coverage, margalef, mcintosh_d,
mcintosh_e, menhinick,osd, simpson_reciprocal,
robbins, singles, strong...
3
25
Concepts | Compositional similarity (β diversity)
Mozzarella project, Michele Iacono http://www.science.gov/topicpages/w/water+buffalo+mozzarella.html
3
26
Concepts | Compositional similarity (β diversity)
Mozzarella project, Michele Iacono http://www.science.gov/topicpages/w/water+buffalo+mozzarella.html
3
27
Concepts | Compositional similarity (β diversity)
Mozzarella project, Michele Iacono http://www.science.gov/topicpages/w/water+buffalo+mozzarella.html
3
28
Concepts | Compositional similarity (β diversity)
Heat map
Mozzarella project, Michele Iacono http://www.science.gov/topicpages/w/water+buffalo+mozzarella.html
3
29
● Trimming: is the pre-processing step of cleaning sequence data (primers, multiplexing barcodes...) from
automated DNA sequencers prior to sequence assembly and other downstream uses.
● Binning is the process of grouping reads or contigs and assigning them to operational taxonomic units (OTUs).
● OTU (Operational Taxonomic Unit): Taxonomic level of sampling selected by the user to be used in a study.
Typically using a percent sequence similarity threshold for classifying microbes within the same, or different,
OTUs.
● Chimeras: Artificial sequences formed during PCR amplification. The majority of them are believed to arise
from incomplete extension. During subsequent cycles of PCR, a partially extended strand can bind to a
template derived from a different but similar sequence. This then acts as a primer that is extended to form a
chimeric sequence (Smith et al. 2010, Thompson et al., 2002, Meyerhans et al., 1990, Judo et al., 1998,
Odelberg, 1995). A chimeric template is created during one round, then amplified by subsequent rounds to
produce chimeric amplicons that are difficult to distinguish from amplicons derived from a single biological
sequence.
● Alpha diversity: the diversity within a particular area or ecosystem; expressed by the number of species (i.e.,
species richness) in that ecosystem, or by one or more diversity indices.
● Beta diversity: a comparison of of diversity between ecosystems, usually measured as the amount of species
change between the ecosystems.
● Gamma diversity: a measure of the overall diversity within a large region. Geographic-scale species diversity
according to Hunter (2002:448).
● Rarefaction allows the calculation of species richness for a given number of individual samples, based on the
construction of so-called rarefaction curves. This curve is a plot of the number of species as a function of the
number of samples.
● Metadata, reads, fasta/fastq files, counts, OTU tables/networks, .biom files, PCoA, p-values, diversity
metrics, robustness, scores, jackniffed, clustering, UPGMA, trees, bootstrap, Bi-Plots, ...
Concepts | Summary
4 APPROACHES & WORKFLOWSAPPROACHES & WORKFLOWS
4
31
Workflows | Microbial ecology approaches
4
32
Grice E.A. & Segre J.A. (2012) The Human Microbiome: Our Second Genome,
Annu. Rev. Genomics Human Genet. 13, 151-170
Workflows | Overview
Sample collection
DNA extraction
and preparation
Sequencing
Analysis
4
33
Grice E.A. & Segre J.A. (2012) The Human Microbiome: Our Second Genome,
Annu. Rev. Genomics Human Genet. 13, 151-170
Workflows | Overview
Sample collection
DNA extraction
and preparation
Sequencing
Analysis
Experimental design
Sample Quality Controls
Sequence Quality Controls
Biological interpretation
4.1 WGS MetagenomicsWGS Metagenomics
4
35
Workflows | Whole Genome Shotgun (WGS)
Sven-Eric Schelhorn https://bioinf.mpi-inf.mpg.de/homepage/research.php?&account=sven
4
36
Workflows | Whole Genome Shotgun (WGS)
Surya Saha & Magdalen Lindeberg http://www.slideshare.net/suryasaha/surya-saha-metagenomics-tools
4
37
Workflows | Whole Genome Shotgun (WGS)
Surya Saha & Magdalen Lindeberg http://www.slideshare.net/suryasaha/surya-saha-metagenomics-tools
4
38
Workflows | Whole Genome Shotgun (WGS)
Sven-Eric Schelhorn https://bioinf.mpi-inf.mpg.de/homepage/research.php?&account=sven
4
39
Workflows | Whole Genome Shotgun (WGS)
Surya Saha & Magdalen Lindeberg http://www.slideshare.net/suryasaha/surya-saha-metagenomics-tools
4
40
Workflows | Whole Genome Shotgun (WGS)
Surya Saha & Magdalen Lindeberg http://www.slideshare.net/suryasaha/surya-saha-metagenomics-tools
4.2 16S/ITS Metagenomics16S/ITS Metagenomics
4
42
Workflows | 16S/ITS Community Surveys
Surya Saha & Magdalen Lindeberg http://www.slideshare.net/suryasaha/surya-saha-metagenomics-tools
4
43
Workflows | 16S/ITS Community Surveys
Surya Saha & Magdalen Lindeberg http://www.slideshare.net/suryasaha/surya-saha-metagenomics-tools
4
44
Workflows | 16S/ITS Community Surveys
Surya Saha & Magdalen Lindeberg http://www.slideshare.net/suryasaha/surya-saha-metagenomics-tools
4
45
Workflows | 16S/ITS Community Surveys
Surya Saha & Magdalen Lindeberg http://www.slideshare.net/suryasaha/surya-saha-metagenomics-tools
4
46
Workflows | 16S/ITS Community Surveys
Surya Saha & Magdalen Lindeberg http://www.slideshare.net/suryasaha/surya-saha-metagenomics-tools
4
47
Workflows | 16S/ITS Community Surveys
Surya Saha & Magdalen Lindeberg http://www.slideshare.net/suryasaha/surya-saha-metagenomics-tools
5 METAGENOMICS TOOLSMETAGENOMICS TOOLS
5
49
Tools | “The great quest”
5
50
Tools | “The great quest”
5
51
Tools | “The great quest”
5
52
Tools | “The great quest”
5
53
Tools | MEGAN
http://ab.inf.uni-tuebingen.de/software/megan5/
5
54
Tools | MEGAN
http://ab.inf.uni-tuebingen.de/software/megan5/
5
55
Tools | MEGAN
http://ab.inf.uni-tuebingen.de/software/megan5/
5
56
Tools | Mothur
http://www.mothur.org/wiki/Main_Page / Kevin R. Theis (Michigan State University)
5
57
Tools | Mothur
http://www.mothur.org/wiki/Main_Page / Kevin R. Theis (Michigan State University)
5
58
Tools | Mothur
http://www.mothur.org/wiki/Main_Page / Kevin R. Theis (Michigan State University)
5
59
Tools | Qiime
5
60
Tools | Qiime
5
61
Tools | Qiime
5
62
Tools | Axiome
http://neufeld.github.io/AXIOME
5
63
Tools | Axiome
http://neufeld.github.io/AXIOME
5
64
Tools | Axiome
http://neufeld.github.io/AXIOME
5
65
Tools | CloVR
http://clovr.org
5
66
Tools | CloVR
http://clovr.org
5
67
Tools | CloVR
http://clovr.org
5
68
Tools | MG-RAST
http://http://metagenomics.anl.gov/
5
69
Tools | MG-RAST
http://http://metagenomics.anl.gov/
5
70
Tools | MG-RAST
http://http://metagenomics.anl.gov/
6 MORE RESOURCESMORE RESOURCES
6
72
More resources, courses...
Resources & Projects:
MEGAN DB http://www.megan-db.org/megan-db/ (MEtaGenomics ANalysis)
CAMERA http://camera.calit2.net/ (community Cyberinfrastructure for Advanced Microbial Ecology Research and Analysis)
MG-RAST Search http://metagenomics.anl.gov/metagenomics.cgi?page=MetagenomeSearch
IMG http://img.jgi.doe.gov/ (Integrated Microbial Genomes and metagenomes)
MetaBioME http://metasystems.riken.jp/metabiome/ (Comprehensive Metagenomic BioMining Engine)
BOLD http://www.boldsystems.org/ (Barcoding Of Live Database)
GOS Expedition http://www.jcvi.org/cms/research/projects/gos/overview (Global Ocean Sampling)
...
6
73
More resources, courses...
Courses:
EBI http://www.ebi.ac.uk/training/course/metagenomics2014
EMBO http://cymeandcystidium.com/?tag=metagenomics
Coursera https://www.coursera.org/course/genomescience
... and a lot of seminars and workshops everywhere
Hospital Universitari Vall d’Hebron
Institut de Recerca - VHIR
Institut d’Investigació Sanitària de l’Instituto de Salud Carlos III (ISCIII)
Thanks for your attentionThanks for your attention
and also thanks to
Josep Gregori (VHIR, ROCHE)
for providing some materials
INTRODUCTION TO METAGENOMICSINTRODUCTION TO METAGENOMICS
Bioinformatics for
Biological Researchers
http://eib.stat.ub.edu/2014BBR
Ferran Briansó
ferran.brianso@vhir.org
28/05/2014

More Related Content

What's hot

What's hot (20)

metagenomics
metagenomicsmetagenomics
metagenomics
 
Metagenomic analysis
Metagenomic analysisMetagenomic analysis
Metagenomic analysis
 
Genomic databases
Genomic databasesGenomic databases
Genomic databases
 
Metagenomic
MetagenomicMetagenomic
Metagenomic
 
Metagenomics
MetagenomicsMetagenomics
Metagenomics
 
Msa
MsaMsa
Msa
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
 
Structural genomics
Structural genomicsStructural genomics
Structural genomics
 
Blast
BlastBlast
Blast
 
Introduction to 16S Microbiome Analysis
Introduction to 16S Microbiome AnalysisIntroduction to 16S Microbiome Analysis
Introduction to 16S Microbiome Analysis
 
Gene prediction method
Gene prediction method Gene prediction method
Gene prediction method
 
Structural genomics
Structural genomicsStructural genomics
Structural genomics
 
NCBI
NCBINCBI
NCBI
 
Tools of bioinforformatics by kk
Tools of bioinforformatics by kkTools of bioinforformatics by kk
Tools of bioinforformatics by kk
 
RNA-Seq
RNA-SeqRNA-Seq
RNA-Seq
 
Proteomics
ProteomicsProteomics
Proteomics
 
GENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSGENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICS
 
Protein micro array
Protein micro arrayProtein micro array
Protein micro array
 
Genome annotation
Genome annotationGenome annotation
Genome annotation
 
Transcriptomics approaches
Transcriptomics approachesTranscriptomics approaches
Transcriptomics approaches
 

Viewers also liked (6)

Metagenómica biodiversidad y nuevos productos biotecnológicos.
Metagenómica biodiversidad y nuevos productos biotecnológicos.Metagenómica biodiversidad y nuevos productos biotecnológicos.
Metagenómica biodiversidad y nuevos productos biotecnológicos.
 
Vocabulario aparato digestivo
Vocabulario aparato digestivoVocabulario aparato digestivo
Vocabulario aparato digestivo
 
PLuginbuhl-SIM-2008
PLuginbuhl-SIM-2008PLuginbuhl-SIM-2008
PLuginbuhl-SIM-2008
 
Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
 
Metagenomics
MetagenomicsMetagenomics
Metagenomics
 
Metagenómica y sus Aplicaciones Industriales
Metagenómica y sus Aplicaciones IndustrialesMetagenómica y sus Aplicaciones Industriales
Metagenómica y sus Aplicaciones Industriales
 

Similar to Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformatics for Biological Researchers Course - CSIC, Blanes)

617....sjuwbwjisjnslosoanwbwbdhidje.pptx
617....sjuwbwjisjnslosoanwbwbdhidje.pptx617....sjuwbwjisjnslosoanwbwbdhidje.pptx
617....sjuwbwjisjnslosoanwbwbdhidje.pptx
AroojSheikh12
 
Deep learning methods in metagenomics: a review
Deep learning methods in metagenomics: a reviewDeep learning methods in metagenomics: a review
Deep learning methods in metagenomics: a review
ssuser6fc73c
 
Genetics of ideal traits in NE.pptx
Genetics of ideal traits in NE.pptxGenetics of ideal traits in NE.pptx
Genetics of ideal traits in NE.pptx
SimranBhatia71
 

Similar to Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformatics for Biological Researchers Course - CSIC, Blanes) (20)

A comparative study using different measure of filteration
A comparative study using different measure of filterationA comparative study using different measure of filteration
A comparative study using different measure of filteration
 
617....sjuwbwjisjnslosoanwbwbdhidje.pptx
617....sjuwbwjisjnslosoanwbwbdhidje.pptx617....sjuwbwjisjnslosoanwbwbdhidje.pptx
617....sjuwbwjisjnslosoanwbwbdhidje.pptx
 
Shriram belge (exome sequencing) 27 2003
Shriram belge (exome sequencing) 27  2003Shriram belge (exome sequencing) 27  2003
Shriram belge (exome sequencing) 27 2003
 
Deep learning methods in metagenomics: a review
Deep learning methods in metagenomics: a reviewDeep learning methods in metagenomics: a review
Deep learning methods in metagenomics: a review
 
rheumatoid arthritis
rheumatoid arthritisrheumatoid arthritis
rheumatoid arthritis
 
Bioinformatics .pptx
Bioinformatics .pptxBioinformatics .pptx
Bioinformatics .pptx
 
introduction of Bioinformatics
introduction of Bioinformaticsintroduction of Bioinformatics
introduction of Bioinformatics
 
Systems biology
Systems biologySystems biology
Systems biology
 
Applications of bioinformatics, main by kk sahu
Applications of bioinformatics, main by kk sahuApplications of bioinformatics, main by kk sahu
Applications of bioinformatics, main by kk sahu
 
A Review of Various Methods Used in the Analysis of Functional Gene Expressio...
A Review of Various Methods Used in the Analysis of Functional Gene Expressio...A Review of Various Methods Used in the Analysis of Functional Gene Expressio...
A Review of Various Methods Used in the Analysis of Functional Gene Expressio...
 
metagenomics.pptx
metagenomics.pptxmetagenomics.pptx
metagenomics.pptx
 
Bioinformatics, its application main
Bioinformatics, its application mainBioinformatics, its application main
Bioinformatics, its application main
 
31961.ppt
31961.ppt31961.ppt
31961.ppt
 
A Systems Biology Approach to Natural Products Research
A Systems Biology Approach to Natural Products ResearchA Systems Biology Approach to Natural Products Research
A Systems Biology Approach to Natural Products Research
 
Genetics of ideal traits in NE.pptx
Genetics of ideal traits in NE.pptxGenetics of ideal traits in NE.pptx
Genetics of ideal traits in NE.pptx
 
Gdt 2-126
Gdt 2-126Gdt 2-126
Gdt 2-126
 
Gdt 2-126 (1)
Gdt 2-126 (1)Gdt 2-126 (1)
Gdt 2-126 (1)
 
Bioinformatics issues and challanges presentation at s p college
Bioinformatics  issues and challanges  presentation at s p collegeBioinformatics  issues and challanges  presentation at s p college
Bioinformatics issues and challanges presentation at s p college
 
Syngulon - Selection technology July 2022.pdf
Syngulon - Selection technology July 2022.pdfSyngulon - Selection technology July 2022.pdf
Syngulon - Selection technology July 2022.pdf
 
Teresa Coque Hospital Universitario Ramón y Cajal.
Teresa Coque  Hospital Universitario Ramón y Cajal. Teresa Coque  Hospital Universitario Ramón y Cajal.
Teresa Coque Hospital Universitario Ramón y Cajal.
 

More from VHIR Vall d’Hebron Institut de Recerca

Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génica
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génicaCurso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génica
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génica
VHIR Vall d’Hebron Institut de Recerca
 
Curso de Genómica - UAT (VHIR) 2012 - Tecnologías de Ultrasecuenciación y de ...
Curso de Genómica - UAT (VHIR) 2012 - Tecnologías de Ultrasecuenciación y de ...Curso de Genómica - UAT (VHIR) 2012 - Tecnologías de Ultrasecuenciación y de ...
Curso de Genómica - UAT (VHIR) 2012 - Tecnologías de Ultrasecuenciación y de ...
VHIR Vall d’Hebron Institut de Recerca
 
Curso de Genómica - UAT (VHIR) 2012 - Aplicaciones de las tecnologías de alto...
Curso de Genómica - UAT (VHIR) 2012 - Aplicaciones de las tecnologías de alto...Curso de Genómica - UAT (VHIR) 2012 - Aplicaciones de las tecnologías de alto...
Curso de Genómica - UAT (VHIR) 2012 - Aplicaciones de las tecnologías de alto...
VHIR Vall d’Hebron Institut de Recerca
 

More from VHIR Vall d’Hebron Institut de Recerca (20)

Introduction to Functional Analysis with IPA (UEB-UAT Bioinformatics Course -...
Introduction to Functional Analysis with IPA (UEB-UAT Bioinformatics Course -...Introduction to Functional Analysis with IPA (UEB-UAT Bioinformatics Course -...
Introduction to Functional Analysis with IPA (UEB-UAT Bioinformatics Course -...
 
Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...
Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...
Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...
 
Brief Overview to Amplicon Variant Analysis (UEB-UAT Bioinformatics Course - ...
Brief Overview to Amplicon Variant Analysis (UEB-UAT Bioinformatics Course - ...Brief Overview to Amplicon Variant Analysis (UEB-UAT Bioinformatics Course - ...
Brief Overview to Amplicon Variant Analysis (UEB-UAT Bioinformatics Course - ...
 
Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...
Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...
Introduction to NGS Variant Calling Analysis (UEB-UAT Bioinformatics Course -...
 
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
 
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
 
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
NGS Applications I (UEB-UAT Bioinformatics Course - Session 2.1.2 - VHIR, Bar...
 
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
 
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
Storing and Accessing Information. Databases and Queries (UEB-UAT Bioinformat...
 
Introduction to Bioinformatics (UEB-UAT Bioinformatics Course - Session 1.1 -...
Introduction to Bioinformatics (UEB-UAT Bioinformatics Course - Session 1.1 -...Introduction to Bioinformatics (UEB-UAT Bioinformatics Course - Session 1.1 -...
Introduction to Bioinformatics (UEB-UAT Bioinformatics Course - Session 1.1 -...
 
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
 
Information management at vhir ueb using tiki-cms
Information management at vhir ueb using tiki-cmsInformation management at vhir ueb using tiki-cms
Information management at vhir ueb using tiki-cms
 
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de RT-qPCR
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de RT-qPCRCurso de Genómica - UAT (VHIR) 2012 - Análisis de datos de RT-qPCR
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de RT-qPCR
 
Curso de Genómica - UAT (VHIR) 2012 - RT-qPCR
Curso de Genómica - UAT (VHIR) 2012 - RT-qPCRCurso de Genómica - UAT (VHIR) 2012 - RT-qPCR
Curso de Genómica - UAT (VHIR) 2012 - RT-qPCR
 
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génica
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génicaCurso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génica
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de expression génica
 
Curso de Genómica - UAT (VHIR) 2012 - Microarrays
Curso de Genómica - UAT (VHIR) 2012 - MicroarraysCurso de Genómica - UAT (VHIR) 2012 - Microarrays
Curso de Genómica - UAT (VHIR) 2012 - Microarrays
 
Curso de Genómica - UAT (VHIR) 2012 - Arrays de Proteínas Zeptosens
 Curso de Genómica - UAT (VHIR) 2012 - Arrays de Proteínas Zeptosens Curso de Genómica - UAT (VHIR) 2012 - Arrays de Proteínas Zeptosens
Curso de Genómica - UAT (VHIR) 2012 - Arrays de Proteínas Zeptosens
 
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de NGS
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de NGSCurso de Genómica - UAT (VHIR) 2012 - Análisis de datos de NGS
Curso de Genómica - UAT (VHIR) 2012 - Análisis de datos de NGS
 
Curso de Genómica - UAT (VHIR) 2012 - Tecnologías de Ultrasecuenciación y de ...
Curso de Genómica - UAT (VHIR) 2012 - Tecnologías de Ultrasecuenciación y de ...Curso de Genómica - UAT (VHIR) 2012 - Tecnologías de Ultrasecuenciación y de ...
Curso de Genómica - UAT (VHIR) 2012 - Tecnologías de Ultrasecuenciación y de ...
 
Curso de Genómica - UAT (VHIR) 2012 - Aplicaciones de las tecnologías de alto...
Curso de Genómica - UAT (VHIR) 2012 - Aplicaciones de las tecnologías de alto...Curso de Genómica - UAT (VHIR) 2012 - Aplicaciones de las tecnologías de alto...
Curso de Genómica - UAT (VHIR) 2012 - Aplicaciones de las tecnologías de alto...
 

Recently uploaded

GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
Lokesh Kothari
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Lokesh Kothari
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
Areesha Ahmad
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
PirithiRaju
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
Sérgio Sacani
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 

Recently uploaded (20)

GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
American Type Culture Collection (ATCC).pptx
American Type Culture Collection (ATCC).pptxAmerican Type Culture Collection (ATCC).pptx
American Type Culture Collection (ATCC).pptx
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 

Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformatics for Biological Researchers Course - CSIC, Blanes)

  • 1. Hospital Universitari Vall d’Hebron Institut de Recerca - VHIR Institut d’Investigació Sanitària de l’Instituto de Salud Carlos III (ISCIII) Bioinformatics for Biological Researchers http://eib.stat.ub.edu/2014BBR Ferran Briansó ferran.brianso@vhir.org 28/05/2014 INTRODUCTION TO METAGENOMICSINTRODUCTION TO METAGENOMICS
  • 2. 1. Introduction 2. Applications 3. Basic Concepts 4. Approaches & Workflows 1. Whole Genome Shotgun 2. 16S/ITS Community Surveys ● Analysis Tools 1. MEGAN 2. Mothur 3. Qiime 4. Axiome & CloVR 5. MG-RAST 1. More resources 5 1 2 3 4 5 PRESENTATION OUTLINE 6
  • 4. Introduction | Metagenomics definition1 4 First use of the term metagenome, referencing the idea that a collection of genes sequenced from the environment could be analyzed in a way analogous to the study of a single genome. Handelsman, J.; Rondon, M. R.; Brady, S. F.; Clardy, J.; Goodman, R. M. (1998). "Molecular biological access to the chemistry of unknown soil microbes: A new frontier for natural products". Chemistry & Biology 5 (10): R245–R249. doi:10.1016/S1074-5521(98)90108-9. PMID 9818143
  • 5. 1 First use of the term metagenome, referencing the idea that a collection of genes sequenced from the environment could be analyzed in a way analogous to the study of a single genome. Handelsman, J.; Rondon, M. R.; Brady, S. F.; Clardy, J.; Goodman, R. M. (1998). "Molecular biological access to the chemistry of unknown soil microbes: A new frontier for natural products". Chemistry & Biology 5 (10): R245–R249. doi:10.1016/S1074-5521(98)90108-9. PMID 9818143 Chen, K.; Pachter, L. (2005). "Bioinformatics for Whole-Genome Shotgun Sequencing of Microbial Communities". PLoS Computational Biology 1 (2): e24. doi:10.1371/journal.pcbi.0010024 Current definition: “The application of modern genomics techniques to the study of communities of microbial organisms directly in their natural environments, bypassing the need for isolation and lab cultivation of individual species.” 5 Introduction | Metagenomics definition
  • 11. 2 11 Applications | What metagenomics can do ● Global Impacts. The role of microbes is critical in maintaining atmospheric balances, as they are ● the main photosynthetic agents ● responsible for the generation and consumption of greenhouse gases ● involved at all levels in ecosystems and trophic chains
  • 12. 2 12 Applications | What metagenomics can do ● Global Impacts. The role of microbes is critical in maintaining atmospheric balances, as they are ● the main photosynthetic agents ● responsible for the generation and consumption of greenhouse gases ● involved at all levels in ecosystems and trophic chains ● Bioremediation. Cleaning up environmental contamination, such as ● the waste from water treatment facilities ● gasoline leaks on lands or oil spills in the oceans ● toxic chemicals
  • 13. 2 13 Applications | What metagenomics can do ● Bioenergy. We are harnessing microbial power in order to produce ● ethanol (from cellulose), hydrogen, methane, butanol... ● Smart Farming. Microbes help our crops by ● the “supressive soil” phenomenon (buffer effect against disease-causing organisms) ● soil enrichment and regeneration
  • 14. 2 14 Applications | What metagenomics can do ● Bioenergy. We are harnessing microbial power in order to produce ● ethanol (from cellulose), hydrogen, methane, butanol... ● Smart Farming. Microbes help our crops by ● the “supressive soil” phenomenon (buffer effect against disease-causing organisms) ● soil enrichment and regeneration ● The World Within. Studying the human microbiome may lead to valuable new tools and guidelines in ● human and animal nutrition ● better understanding of complex diseases (obesity, cancer, asthma...) ● drug discovery ● preventative medicine Grice E.A. & Segre J.A. (2012) The Human Microbiome: Our Second Genome, Annu. Rev. Genomics Human Genet. 13, 151-170
  • 15. 2 15 Applications | Mapping the Human Microbiome
  • 17. 3 17 Concepts | Trimming ● Trimming: is the pre-processing step of cleaning sequence data (primers, multiplexing barcodes...) from automated DNA sequencers prior to sequence assembly and other downstream uses.
  • 18. 18 ● Trimming: is the pre-processing step of cleaning sequence data (primers, multiplexing barcodes...) from automated DNA sequencers prior to sequence assembly and other downstream uses. ● Binning is the process of grouping reads or contigs and assigning them to operational taxonomic units (OTUs). ● OTU (Operational Taxonomic Unit): Taxonomic level of sampling selected by the user to be used in a study. Typically using a percent sequence similarity threshold for classifying microbes within the same, or different, OTUs. 3 Concepts | Binning, OTUs http://shuixia100.weebly.com/1/post/2011/12/mothur-tutorial-1.html / Wikipedia: Biological classification
  • 19. 19 ● Trimming: is the pre-processing step of cleaning sequence data (primers, multiplexing barcodes...) from automated DNA sequencers prior to sequence assembly and other downstream uses. ● Binning is the process of grouping reads or contigs and assigning them to operational taxonomic units (OTUs). ● OTU (Operational Taxonomic Unit): Taxonomic level of sampling selected by the user to be used in a study. Typically using a percent sequence similarity threshold for classifying microbes within the same, or different, OTUs. 3 Concepts | Binning, OTUs http://shuixia100.weebly.com/1/post/2011/12/mothur-tutorial-1.html / Wikipedia: Biological classification
  • 20. 20 ● Trimming: is the pre-processing step of cleaning sequence data (primers, multiplexing barcodes...) from automated DNA sequencers prior to sequence assembly and other downstream uses. ● Binning is the process of grouping reads or contigs and assigning them to operational taxonomic units (OTUs). ● OTU (Operational Taxonomic Unit): Taxonomic level of sampling selected by the user to be used in a study. Typically using a percent sequence similarity threshold for classifying microbes within the same, or different, OTUs. ● Chimeras: Artificial sequences formed during PCR amplification. The majority of them are believed to arise from incomplete extension. During subsequent cycles of PCR, a partially extended strand can bind to a template derived from a different but similar sequence. This then acts as a primer that is extended to form a chimeric sequence (Smith et al. 2010, Thompson et al., 2002, Meyerhans et al., 1990, Judo et al., 1998, Odelberg, 1995). A chimeric template is created during one round, then amplified by subsequent rounds to produce chimeric amplicons that are difficult to distinguish from amplicons derived from a single biological sequence. 3 Concepts | Chimeras Hass B.J. et al (2011) Chimeric 16S rRNA sequence formation and detection in Sanger and 454-pyrosequenced PCR amplicons, Genome Res. 21: 494-504.
  • 21. 3 21 ● Trimming: is the pre-processing step of cleaning sequence data (primers, multiplexing barcodes...) from automated DNA sequencers prior to sequence assembly and other downstream uses. ● Binning is the process of grouping reads or contigs and assigning them to operational taxonomic units (OTUs). ● OTU (Operational Taxonomic Unit): Taxonomic level of sampling selected by the user to be used in a study. Typically using a percent sequence similarity threshold for classifying microbes within the same, or different, OTUs. ● Chimeras: Artificial sequences formed during PCR amplification. The majority of them are believed to arise from incomplete extension. During subsequent cycles of PCR, a partially extended strand can bind to a template derived from a different but similar sequence. This then acts as a primer that is extended to form a chimeric sequence (Smith et al. 2010, Thompson et al., 2002, Meyerhans et al., 1990, Judo et al., 1998, Odelberg, 1995). A chimeric template is created during one round, then amplified by subsequent rounds to produce chimeric amplicons that are difficult to distinguish from amplicons derived from a single biological sequence. ● Alpha diversity: the diversity within a particular area or ecosystem; expressed by the number of species (i.e., species richness) in that ecosystem, or by one or more diversity indices. ● Beta diversity: a comparison of of diversity between ecosystems, usually measured as the amount of species change between the ecosystems. ● Gamma diversity: a measure of the overall diversity within a large region. Geographic-scale species diversity according to Hunter (2002:448). Concepts | Diversities Zinger L. et al. (2012) Two decades of describing the unseen majority of aquatic microbial diversity, Molecular Ecology 21, 1878–1896.
  • 22. 3 22 ● Trimming: is the pre-processing step of cleaning sequence data (primers, multiplexing barcodes...) from automated DNA sequencers prior to sequence assembly and other downstream uses. ● Binning is the process of grouping reads or contigs and assigning them to operational taxonomic units (OTUs). ● OTU (Operational Taxonomic Unit): Taxonomic level of sampling selected by the user to be used in a study. Typically using a percent sequence similarity threshold for classifying microbes within the same, or different, OTUs. ● Chimeras: Artificial sequences formed during PCR amplification. The majority of them are believed to arise from incomplete extension. During subsequent cycles of PCR, a partially extended strand can bind to a template derived from a different but similar sequence. This then acts as a primer that is extended to form a chimeric sequence (Smith et al. 2010, Thompson et al., 2002, Meyerhans et al., 1990, Judo et al., 1998, Odelberg, 1995). A chimeric template is created during one round, then amplified by subsequent rounds to produce chimeric amplicons that are difficult to distinguish from amplicons derived from a single biological sequence. ● Alpha diversity: the diversity within a particular area or ecosystem; expressed by the number of species (i.e., species richness) in that ecosystem, or by one or more diversity indices. ● Beta diversity: a comparison of of diversity between ecosystems, usually measured as the amount of species change between the ecosystems. ● Gamma diversity: a measure of the overall diversity within a large region. Geographic-scale species diversity according to Hunter (2002:448). Concepts | Diversity measurement issues Zhou J. et al. (2010) Random Sampling Process Leads to Overestimation of β-Diversity of Microbial Communities, mBio 4(3):e00324-13. doi:10.1128/mBio.00324-13. Diversity can virtually never be measured directly, rather it must be estimated or inferred from available data. Our estimates are anchored in the sample itself. Magurran (Ed.), Biological Diversity, Oxford U.P. 2010. Ch. 16 Microbial Diversity and Ecology
  • 23. 3 23 ● Trimming: is the pre-processing step of cleaning sequence data (primers, multiplexing barcodes...) from automated DNA sequencers prior to sequence assembly and other downstream uses. ● Binning is the process of grouping reads or contigs and assigning them to operational taxonomic units (OTUs). ● OTU (Operational Taxonomic Unit): Taxonomic level of sampling selected by the user to be used in a study. Typically using a percent sequence similarity threshold for classifying microbes within the same, or different, OTUs. ● Chimeras: Artificial sequences formed during PCR amplification. The majority of them are believed to arise from incomplete extension. During subsequent cycles of PCR, a partially extended strand can bind to a template derived from a different but similar sequence. This then acts as a primer that is extended to form a chimeric sequence (Smith et al. 2010, Thompson et al., 2002, Meyerhans et al., 1990, Judo et al., 1998, Odelberg, 1995). A chimeric template is created during one round, then amplified by subsequent rounds to produce chimeric amplicons that are difficult to distinguish from amplicons derived from a single biological sequence. ● Alpha diversity: the diversity within a particular area or ecosystem; expressed by the number of species (i.e., species richness) in that ecosystem, or by one or more diversity indices. ● Beta diversity: a comparison of of diversity between ecosystems, usually measured as the amount of species change between the ecosystems. ● Gamma diversity: a measure of the overall diversity within a large region. Geographic-scale species diversity according to Hunter (2002:448). ● Rarefaction allows the calculation of species richness for a given number of individual samples, based on the construction of so-called rarefaction curves. This curve is a plot of the number of species as a function of the number of samples. Concepts | Rarefaction most or all species have been sampled species rich habitat, only a small fraction has been sampled this habitat has not been exhaustively sampled Wooley J.C. et al. (2010) A Primer on Metagenomics, PLoS Computational Biology 6 (2) e1000667
  • 24. 3 24 Concepts | Diversity indices (α diversity) Mozzarella project, Michele Iacono http://www.science.gov/topicpages/w/water+buffalo+mozzarella.html Other indices: berger_parker_d, brillouin_d, dominance, doubles, esty_ci, fisher_alpha, gini_index, goods_coverage, margalef, mcintosh_d, mcintosh_e, menhinick,osd, simpson_reciprocal, robbins, singles, strong...
  • 25. 3 25 Concepts | Compositional similarity (β diversity) Mozzarella project, Michele Iacono http://www.science.gov/topicpages/w/water+buffalo+mozzarella.html
  • 26. 3 26 Concepts | Compositional similarity (β diversity) Mozzarella project, Michele Iacono http://www.science.gov/topicpages/w/water+buffalo+mozzarella.html
  • 27. 3 27 Concepts | Compositional similarity (β diversity) Mozzarella project, Michele Iacono http://www.science.gov/topicpages/w/water+buffalo+mozzarella.html
  • 28. 3 28 Concepts | Compositional similarity (β diversity) Heat map Mozzarella project, Michele Iacono http://www.science.gov/topicpages/w/water+buffalo+mozzarella.html
  • 29. 3 29 ● Trimming: is the pre-processing step of cleaning sequence data (primers, multiplexing barcodes...) from automated DNA sequencers prior to sequence assembly and other downstream uses. ● Binning is the process of grouping reads or contigs and assigning them to operational taxonomic units (OTUs). ● OTU (Operational Taxonomic Unit): Taxonomic level of sampling selected by the user to be used in a study. Typically using a percent sequence similarity threshold for classifying microbes within the same, or different, OTUs. ● Chimeras: Artificial sequences formed during PCR amplification. The majority of them are believed to arise from incomplete extension. During subsequent cycles of PCR, a partially extended strand can bind to a template derived from a different but similar sequence. This then acts as a primer that is extended to form a chimeric sequence (Smith et al. 2010, Thompson et al., 2002, Meyerhans et al., 1990, Judo et al., 1998, Odelberg, 1995). A chimeric template is created during one round, then amplified by subsequent rounds to produce chimeric amplicons that are difficult to distinguish from amplicons derived from a single biological sequence. ● Alpha diversity: the diversity within a particular area or ecosystem; expressed by the number of species (i.e., species richness) in that ecosystem, or by one or more diversity indices. ● Beta diversity: a comparison of of diversity between ecosystems, usually measured as the amount of species change between the ecosystems. ● Gamma diversity: a measure of the overall diversity within a large region. Geographic-scale species diversity according to Hunter (2002:448). ● Rarefaction allows the calculation of species richness for a given number of individual samples, based on the construction of so-called rarefaction curves. This curve is a plot of the number of species as a function of the number of samples. ● Metadata, reads, fasta/fastq files, counts, OTU tables/networks, .biom files, PCoA, p-values, diversity metrics, robustness, scores, jackniffed, clustering, UPGMA, trees, bootstrap, Bi-Plots, ... Concepts | Summary
  • 30. 4 APPROACHES & WORKFLOWSAPPROACHES & WORKFLOWS
  • 31. 4 31 Workflows | Microbial ecology approaches
  • 32. 4 32 Grice E.A. & Segre J.A. (2012) The Human Microbiome: Our Second Genome, Annu. Rev. Genomics Human Genet. 13, 151-170 Workflows | Overview Sample collection DNA extraction and preparation Sequencing Analysis
  • 33. 4 33 Grice E.A. & Segre J.A. (2012) The Human Microbiome: Our Second Genome, Annu. Rev. Genomics Human Genet. 13, 151-170 Workflows | Overview Sample collection DNA extraction and preparation Sequencing Analysis Experimental design Sample Quality Controls Sequence Quality Controls Biological interpretation
  • 34. 4.1 WGS MetagenomicsWGS Metagenomics
  • 35. 4 35 Workflows | Whole Genome Shotgun (WGS) Sven-Eric Schelhorn https://bioinf.mpi-inf.mpg.de/homepage/research.php?&account=sven
  • 36. 4 36 Workflows | Whole Genome Shotgun (WGS) Surya Saha & Magdalen Lindeberg http://www.slideshare.net/suryasaha/surya-saha-metagenomics-tools
  • 37. 4 37 Workflows | Whole Genome Shotgun (WGS) Surya Saha & Magdalen Lindeberg http://www.slideshare.net/suryasaha/surya-saha-metagenomics-tools
  • 38. 4 38 Workflows | Whole Genome Shotgun (WGS) Sven-Eric Schelhorn https://bioinf.mpi-inf.mpg.de/homepage/research.php?&account=sven
  • 39. 4 39 Workflows | Whole Genome Shotgun (WGS) Surya Saha & Magdalen Lindeberg http://www.slideshare.net/suryasaha/surya-saha-metagenomics-tools
  • 40. 4 40 Workflows | Whole Genome Shotgun (WGS) Surya Saha & Magdalen Lindeberg http://www.slideshare.net/suryasaha/surya-saha-metagenomics-tools
  • 42. 4 42 Workflows | 16S/ITS Community Surveys Surya Saha & Magdalen Lindeberg http://www.slideshare.net/suryasaha/surya-saha-metagenomics-tools
  • 43. 4 43 Workflows | 16S/ITS Community Surveys Surya Saha & Magdalen Lindeberg http://www.slideshare.net/suryasaha/surya-saha-metagenomics-tools
  • 44. 4 44 Workflows | 16S/ITS Community Surveys Surya Saha & Magdalen Lindeberg http://www.slideshare.net/suryasaha/surya-saha-metagenomics-tools
  • 45. 4 45 Workflows | 16S/ITS Community Surveys Surya Saha & Magdalen Lindeberg http://www.slideshare.net/suryasaha/surya-saha-metagenomics-tools
  • 46. 4 46 Workflows | 16S/ITS Community Surveys Surya Saha & Magdalen Lindeberg http://www.slideshare.net/suryasaha/surya-saha-metagenomics-tools
  • 47. 4 47 Workflows | 16S/ITS Community Surveys Surya Saha & Magdalen Lindeberg http://www.slideshare.net/suryasaha/surya-saha-metagenomics-tools
  • 49. 5 49 Tools | “The great quest”
  • 50. 5 50 Tools | “The great quest”
  • 51. 5 51 Tools | “The great quest”
  • 52. 5 52 Tools | “The great quest”
  • 56. 5 56 Tools | Mothur http://www.mothur.org/wiki/Main_Page / Kevin R. Theis (Michigan State University)
  • 57. 5 57 Tools | Mothur http://www.mothur.org/wiki/Main_Page / Kevin R. Theis (Michigan State University)
  • 58. 5 58 Tools | Mothur http://www.mothur.org/wiki/Main_Page / Kevin R. Theis (Michigan State University)
  • 71. 6 MORE RESOURCESMORE RESOURCES
  • 72. 6 72 More resources, courses... Resources & Projects: MEGAN DB http://www.megan-db.org/megan-db/ (MEtaGenomics ANalysis) CAMERA http://camera.calit2.net/ (community Cyberinfrastructure for Advanced Microbial Ecology Research and Analysis) MG-RAST Search http://metagenomics.anl.gov/metagenomics.cgi?page=MetagenomeSearch IMG http://img.jgi.doe.gov/ (Integrated Microbial Genomes and metagenomes) MetaBioME http://metasystems.riken.jp/metabiome/ (Comprehensive Metagenomic BioMining Engine) BOLD http://www.boldsystems.org/ (Barcoding Of Live Database) GOS Expedition http://www.jcvi.org/cms/research/projects/gos/overview (Global Ocean Sampling) ...
  • 73. 6 73 More resources, courses... Courses: EBI http://www.ebi.ac.uk/training/course/metagenomics2014 EMBO http://cymeandcystidium.com/?tag=metagenomics Coursera https://www.coursera.org/course/genomescience ... and a lot of seminars and workshops everywhere
  • 74. Hospital Universitari Vall d’Hebron Institut de Recerca - VHIR Institut d’Investigació Sanitària de l’Instituto de Salud Carlos III (ISCIII) Thanks for your attentionThanks for your attention and also thanks to Josep Gregori (VHIR, ROCHE) for providing some materials INTRODUCTION TO METAGENOMICSINTRODUCTION TO METAGENOMICS Bioinformatics for Biological Researchers http://eib.stat.ub.edu/2014BBR Ferran Briansó ferran.brianso@vhir.org 28/05/2014