SlideShare una empresa de Scribd logo
1 de 24
Genome Sequencing Projects, Genome
Size, Application of sequence information for
                identification of disease genes
Complete Genome Sequencing
 Whole genome shotgun sequencing
 BAC end sequencing
 Chromosome walking
 End sealing
Reference: http://en.wikipedia.org/wiki/File:Genome_Sizes.png
Cost of Genome Sequencing
Nextgen sequencing methods
 454 sequencing methods(2006)
    Principles of pyrophosphate detection(1985, 1988)

 Illumina(Solexa) Genome sequencing methods(2007)
 Applied Biosystems ABI SOLiD System(2007)
 Helicos single molecule sequencing(Helioscope, 2007)
 Pacific Biosciences single-molecule real-time(SMRT)
  technology, 2010
 Sequenom for Nanotechnology based sequencing.
 BioNanomatrixnanofluidiscs
 RNAP technology
http://www.ncbi.nlm.nih.gov/books/NBK20261/
Sequencing methods

          http://www.wellcome.ac.uk/Education-resources/Teaching-and-
          education/Animations/DNA/WTDV026689.htm



         Ref: http://www.wellcome.ac.uk/Education-resources/Teaching-and-
         education/Animations/DNA/WTX056046.htm




          http://www.wellcome.ac.uk/Education-resources/Teaching-and-
          education/Animations/DNA/WTX056051.htm
Ion Torrent
SOLiD Sequencing
http://www.genomesonline.org/cgi-bin/GOLD/index.cgi
http://www.insdc.org/   http://www.ebi.ac.uk/embl
                        /Contact/collaboration.ht
                        ml
Microbial Genome Sequencing
•   JGI – IMG [http://img.jgi.doe.gov/]
•   Broad [http://www.broadinstitute.org/]
•   TIGR [http://www.jcvi.org/]
•   WashU [http://genome.wustl.edu/]
•   VBI at Virginia Tech [www.vbi.vt.edu]
Human Genome Project
                                 NHGRI
                                Solicited                 RFAs were
                    First
                                  pilot                   sought for
                  Publicati
                               proposal for                  full
                   on in
                                ENCODE                    ENCODE
                    2000




  In October                              GWAS -
                              Finished        90% lies   First Report
 1990 Human                                                             ENCODE
                              paper in        outside     on Encode
   Genome                                      coding                   published
                                2003                     Published in
project started                                2005                       2012
                                                             2007
What happens next?
 You have 10 million characters – what to do with them?
    Locate genes
    Determine the function of the gene
         By similarity search
         By domain search
         By Predicting signal peptide
         By locating transmembrane region




Ref: http://www.nature.com/nature/journal/v406/n6797/pdf/406799a0.pdf
Genome Annotation


                       Run 6 frame                   Run Blastp
  ATGAAGATAGACAG       translation                   with nr
  CATACTAGCAGCAT
  AGAATAGATAAGAG
  ATAGAAATAGAATA                                           Matc
                                                            h
   AATATAAGAGAGA                                          found
                                             N
                                             o


      Repeat
      Finding, miRN                                        Product found
      A
                                         Make an
      finding, tRNAs
                                         hmmsearch
      can etc.                       N
                                     O
                                                     Pathway analysis
                                             Matc
                                                     Other analysis
                                              h
                                            found



                              Unknown
                               Genes                   Hypothesis
Genome Sizes
   Gametic Nuclear DNA content
   Represented as mass in pg(pico grams) or length in
    mega bases


                 1 pg = 10^-12 gms
                 1mb = 10^6 bases
                   1 pg = 978 Mb




Ref: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1669731/
Genome Sizes
 Database of Genome Sizes
    http://www.cbs.dtu.dk/databases/DOGS/
 Plant Genome database
    http://www.kew.org/genomesize/homepage.html
 Mamalian genome size database
    http://www.unipv.it/webbio/dbagsdb.htm
 Animal Genome size database
    www.genomesize.com
 Fungal Genome size database.
    www.zbi.ee/fungal-genomesize
Ref: http://www.kew.org/genomesize/homepage.html
Ref: http://www.genomesize.com/
Ref: http://www-3.unipv.it/webbio/dbagsh.htm
Ref: http://www.zbi.ee/fungal-genomesize/
Identifying Human Disease genes
ref: http://www.ncbi.nlm.nih.gov/books/NBK7561/

  Before 1980, very few genes were recognized
     Reverse Genetics: Know gene product and go back to
      gene and do a positional cloning
     Genetic Redundancy: Multiple genes have the same
      function
Identification of genes through
protein product
1000 genomes project
  1092 genomes of different individuals sequenced.
     14 populations
     Low coverage exome sequencing




 38 million SNPs
 1.4 million short insertions
 14,000 large deletions




Ref: http://www.nature.com/nature/journal/v491/n7422/full/nature11632.html

Más contenido relacionado

La actualidad más candente

Ngs microbiome
Ngs microbiomeNgs microbiome
Ngs microbiome
jukais
 
Next-generation genomics: an integrative approach
Next-generation genomics: an integrative approachNext-generation genomics: an integrative approach
Next-generation genomics: an integrative approach
Hong ChangBum
 

La actualidad más candente (14)

Neurotech seminar ish wish 2014 maduna
Neurotech seminar ish wish 2014 madunaNeurotech seminar ish wish 2014 maduna
Neurotech seminar ish wish 2014 maduna
 
Ngs microbiome
Ngs microbiomeNgs microbiome
Ngs microbiome
 
Data analysis pipelines for NGS applications
Data analysis pipelines for NGS applicationsData analysis pipelines for NGS applications
Data analysis pipelines for NGS applications
 
ECCB 2010 Next-gen sequencing Tutorial
ECCB 2010 Next-gen sequencing TutorialECCB 2010 Next-gen sequencing Tutorial
ECCB 2010 Next-gen sequencing Tutorial
 
Ngs introduction
Ngs introductionNgs introduction
Ngs introduction
 
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
 
Marker devt. workshop 27022012
Marker devt. workshop 27022012Marker devt. workshop 27022012
Marker devt. workshop 27022012
 
Next-generation sequencing and quality control: An Introduction (2016)
Next-generation sequencing and quality control: An Introduction (2016)Next-generation sequencing and quality control: An Introduction (2016)
Next-generation sequencing and quality control: An Introduction (2016)
 
Next-generation genomics: an integrative approach
Next-generation genomics: an integrative approachNext-generation genomics: an integrative approach
Next-generation genomics: an integrative approach
 
Molecular QC: Interpreting your Bioinformatics Pipeline
Molecular QC: Interpreting your Bioinformatics PipelineMolecular QC: Interpreting your Bioinformatics Pipeline
Molecular QC: Interpreting your Bioinformatics Pipeline
 
Rapd and its application
Rapd and its applicationRapd and its application
Rapd and its application
 
Data Management for Quantitative Biology - Data sources (Next generation tech...
Data Management for Quantitative Biology - Data sources (Next generation tech...Data Management for Quantitative Biology - Data sources (Next generation tech...
Data Management for Quantitative Biology - Data sources (Next generation tech...
 
RNA sequencing: advances and opportunities
RNA sequencing: advances and opportunities RNA sequencing: advances and opportunities
RNA sequencing: advances and opportunities
 
Toolbox for bacterial population analysis using NGS
Toolbox for bacterial population analysis using NGSToolbox for bacterial population analysis using NGS
Toolbox for bacterial population analysis using NGS
 

Similar a Lecture 3,4

Caporaso sloan qiime_workshop_slides_18_oct2012
Caporaso sloan qiime_workshop_slides_18_oct2012Caporaso sloan qiime_workshop_slides_18_oct2012
Caporaso sloan qiime_workshop_slides_18_oct2012
gregcaporaso
 
Lab2_3_Lecture_DNA_PCR (3).pptx
Lab2_3_Lecture_DNA_PCR (3).pptxLab2_3_Lecture_DNA_PCR (3).pptx
Lab2_3_Lecture_DNA_PCR (3).pptx
karlos64
 
Bio-IT 2010 Genome Commons
Bio-IT 2010 Genome CommonsBio-IT 2010 Genome Commons
Bio-IT 2010 Genome Commons
Reece Hart
 
Databases used in forensic sciences and current status of this science in pak...
Databases used in forensic sciences and current status of this science in pak...Databases used in forensic sciences and current status of this science in pak...
Databases used in forensic sciences and current status of this science in pak...
Muhammad Aurangzeb khan
 
The Human Genome Project - Part III
The Human Genome Project - Part IIIThe Human Genome Project - Part III
The Human Genome Project - Part III
hhalhaddad
 

Similar a Lecture 3,4 (20)

Experimentos de nubes científicas: Medical Genome Project
Experimentos de nubes científicas: Medical Genome ProjectExperimentos de nubes científicas: Medical Genome Project
Experimentos de nubes científicas: Medical Genome Project
 
RNA-seq Analysis
RNA-seq AnalysisRNA-seq Analysis
RNA-seq Analysis
 
Caporaso sloan qiime_workshop_slides_18_oct2012
Caporaso sloan qiime_workshop_slides_18_oct2012Caporaso sloan qiime_workshop_slides_18_oct2012
Caporaso sloan qiime_workshop_slides_18_oct2012
 
Introduction to Apollo for i5k
Introduction to Apollo for i5kIntroduction to Apollo for i5k
Introduction to Apollo for i5k
 
GeneArt® services - Gene synthesis through protein production
GeneArt® services - Gene synthesis through protein productionGeneArt® services - Gene synthesis through protein production
GeneArt® services - Gene synthesis through protein production
 
Lab2_3_Lecture_DNA_PCR (3).pptx
Lab2_3_Lecture_DNA_PCR (3).pptxLab2_3_Lecture_DNA_PCR (3).pptx
Lab2_3_Lecture_DNA_PCR (3).pptx
 
Lecture5,6
Lecture5,6Lecture5,6
Lecture5,6
 
Bio-IT 2010 Genome Commons
Bio-IT 2010 Genome CommonsBio-IT 2010 Genome Commons
Bio-IT 2010 Genome Commons
 
human genome project_094513.pptx
human genome project_094513.pptxhuman genome project_094513.pptx
human genome project_094513.pptx
 
An introduction to RNA-seq data analysis
An introduction to RNA-seq data analysisAn introduction to RNA-seq data analysis
An introduction to RNA-seq data analysis
 
Identification and characterization of effector genes from wheat stripe rust
Identification and characterization of effector genes from wheat stripe rustIdentification and characterization of effector genes from wheat stripe rust
Identification and characterization of effector genes from wheat stripe rust
 
General Principles of Toxicogenomics
General Principles of ToxicogenomicsGeneral Principles of Toxicogenomics
General Principles of Toxicogenomics
 
From Sequence to Knowledge: The Art and Science of Phage Genome Annotation
From Sequence to Knowledge: The Art and Science of Phage Genome AnnotationFrom Sequence to Knowledge: The Art and Science of Phage Genome Annotation
From Sequence to Knowledge: The Art and Science of Phage Genome Annotation
 
Databases used in forensic sciences and current status of this science in pak...
Databases used in forensic sciences and current status of this science in pak...Databases used in forensic sciences and current status of this science in pak...
Databases used in forensic sciences and current status of this science in pak...
 
The Human Genome Project - Part III
The Human Genome Project - Part IIIThe Human Genome Project - Part III
The Human Genome Project - Part III
 
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...
 
Genome Sequencing Project
Genome Sequencing ProjectGenome Sequencing Project
Genome Sequencing Project
 
Stephen Friend Nature Genetics Colloquium 2012-03-24
Stephen Friend Nature Genetics Colloquium 2012-03-24Stephen Friend Nature Genetics Colloquium 2012-03-24
Stephen Friend Nature Genetics Colloquium 2012-03-24
 
Genome sequencingprojects
Genome sequencingprojectsGenome sequencingprojects
Genome sequencingprojects
 
Evaluation of the impact of error correction algorithms on SNP calling.
Evaluation of the impact of error correction algorithms on SNP calling.Evaluation of the impact of error correction algorithms on SNP calling.
Evaluation of the impact of error correction algorithms on SNP calling.
 

Más de Sucheta Tripathy

Más de Sucheta Tripathy (20)

Gal
GalGal
Gal
 
Ramorum2016 final
Ramorum2016 finalRamorum2016 final
Ramorum2016 final
 
Primer designgeneprediction
Primer designgenepredictionPrimer designgeneprediction
Primer designgeneprediction
 
Motif andpatterndatabase
Motif andpatterndatabaseMotif andpatterndatabase
Motif andpatterndatabase
 
Databases ii
Databases iiDatabases ii
Databases ii
 
Snps and microarray
Snps and microarraySnps and microarray
Snps and microarray
 
Stat2013
Stat2013Stat2013
Stat2013
 
26 nov2013seminar
26 nov2013seminar26 nov2013seminar
26 nov2013seminar
 
Stat2013
Stat2013Stat2013
Stat2013
 
Presentation2013
Presentation2013Presentation2013
Presentation2013
 
Lecture7,8
Lecture7,8Lecture7,8
Lecture7,8
 
Primer designgeneprediction
Primer designgenepredictionPrimer designgeneprediction
Primer designgeneprediction
 
Lecture 1,2
Lecture 1,2Lecture 1,2
Lecture 1,2
 
Sequence Alignment,Blast, Fasta, MSA
Sequence Alignment,Blast, Fasta, MSASequence Alignment,Blast, Fasta, MSA
Sequence Alignment,Blast, Fasta, MSA
 
Databases Part II
Databases Part IIDatabases Part II
Databases Part II
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Human encodeproject
Human encodeprojectHuman encodeproject
Human encodeproject
 
Tyler presentation
Tyler presentationTyler presentation
Tyler presentation
 
Tyler presentation
Tyler presentationTyler presentation
Tyler presentation
 
Vbi oomycetes2011 final
Vbi oomycetes2011 finalVbi oomycetes2011 final
Vbi oomycetes2011 final
 

Último

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 

Último (20)

Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 

Lecture 3,4

  • 1. Genome Sequencing Projects, Genome Size, Application of sequence information for identification of disease genes
  • 2. Complete Genome Sequencing  Whole genome shotgun sequencing  BAC end sequencing  Chromosome walking  End sealing
  • 4. Cost of Genome Sequencing
  • 5. Nextgen sequencing methods  454 sequencing methods(2006)  Principles of pyrophosphate detection(1985, 1988)  Illumina(Solexa) Genome sequencing methods(2007)  Applied Biosystems ABI SOLiD System(2007)  Helicos single molecule sequencing(Helioscope, 2007)  Pacific Biosciences single-molecule real-time(SMRT) technology, 2010  Sequenom for Nanotechnology based sequencing.  BioNanomatrixnanofluidiscs  RNAP technology http://www.ncbi.nlm.nih.gov/books/NBK20261/
  • 6. Sequencing methods http://www.wellcome.ac.uk/Education-resources/Teaching-and- education/Animations/DNA/WTDV026689.htm Ref: http://www.wellcome.ac.uk/Education-resources/Teaching-and- education/Animations/DNA/WTX056046.htm http://www.wellcome.ac.uk/Education-resources/Teaching-and- education/Animations/DNA/WTX056051.htm
  • 10. http://www.insdc.org/ http://www.ebi.ac.uk/embl /Contact/collaboration.ht ml
  • 11. Microbial Genome Sequencing • JGI – IMG [http://img.jgi.doe.gov/] • Broad [http://www.broadinstitute.org/] • TIGR [http://www.jcvi.org/] • WashU [http://genome.wustl.edu/] • VBI at Virginia Tech [www.vbi.vt.edu]
  • 12. Human Genome Project NHGRI Solicited RFAs were First pilot sought for Publicati proposal for full on in ENCODE ENCODE 2000 In October GWAS - Finished 90% lies First Report 1990 Human ENCODE paper in outside on Encode Genome coding published 2003 Published in project started 2005 2012 2007
  • 13. What happens next?  You have 10 million characters – what to do with them?  Locate genes  Determine the function of the gene  By similarity search  By domain search  By Predicting signal peptide  By locating transmembrane region Ref: http://www.nature.com/nature/journal/v406/n6797/pdf/406799a0.pdf
  • 14. Genome Annotation Run 6 frame Run Blastp ATGAAGATAGACAG translation with nr CATACTAGCAGCAT AGAATAGATAAGAG ATAGAAATAGAATA Matc h AATATAAGAGAGA found N o Repeat Finding, miRN Product found A Make an finding, tRNAs hmmsearch can etc. N O Pathway analysis Matc Other analysis h found Unknown Genes Hypothesis
  • 15. Genome Sizes  Gametic Nuclear DNA content  Represented as mass in pg(pico grams) or length in mega bases 1 pg = 10^-12 gms 1mb = 10^6 bases 1 pg = 978 Mb Ref: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1669731/
  • 16. Genome Sizes  Database of Genome Sizes  http://www.cbs.dtu.dk/databases/DOGS/  Plant Genome database  http://www.kew.org/genomesize/homepage.html  Mamalian genome size database  http://www.unipv.it/webbio/dbagsdb.htm  Animal Genome size database  www.genomesize.com  Fungal Genome size database.  www.zbi.ee/fungal-genomesize
  • 17.
  • 22. Identifying Human Disease genes ref: http://www.ncbi.nlm.nih.gov/books/NBK7561/  Before 1980, very few genes were recognized  Reverse Genetics: Know gene product and go back to gene and do a positional cloning  Genetic Redundancy: Multiple genes have the same function
  • 23. Identification of genes through protein product
  • 24. 1000 genomes project  1092 genomes of different individuals sequenced.  14 populations  Low coverage exome sequencing 38 million SNPs 1.4 million short insertions 14,000 large deletions Ref: http://www.nature.com/nature/journal/v491/n7422/full/nature11632.html