SlideShare a Scribd company logo
1 of 24
Genome Sequencing Projects, Genome
Size, Application of sequence information for
                identification of disease genes
Complete Genome Sequencing
 Whole genome shotgun sequencing
 BAC end sequencing
 Chromosome walking
 End sealing
Reference: http://en.wikipedia.org/wiki/File:Genome_Sizes.png
Cost of Genome Sequencing
Nextgen sequencing methods
 454 sequencing methods(2006)
    Principles of pyrophosphate detection(1985, 1988)

 Illumina(Solexa) Genome sequencing methods(2007)
 Applied Biosystems ABI SOLiD System(2007)
 Helicos single molecule sequencing(Helioscope, 2007)
 Pacific Biosciences single-molecule real-time(SMRT)
  technology, 2010
 Sequenom for Nanotechnology based sequencing.
 BioNanomatrixnanofluidiscs
 RNAP technology
http://www.ncbi.nlm.nih.gov/books/NBK20261/
Sequencing methods

          http://www.wellcome.ac.uk/Education-resources/Teaching-and-
          education/Animations/DNA/WTDV026689.htm



         Ref: http://www.wellcome.ac.uk/Education-resources/Teaching-and-
         education/Animations/DNA/WTX056046.htm




          http://www.wellcome.ac.uk/Education-resources/Teaching-and-
          education/Animations/DNA/WTX056051.htm
Ion Torrent
SOLiD Sequencing
http://www.genomesonline.org/cgi-bin/GOLD/index.cgi
http://www.insdc.org/   http://www.ebi.ac.uk/embl
                        /Contact/collaboration.ht
                        ml
Microbial Genome Sequencing
•   JGI – IMG [http://img.jgi.doe.gov/]
•   Broad [http://www.broadinstitute.org/]
•   TIGR [http://www.jcvi.org/]
•   WashU [http://genome.wustl.edu/]
•   VBI at Virginia Tech [www.vbi.vt.edu]
Human Genome Project
                                 NHGRI
                                Solicited                 RFAs were
                    First
                                  pilot                   sought for
                  Publicati
                               proposal for                  full
                   on in
                                ENCODE                    ENCODE
                    2000




  In October                              GWAS -
                              Finished        90% lies   First Report
 1990 Human                                                             ENCODE
                              paper in        outside     on Encode
   Genome                                      coding                   published
                                2003                     Published in
project started                                2005                       2012
                                                             2007
What happens next?
 You have 10 million characters – what to do with them?
    Locate genes
    Determine the function of the gene
         By similarity search
         By domain search
         By Predicting signal peptide
         By locating transmembrane region




Ref: http://www.nature.com/nature/journal/v406/n6797/pdf/406799a0.pdf
Genome Annotation


                       Run 6 frame                   Run Blastp
  ATGAAGATAGACAG       translation                   with nr
  CATACTAGCAGCAT
  AGAATAGATAAGAG
  ATAGAAATAGAATA                                           Matc
                                                            h
   AATATAAGAGAGA                                          found
                                             N
                                             o


      Repeat
      Finding, miRN                                        Product found
      A
                                         Make an
      finding, tRNAs
                                         hmmsearch
      can etc.                       N
                                     O
                                                     Pathway analysis
                                             Matc
                                                     Other analysis
                                              h
                                            found



                              Unknown
                               Genes                   Hypothesis
Genome Sizes
   Gametic Nuclear DNA content
   Represented as mass in pg(pico grams) or length in
    mega bases


                 1 pg = 10^-12 gms
                 1mb = 10^6 bases
                   1 pg = 978 Mb




Ref: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1669731/
Genome Sizes
 Database of Genome Sizes
    http://www.cbs.dtu.dk/databases/DOGS/
 Plant Genome database
    http://www.kew.org/genomesize/homepage.html
 Mamalian genome size database
    http://www.unipv.it/webbio/dbagsdb.htm
 Animal Genome size database
    www.genomesize.com
 Fungal Genome size database.
    www.zbi.ee/fungal-genomesize
Ref: http://www.kew.org/genomesize/homepage.html
Ref: http://www.genomesize.com/
Ref: http://www-3.unipv.it/webbio/dbagsh.htm
Ref: http://www.zbi.ee/fungal-genomesize/
Identifying Human Disease genes
ref: http://www.ncbi.nlm.nih.gov/books/NBK7561/

  Before 1980, very few genes were recognized
     Reverse Genetics: Know gene product and go back to
      gene and do a positional cloning
     Genetic Redundancy: Multiple genes have the same
      function
Identification of genes through
protein product
1000 genomes project
  1092 genomes of different individuals sequenced.
     14 populations
     Low coverage exome sequencing




 38 million SNPs
 1.4 million short insertions
 14,000 large deletions




Ref: http://www.nature.com/nature/journal/v491/n7422/full/nature11632.html

More Related Content

What's hot

Neurotech seminar ish wish 2014 maduna
Neurotech seminar ish wish 2014 madunaNeurotech seminar ish wish 2014 maduna
Neurotech seminar ish wish 2014 madunaTando Maduna
 
Ngs microbiome
Ngs microbiomeNgs microbiome
Ngs microbiomejukais
 
ECCB 2010 Next-gen sequencing Tutorial
ECCB 2010 Next-gen sequencing TutorialECCB 2010 Next-gen sequencing Tutorial
ECCB 2010 Next-gen sequencing TutorialThomas Keane
 
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...VHIR Vall d’Hebron Institut de Recerca
 
Marker devt. workshop 27022012
Marker devt. workshop 27022012Marker devt. workshop 27022012
Marker devt. workshop 27022012Koppolu Ravi
 
Next-generation sequencing and quality control: An Introduction (2016)
Next-generation sequencing and quality control: An Introduction (2016)Next-generation sequencing and quality control: An Introduction (2016)
Next-generation sequencing and quality control: An Introduction (2016)Sebastian Schmeier
 
Next-generation genomics: an integrative approach
Next-generation genomics: an integrative approachNext-generation genomics: an integrative approach
Next-generation genomics: an integrative approachHong ChangBum
 
Molecular QC: Interpreting your Bioinformatics Pipeline
Molecular QC: Interpreting your Bioinformatics PipelineMolecular QC: Interpreting your Bioinformatics Pipeline
Molecular QC: Interpreting your Bioinformatics PipelineCandy Smellie
 
Data Management for Quantitative Biology - Data sources (Next generation tech...
Data Management for Quantitative Biology - Data sources (Next generation tech...Data Management for Quantitative Biology - Data sources (Next generation tech...
Data Management for Quantitative Biology - Data sources (Next generation tech...QBiC_Tue
 
RNA sequencing: advances and opportunities
RNA sequencing: advances and opportunities RNA sequencing: advances and opportunities
RNA sequencing: advances and opportunities Paolo Dametto
 
Toolbox for bacterial population analysis using NGS
Toolbox for bacterial population analysis using NGSToolbox for bacterial population analysis using NGS
Toolbox for bacterial population analysis using NGSMirko Rossi
 

What's hot (14)

Neurotech seminar ish wish 2014 maduna
Neurotech seminar ish wish 2014 madunaNeurotech seminar ish wish 2014 maduna
Neurotech seminar ish wish 2014 maduna
 
Ngs microbiome
Ngs microbiomeNgs microbiome
Ngs microbiome
 
Data analysis pipelines for NGS applications
Data analysis pipelines for NGS applicationsData analysis pipelines for NGS applications
Data analysis pipelines for NGS applications
 
ECCB 2010 Next-gen sequencing Tutorial
ECCB 2010 Next-gen sequencing TutorialECCB 2010 Next-gen sequencing Tutorial
ECCB 2010 Next-gen sequencing Tutorial
 
Ngs introduction
Ngs introductionNgs introduction
Ngs introduction
 
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
NGS Introduction and Technology Overview (UEB-UAT Bioinformatics Course - Ses...
 
Marker devt. workshop 27022012
Marker devt. workshop 27022012Marker devt. workshop 27022012
Marker devt. workshop 27022012
 
Next-generation sequencing and quality control: An Introduction (2016)
Next-generation sequencing and quality control: An Introduction (2016)Next-generation sequencing and quality control: An Introduction (2016)
Next-generation sequencing and quality control: An Introduction (2016)
 
Next-generation genomics: an integrative approach
Next-generation genomics: an integrative approachNext-generation genomics: an integrative approach
Next-generation genomics: an integrative approach
 
Molecular QC: Interpreting your Bioinformatics Pipeline
Molecular QC: Interpreting your Bioinformatics PipelineMolecular QC: Interpreting your Bioinformatics Pipeline
Molecular QC: Interpreting your Bioinformatics Pipeline
 
Rapd and its application
Rapd and its applicationRapd and its application
Rapd and its application
 
Data Management for Quantitative Biology - Data sources (Next generation tech...
Data Management for Quantitative Biology - Data sources (Next generation tech...Data Management for Quantitative Biology - Data sources (Next generation tech...
Data Management for Quantitative Biology - Data sources (Next generation tech...
 
RNA sequencing: advances and opportunities
RNA sequencing: advances and opportunities RNA sequencing: advances and opportunities
RNA sequencing: advances and opportunities
 
Toolbox for bacterial population analysis using NGS
Toolbox for bacterial population analysis using NGSToolbox for bacterial population analysis using NGS
Toolbox for bacterial population analysis using NGS
 

Similar to Lecture 3,4

Experimentos de nubes científicas: Medical Genome Project
Experimentos de nubes científicas: Medical Genome ProjectExperimentos de nubes científicas: Medical Genome Project
Experimentos de nubes científicas: Medical Genome ProjectFundación Ramón Areces
 
Caporaso sloan qiime_workshop_slides_18_oct2012
Caporaso sloan qiime_workshop_slides_18_oct2012Caporaso sloan qiime_workshop_slides_18_oct2012
Caporaso sloan qiime_workshop_slides_18_oct2012gregcaporaso
 
GeneArt® services - Gene synthesis through protein production
GeneArt® services - Gene synthesis through protein productionGeneArt® services - Gene synthesis through protein production
GeneArt® services - Gene synthesis through protein productionThermo Fisher Scientific
 
Lab2_3_Lecture_DNA_PCR (3).pptx
Lab2_3_Lecture_DNA_PCR (3).pptxLab2_3_Lecture_DNA_PCR (3).pptx
Lab2_3_Lecture_DNA_PCR (3).pptxkarlos64
 
Bio-IT 2010 Genome Commons
Bio-IT 2010 Genome CommonsBio-IT 2010 Genome Commons
Bio-IT 2010 Genome CommonsReece Hart
 
human genome project_094513.pptx
human genome project_094513.pptxhuman genome project_094513.pptx
human genome project_094513.pptxpadmasriv25
 
An introduction to RNA-seq data analysis
An introduction to RNA-seq data analysisAn introduction to RNA-seq data analysis
An introduction to RNA-seq data analysisAGRF_Ltd
 
Identification and characterization of effector genes from wheat stripe rust
Identification and characterization of effector genes from wheat stripe rustIdentification and characterization of effector genes from wheat stripe rust
Identification and characterization of effector genes from wheat stripe rustBorlaug Global Rust Initiative
 
General Principles of Toxicogenomics
General Principles of ToxicogenomicsGeneral Principles of Toxicogenomics
General Principles of Toxicogenomicscwoodland
 
From Sequence to Knowledge: The Art and Science of Phage Genome Annotation
From Sequence to Knowledge: The Art and Science of Phage Genome AnnotationFrom Sequence to Knowledge: The Art and Science of Phage Genome Annotation
From Sequence to Knowledge: The Art and Science of Phage Genome AnnotationRamy K. Aziz
 
Databases used in forensic sciences and current status of this science in pak...
Databases used in forensic sciences and current status of this science in pak...Databases used in forensic sciences and current status of this science in pak...
Databases used in forensic sciences and current status of this science in pak...Muhammad Aurangzeb khan
 
The Human Genome Project - Part III
The Human Genome Project - Part IIIThe Human Genome Project - Part III
The Human Genome Project - Part IIIhhalhaddad
 
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...Eastern Pennsylvania Branch ASM
 
Genome Sequencing Project
Genome Sequencing ProjectGenome Sequencing Project
Genome Sequencing Projectguestd53a1
 
Stephen Friend Nature Genetics Colloquium 2012-03-24
Stephen Friend Nature Genetics Colloquium 2012-03-24Stephen Friend Nature Genetics Colloquium 2012-03-24
Stephen Friend Nature Genetics Colloquium 2012-03-24Sage Base
 
Evaluation of the impact of error correction algorithms on SNP calling.
Evaluation of the impact of error correction algorithms on SNP calling.Evaluation of the impact of error correction algorithms on SNP calling.
Evaluation of the impact of error correction algorithms on SNP calling.Nathan Olson
 

Similar to Lecture 3,4 (20)

Experimentos de nubes científicas: Medical Genome Project
Experimentos de nubes científicas: Medical Genome ProjectExperimentos de nubes científicas: Medical Genome Project
Experimentos de nubes científicas: Medical Genome Project
 
RNA-seq Analysis
RNA-seq AnalysisRNA-seq Analysis
RNA-seq Analysis
 
Caporaso sloan qiime_workshop_slides_18_oct2012
Caporaso sloan qiime_workshop_slides_18_oct2012Caporaso sloan qiime_workshop_slides_18_oct2012
Caporaso sloan qiime_workshop_slides_18_oct2012
 
Introduction to Apollo for i5k
Introduction to Apollo for i5kIntroduction to Apollo for i5k
Introduction to Apollo for i5k
 
GeneArt® services - Gene synthesis through protein production
GeneArt® services - Gene synthesis through protein productionGeneArt® services - Gene synthesis through protein production
GeneArt® services - Gene synthesis through protein production
 
Lab2_3_Lecture_DNA_PCR (3).pptx
Lab2_3_Lecture_DNA_PCR (3).pptxLab2_3_Lecture_DNA_PCR (3).pptx
Lab2_3_Lecture_DNA_PCR (3).pptx
 
Lecture5,6
Lecture5,6Lecture5,6
Lecture5,6
 
Bio-IT 2010 Genome Commons
Bio-IT 2010 Genome CommonsBio-IT 2010 Genome Commons
Bio-IT 2010 Genome Commons
 
human genome project_094513.pptx
human genome project_094513.pptxhuman genome project_094513.pptx
human genome project_094513.pptx
 
An introduction to RNA-seq data analysis
An introduction to RNA-seq data analysisAn introduction to RNA-seq data analysis
An introduction to RNA-seq data analysis
 
Identification and characterization of effector genes from wheat stripe rust
Identification and characterization of effector genes from wheat stripe rustIdentification and characterization of effector genes from wheat stripe rust
Identification and characterization of effector genes from wheat stripe rust
 
General Principles of Toxicogenomics
General Principles of ToxicogenomicsGeneral Principles of Toxicogenomics
General Principles of Toxicogenomics
 
From Sequence to Knowledge: The Art and Science of Phage Genome Annotation
From Sequence to Knowledge: The Art and Science of Phage Genome AnnotationFrom Sequence to Knowledge: The Art and Science of Phage Genome Annotation
From Sequence to Knowledge: The Art and Science of Phage Genome Annotation
 
Databases used in forensic sciences and current status of this science in pak...
Databases used in forensic sciences and current status of this science in pak...Databases used in forensic sciences and current status of this science in pak...
Databases used in forensic sciences and current status of this science in pak...
 
The Human Genome Project - Part III
The Human Genome Project - Part IIIThe Human Genome Project - Part III
The Human Genome Project - Part III
 
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...
 
Genome Sequencing Project
Genome Sequencing ProjectGenome Sequencing Project
Genome Sequencing Project
 
Stephen Friend Nature Genetics Colloquium 2012-03-24
Stephen Friend Nature Genetics Colloquium 2012-03-24Stephen Friend Nature Genetics Colloquium 2012-03-24
Stephen Friend Nature Genetics Colloquium 2012-03-24
 
Genome sequencingprojects
Genome sequencingprojectsGenome sequencingprojects
Genome sequencingprojects
 
Evaluation of the impact of error correction algorithms on SNP calling.
Evaluation of the impact of error correction algorithms on SNP calling.Evaluation of the impact of error correction algorithms on SNP calling.
Evaluation of the impact of error correction algorithms on SNP calling.
 

More from Sucheta Tripathy (20)

Gal
GalGal
Gal
 
Ramorum2016 final
Ramorum2016 finalRamorum2016 final
Ramorum2016 final
 
Primer designgeneprediction
Primer designgenepredictionPrimer designgeneprediction
Primer designgeneprediction
 
Motif andpatterndatabase
Motif andpatterndatabaseMotif andpatterndatabase
Motif andpatterndatabase
 
Databases ii
Databases iiDatabases ii
Databases ii
 
Snps and microarray
Snps and microarraySnps and microarray
Snps and microarray
 
Stat2013
Stat2013Stat2013
Stat2013
 
26 nov2013seminar
26 nov2013seminar26 nov2013seminar
26 nov2013seminar
 
Stat2013
Stat2013Stat2013
Stat2013
 
Presentation2013
Presentation2013Presentation2013
Presentation2013
 
Lecture7,8
Lecture7,8Lecture7,8
Lecture7,8
 
Primer designgeneprediction
Primer designgenepredictionPrimer designgeneprediction
Primer designgeneprediction
 
Lecture 1,2
Lecture 1,2Lecture 1,2
Lecture 1,2
 
Sequence Alignment,Blast, Fasta, MSA
Sequence Alignment,Blast, Fasta, MSASequence Alignment,Blast, Fasta, MSA
Sequence Alignment,Blast, Fasta, MSA
 
Databases Part II
Databases Part IIDatabases Part II
Databases Part II
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Human encodeproject
Human encodeprojectHuman encodeproject
Human encodeproject
 
Tyler presentation
Tyler presentationTyler presentation
Tyler presentation
 
Tyler presentation
Tyler presentationTyler presentation
Tyler presentation
 
Vbi oomycetes2011 final
Vbi oomycetes2011 finalVbi oomycetes2011 final
Vbi oomycetes2011 final
 

Recently uploaded

philosophy and it's principles based on the life
philosophy and it's principles based on the lifephilosophy and it's principles based on the life
philosophy and it's principles based on the lifeNitinDeodare
 
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjj
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjjStl Algorithms in C++ jjjjjjjjjjjjjjjjjj
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjjMohammed Sikander
 
Spring gala 2024 photo slideshow - Celebrating School-Community Partnerships
Spring gala 2024 photo slideshow - Celebrating School-Community PartnershipsSpring gala 2024 photo slideshow - Celebrating School-Community Partnerships
Spring gala 2024 photo slideshow - Celebrating School-Community Partnershipsexpandedwebsite
 
Envelope of Discrepancy in Orthodontics: Enhancing Precision in Treatment
 Envelope of Discrepancy in Orthodontics: Enhancing Precision in Treatment Envelope of Discrepancy in Orthodontics: Enhancing Precision in Treatment
Envelope of Discrepancy in Orthodontics: Enhancing Precision in Treatmentsaipooja36
 
An overview of the various scriptures in Hinduism
An overview of the various scriptures in HinduismAn overview of the various scriptures in Hinduism
An overview of the various scriptures in HinduismDabee Kamal
 
When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...Gary Wood
 
MOOD STABLIZERS DRUGS.pptx
MOOD     STABLIZERS           DRUGS.pptxMOOD     STABLIZERS           DRUGS.pptx
MOOD STABLIZERS DRUGS.pptxPoojaSen20
 
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...Nguyen Thanh Tu Collection
 
How to Manage Closest Location in Odoo 17 Inventory
How to Manage Closest Location in Odoo 17 InventoryHow to Manage Closest Location in Odoo 17 Inventory
How to Manage Closest Location in Odoo 17 InventoryCeline George
 
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45MysoreMuleSoftMeetup
 
Removal Strategy _ FEFO _ Working with Perishable Products in Odoo 17
Removal Strategy _ FEFO _ Working with Perishable Products in Odoo 17Removal Strategy _ FEFO _ Working with Perishable Products in Odoo 17
Removal Strategy _ FEFO _ Working with Perishable Products in Odoo 17Celine George
 
Đề tieng anh thpt 2024 danh cho cac ban hoc sinh
Đề tieng anh thpt 2024 danh cho cac ban hoc sinhĐề tieng anh thpt 2024 danh cho cac ban hoc sinh
Đề tieng anh thpt 2024 danh cho cac ban hoc sinhleson0603
 
The basics of sentences session 4pptx.pptx
The basics of sentences session 4pptx.pptxThe basics of sentences session 4pptx.pptx
The basics of sentences session 4pptx.pptxheathfieldcps1
 
Book Review of Run For Your Life Powerpoint
Book Review of Run For Your Life PowerpointBook Review of Run For Your Life Powerpoint
Book Review of Run For Your Life Powerpoint23600690
 
Improved Approval Flow in Odoo 17 Studio App
Improved Approval Flow in Odoo 17 Studio AppImproved Approval Flow in Odoo 17 Studio App
Improved Approval Flow in Odoo 17 Studio AppCeline George
 
Capitol Tech Univ Doctoral Presentation -May 2024
Capitol Tech Univ Doctoral Presentation -May 2024Capitol Tech Univ Doctoral Presentation -May 2024
Capitol Tech Univ Doctoral Presentation -May 2024CapitolTechU
 
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽中 央社
 

Recently uploaded (20)

Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"
Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"
Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"
 
philosophy and it's principles based on the life
philosophy and it's principles based on the lifephilosophy and it's principles based on the life
philosophy and it's principles based on the life
 
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjj
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjjStl Algorithms in C++ jjjjjjjjjjjjjjjjjj
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjj
 
Spring gala 2024 photo slideshow - Celebrating School-Community Partnerships
Spring gala 2024 photo slideshow - Celebrating School-Community PartnershipsSpring gala 2024 photo slideshow - Celebrating School-Community Partnerships
Spring gala 2024 photo slideshow - Celebrating School-Community Partnerships
 
Envelope of Discrepancy in Orthodontics: Enhancing Precision in Treatment
 Envelope of Discrepancy in Orthodontics: Enhancing Precision in Treatment Envelope of Discrepancy in Orthodontics: Enhancing Precision in Treatment
Envelope of Discrepancy in Orthodontics: Enhancing Precision in Treatment
 
An overview of the various scriptures in Hinduism
An overview of the various scriptures in HinduismAn overview of the various scriptures in Hinduism
An overview of the various scriptures in Hinduism
 
When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...
 
MOOD STABLIZERS DRUGS.pptx
MOOD     STABLIZERS           DRUGS.pptxMOOD     STABLIZERS           DRUGS.pptx
MOOD STABLIZERS DRUGS.pptx
 
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
 
How to Manage Closest Location in Odoo 17 Inventory
How to Manage Closest Location in Odoo 17 InventoryHow to Manage Closest Location in Odoo 17 Inventory
How to Manage Closest Location in Odoo 17 Inventory
 
IPL Online Quiz by Pragya; Question Set.
IPL Online Quiz by Pragya; Question Set.IPL Online Quiz by Pragya; Question Set.
IPL Online Quiz by Pragya; Question Set.
 
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
 
Removal Strategy _ FEFO _ Working with Perishable Products in Odoo 17
Removal Strategy _ FEFO _ Working with Perishable Products in Odoo 17Removal Strategy _ FEFO _ Working with Perishable Products in Odoo 17
Removal Strategy _ FEFO _ Working with Perishable Products in Odoo 17
 
Đề tieng anh thpt 2024 danh cho cac ban hoc sinh
Đề tieng anh thpt 2024 danh cho cac ban hoc sinhĐề tieng anh thpt 2024 danh cho cac ban hoc sinh
Đề tieng anh thpt 2024 danh cho cac ban hoc sinh
 
The basics of sentences session 4pptx.pptx
The basics of sentences session 4pptx.pptxThe basics of sentences session 4pptx.pptx
The basics of sentences session 4pptx.pptx
 
Book Review of Run For Your Life Powerpoint
Book Review of Run For Your Life PowerpointBook Review of Run For Your Life Powerpoint
Book Review of Run For Your Life Powerpoint
 
Improved Approval Flow in Odoo 17 Studio App
Improved Approval Flow in Odoo 17 Studio AppImproved Approval Flow in Odoo 17 Studio App
Improved Approval Flow in Odoo 17 Studio App
 
Word Stress rules esl .pptx
Word Stress rules esl               .pptxWord Stress rules esl               .pptx
Word Stress rules esl .pptx
 
Capitol Tech Univ Doctoral Presentation -May 2024
Capitol Tech Univ Doctoral Presentation -May 2024Capitol Tech Univ Doctoral Presentation -May 2024
Capitol Tech Univ Doctoral Presentation -May 2024
 
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
 

Lecture 3,4

  • 1. Genome Sequencing Projects, Genome Size, Application of sequence information for identification of disease genes
  • 2. Complete Genome Sequencing  Whole genome shotgun sequencing  BAC end sequencing  Chromosome walking  End sealing
  • 4. Cost of Genome Sequencing
  • 5. Nextgen sequencing methods  454 sequencing methods(2006)  Principles of pyrophosphate detection(1985, 1988)  Illumina(Solexa) Genome sequencing methods(2007)  Applied Biosystems ABI SOLiD System(2007)  Helicos single molecule sequencing(Helioscope, 2007)  Pacific Biosciences single-molecule real-time(SMRT) technology, 2010  Sequenom for Nanotechnology based sequencing.  BioNanomatrixnanofluidiscs  RNAP technology http://www.ncbi.nlm.nih.gov/books/NBK20261/
  • 6. Sequencing methods http://www.wellcome.ac.uk/Education-resources/Teaching-and- education/Animations/DNA/WTDV026689.htm Ref: http://www.wellcome.ac.uk/Education-resources/Teaching-and- education/Animations/DNA/WTX056046.htm http://www.wellcome.ac.uk/Education-resources/Teaching-and- education/Animations/DNA/WTX056051.htm
  • 10. http://www.insdc.org/ http://www.ebi.ac.uk/embl /Contact/collaboration.ht ml
  • 11. Microbial Genome Sequencing • JGI – IMG [http://img.jgi.doe.gov/] • Broad [http://www.broadinstitute.org/] • TIGR [http://www.jcvi.org/] • WashU [http://genome.wustl.edu/] • VBI at Virginia Tech [www.vbi.vt.edu]
  • 12. Human Genome Project NHGRI Solicited RFAs were First pilot sought for Publicati proposal for full on in ENCODE ENCODE 2000 In October GWAS - Finished 90% lies First Report 1990 Human ENCODE paper in outside on Encode Genome coding published 2003 Published in project started 2005 2012 2007
  • 13. What happens next?  You have 10 million characters – what to do with them?  Locate genes  Determine the function of the gene  By similarity search  By domain search  By Predicting signal peptide  By locating transmembrane region Ref: http://www.nature.com/nature/journal/v406/n6797/pdf/406799a0.pdf
  • 14. Genome Annotation Run 6 frame Run Blastp ATGAAGATAGACAG translation with nr CATACTAGCAGCAT AGAATAGATAAGAG ATAGAAATAGAATA Matc h AATATAAGAGAGA found N o Repeat Finding, miRN Product found A Make an finding, tRNAs hmmsearch can etc. N O Pathway analysis Matc Other analysis h found Unknown Genes Hypothesis
  • 15. Genome Sizes  Gametic Nuclear DNA content  Represented as mass in pg(pico grams) or length in mega bases 1 pg = 10^-12 gms 1mb = 10^6 bases 1 pg = 978 Mb Ref: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1669731/
  • 16. Genome Sizes  Database of Genome Sizes  http://www.cbs.dtu.dk/databases/DOGS/  Plant Genome database  http://www.kew.org/genomesize/homepage.html  Mamalian genome size database  http://www.unipv.it/webbio/dbagsdb.htm  Animal Genome size database  www.genomesize.com  Fungal Genome size database.  www.zbi.ee/fungal-genomesize
  • 17.
  • 22. Identifying Human Disease genes ref: http://www.ncbi.nlm.nih.gov/books/NBK7561/  Before 1980, very few genes were recognized  Reverse Genetics: Know gene product and go back to gene and do a positional cloning  Genetic Redundancy: Multiple genes have the same function
  • 23. Identification of genes through protein product
  • 24. 1000 genomes project  1092 genomes of different individuals sequenced.  14 populations  Low coverage exome sequencing 38 million SNPs 1.4 million short insertions 14,000 large deletions Ref: http://www.nature.com/nature/journal/v491/n7422/full/nature11632.html