SlideShare una empresa de Scribd logo
1 de 8
Genome Sequencing: Shotgun and Clone Contig Method
Dr. Naveen Gaurav
Associate Professor and Head
Department of Biotechnology
Shri Guru Ram Rai University
Dehradun
Genome Sequencing: Shotgun Method
In shotgun sequencing the genome is broken randomly into short fragments (1 to 2 kbp long)
suitable for sequencing. The fragments are ligated into a suitable vector and then partially
sequenced. Around 400–500 bp of sequence can be generated from each fragment in a
single sequencing run. In some cases, both ends of a fragment are sequenced. Computerized
searching for overlaps between individual sequences then assembles the complete
sequence. Overlapping sequences are assembled to generate contigs (Fig. 1). The term
contig refers to a known DNA sequence that is contiguous and lacks gaps.
Since fragments are cloned at random, duplicates will quite often be sequenced. To get full
coverage the total amount of sequence obtained must therefore be several times that of the
genome to allow for duplications. For example, 99.8% coverage requires a total amount of
sequence that is 6- to 8-fold the genome size. In principle, all that is required to assemble a
genome, however large, from small sequences is a sufficiently powerful computer. No
genetic map or prior information is needed about the organism whose genome is to be
sequenced. The original limitation to shotgun sequencing was the massive data handling that
is required. The development of faster computers overcame this problem. The first bacterial
genome to be sequenced was Haemophilus influenza. The sequence was deduced from just
under 25,000 sequences averaging 480 bp each. This gave a total of almost 12 million bp of
sequence—six times the genome size. Computerized assembly using overlaps resulted in 140
regions of contiguous sequence—that is, 140 contigs. The bacterium Haemophilus had the
honor of being the first organism to be totally sequenced.
Figure 1. Shotgun Sequencing
The first step in shotgun
sequencing an entire genome
is to digest the genome into a
large number of small
fragments suitable for
sequencing. All the small
fragments are then cloned and
sequenced. Computers
analyze the sequence data for
overlapping regions and
assemble the sequences into
several large contigs. Since
some regions of the genome
are unstable when cloned,
some gaps may remain even
after this procedure is
repeated several times.
The gaps between the contigs may be closed by more individualistic procedures. The
easiest method is to re-screen the original set of clones with pairs of probes corresponding
to sequences on the two sides of each gap. Clones that hybridize to both members of such
a pair of probes presumably carry DNA that bridges the gap between two contigs. Such
clones are then sequenced in full to close the gaps between contigs. However, many of the
gaps between contigs are due to regions of DNA that are unstable when cloned, especially
in a multicopy vector. Therefore, a second library in a different vector, often a single copy
vector such as a lambda phage, is often used during the later stages of shotgun cloning.
Pairs of end-of-contig probes are used to screen the new library for clones that hybridize
to both probes and carry DNA that bridges the gap between the two contigs (Fig. 2A). A
third approach, which avoids cloning altogether, is to run PCR reactions on whole genomic
DNA using random pairs of PCR primers corresponding to contig ends. A PCR product will
result only if the two contig ends are within a few kb of each other (Fig. 2B).
Figure 2. Closing Gaps between
Contigs
To identify gaps between
contigs, probes or primers are
made that correspond to the
ends of the contigs (pink). In (A)
a new library of clones (green) is
screened with end-of-contig
probes. Clones that hybridize to
probes from two sides of a gap
are isolated. In this example, a
probe for the end of contig #3
(3b) and the beginning of contig
#4 (4a) hybridize to the
fragment shown. Therefore, the
sequence of this clone should
close the gap between contig #3
and #4. (B) The second approach
uses PCR primers that
correspond to the ends of
contigs to amplify genomic DNA.
If the primer pair is within a few
kilobases of each other, a PCR
product is made and can be
sequenced.
Hierarchical shotgun sequencing
Although shotgun sequencing can in theory be applied to a genome of any size, its direct
application to the sequencing of large genomes (for instance, the human genome) was
limited until the late 1990s, when technological advances made practical the handling of the
vast quantities of complex data involved in the process. Historically, full-genome shotgun
sequencing was believed to be limited by both the sheer size of large genomes and by the
complexity added by the high percentage of repetitive DNA (greater than 50% for the
human genome) present in large genomes. It was not widely accepted that a full-genome
shotgun sequence of a large genome would provide reliable data. For these reasons, other
strategies that lowered the computational load of sequence assembly had to be utilized
before shotgun sequencing was performed. In hierarchical sequencing, also known as top-
down sequencing, a low-resolution physical map of the genome is made prior to actual
sequencing. From this map, a minimal number of fragments that cover the entire
chromosome are selected for sequencing. In this way, the minimum amount of high-
throughput sequencing and assembly is required.
The amplified genome is first sheared into larger pieces (50-200kb) and cloned into a
bacterial host using BACs or P1-derived artificial chromosomes (PAC). Because multiple
genome copies have been sheared at random, the fragments contained in these clones have
different ends, and with enough coverage (see section above) finding a scaffold of BAC
contigs that covers the entire genome is theoretically possible. This scaffold is called a tiling
path. Once a tiling path has been found, the BACs that form this path are sheared at random
into smaller fragments and can be sequenced using the shotgun method on a smaller scale.
Although the full sequences of the BAC contigs is not known, their orientations relative to one
another are known. There are several methods for deducing this order and selecting the BACs
that make up a tiling path. The general strategy involves identifying the positions of the clones
relative to one another and then selecting the fewest clones required to form a contiguous
scaffold that covers the entire area of interest. The order of the clones is deduced by
determining the way in which they overlap. Overlapping clones can be identified in several ways.
A small radioactively or chemically labeled probe containing a sequence-tagged site (STS) can be
hybridized onto a microarray upon which the clones are printed. In this way, all the clones that
contain a particular sequence in the genome are identified. The end of one of these clones can
then be sequenced to yield a new probe and the process repeated in a method called
chromosome walking. Alternatively, the BAC library can be restriction-digested. Two clones that
have several fragment sizes in common are inferred to overlap because they contain multiple
similarly spaced restriction sites in common. This method of genomic mapping is called
restriction fingerprinting because it identifies a set of restriction sites contained in each clone.
Once the overlap between the clones has been found and their order relative to the genome
known, a scaffold of a minimal subset of these contigs that covers the entire genome is shotgun-
sequenced. Because it involves first creating a low-resolution map of the genome, hierarchical
shotgun sequencing is slower than whole-genome shotgun sequencing, but relies less heavily on
computer algorithms than whole-genome shotgun sequencing. The process of extensive BAC
library creation and tiling path selection, however, make hierarchical shotgun sequencing slow
and labor-intensive. Now that the technology is available and the reliability of the data
demonstrated, the speed and cost efficiency of whole-genome shotgun sequencing has made it
the primary method for genome sequencing.
Thank you
References: Online notes, notes from research papers and Books by google search Engine

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Protein purification
Protein purificationProtein purification
Protein purification
 
Genomic and c dna library
Genomic and c dna libraryGenomic and c dna library
Genomic and c dna library
 
Synthesis and cloning of c dna
Synthesis and cloning of c dnaSynthesis and cloning of c dna
Synthesis and cloning of c dna
 
Genomic library
Genomic libraryGenomic library
Genomic library
 
pET vectors
pET vectorspET vectors
pET vectors
 
bacterial artificial chromosome & yeast artificial chromosome
bacterial artificial chromosome & yeast artificial chromosomebacterial artificial chromosome & yeast artificial chromosome
bacterial artificial chromosome & yeast artificial chromosome
 
Library screening
Library screeningLibrary screening
Library screening
 
Electrophoretic mobility shift assay
Electrophoretic mobility shift assay Electrophoretic mobility shift assay
Electrophoretic mobility shift assay
 
2d Page
2d Page2d Page
2d Page
 
Genome mapping
Genome mapping Genome mapping
Genome mapping
 
Expression vectors
Expression vectorsExpression vectors
Expression vectors
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Methylases
MethylasesMethylases
Methylases
 
Pyrosequencing
PyrosequencingPyrosequencing
Pyrosequencing
 
MODIFYING ENZYMES
MODIFYING ENZYMESMODIFYING ENZYMES
MODIFYING ENZYMES
 
Genomic library construction
Genomic library constructionGenomic library construction
Genomic library construction
 
Probe labelling
Probe labellingProbe labelling
Probe labelling
 
Artificial chromosomes - YAC and BAC
Artificial chromosomes - YAC and BACArtificial chromosomes - YAC and BAC
Artificial chromosomes - YAC and BAC
 
Labelling of dna
Labelling of dnaLabelling of dna
Labelling of dna
 
8. Biology and characterization of cultured cells
8. Biology and characterization of cultured cells8. Biology and characterization of cultured cells
8. Biology and characterization of cultured cells
 

Similar a Shotgun and clone contig method

Genome Assembly copy
Genome Assembly   copyGenome Assembly   copy
Genome Assembly copyPradeep Kumar
 
Unilag workshop complex genome analysis
Unilag workshop   complex genome analysisUnilag workshop   complex genome analysis
Unilag workshop complex genome analysisDr. Olusoji Adewumi
 
Restriction mapping of bacterial dna
Restriction mapping of bacterial dnaRestriction mapping of bacterial dna
Restriction mapping of bacterial dnaAtai Rabby
 
Nucleic Acid Hybridisation & Gene Mapping
Nucleic Acid Hybridisation & Gene MappingNucleic Acid Hybridisation & Gene Mapping
Nucleic Acid Hybridisation & Gene Mappingmgsonline
 
Nucleic Acid Hybridisation
Nucleic Acid HybridisationNucleic Acid Hybridisation
Nucleic Acid Hybridisationmgsonline
 
Nucleic Acid Hybridisation & Gene Mapping
Nucleic Acid Hybridisation & Gene MappingNucleic Acid Hybridisation & Gene Mapping
Nucleic Acid Hybridisation & Gene Mappingmusselburghgrammar
 
Construction of human gene map through map integration- from genetic map to p...
Construction of human gene map through map integration- from genetic map to p...Construction of human gene map through map integration- from genetic map to p...
Construction of human gene map through map integration- from genetic map to p...Central University Of Kerala
 
How to sequence a large eukaryotic genome
How to sequence a large eukaryotic genomeHow to sequence a large eukaryotic genome
How to sequence a large eukaryotic genomeLex Nederbragt
 
Rna lecture
Rna lectureRna lecture
Rna lecturenishulpu
 
DNA-Protein interaction by 3C based method.pptx
DNA-Protein interaction by 3C based method.pptxDNA-Protein interaction by 3C based method.pptx
DNA-Protein interaction by 3C based method.pptxKashvi Jadia
 
Apollo : A workshop for the Manakin Research Coordination Network
Apollo: A workshop for the Manakin Research Coordination NetworkApollo: A workshop for the Manakin Research Coordination Network
Apollo : A workshop for the Manakin Research Coordination NetworkMonica Munoz-Torres
 

Similar a Shotgun and clone contig method (20)

CROP GENOME SEQUENCING
CROP GENOME SEQUENCINGCROP GENOME SEQUENCING
CROP GENOME SEQUENCING
 
EiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.DEiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.D
 
Genome Assembly copy
Genome Assembly   copyGenome Assembly   copy
Genome Assembly copy
 
Unilag workshop complex genome analysis
Unilag workshop   complex genome analysisUnilag workshop   complex genome analysis
Unilag workshop complex genome analysis
 
Shotgun (2) metagenomics
Shotgun (2) metagenomicsShotgun (2) metagenomics
Shotgun (2) metagenomics
 
Restriction mapping of bacterial dna
Restriction mapping of bacterial dnaRestriction mapping of bacterial dna
Restriction mapping of bacterial dna
 
Nucleic Acid Hybridisation & Gene Mapping
Nucleic Acid Hybridisation & Gene MappingNucleic Acid Hybridisation & Gene Mapping
Nucleic Acid Hybridisation & Gene Mapping
 
Nucleic Acid Hybridisation
Nucleic Acid HybridisationNucleic Acid Hybridisation
Nucleic Acid Hybridisation
 
Nucleic Acid Hybridisation & Gene Mapping
Nucleic Acid Hybridisation & Gene MappingNucleic Acid Hybridisation & Gene Mapping
Nucleic Acid Hybridisation & Gene Mapping
 
EiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.DEiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.D
 
Construction of human gene map through map integration- from genetic map to p...
Construction of human gene map through map integration- from genetic map to p...Construction of human gene map through map integration- from genetic map to p...
Construction of human gene map through map integration- from genetic map to p...
 
How to sequence a large eukaryotic genome
How to sequence a large eukaryotic genomeHow to sequence a large eukaryotic genome
How to sequence a large eukaryotic genome
 
Chromosome walking
Chromosome walkingChromosome walking
Chromosome walking
 
Rna lecture
Rna lectureRna lecture
Rna lecture
 
Microbiology Assignment Help
Microbiology Assignment HelpMicrobiology Assignment Help
Microbiology Assignment Help
 
Molecular mapping
Molecular mappingMolecular mapping
Molecular mapping
 
DNA-Protein interaction by 3C based method.pptx
DNA-Protein interaction by 3C based method.pptxDNA-Protein interaction by 3C based method.pptx
DNA-Protein interaction by 3C based method.pptx
 
Samim
SamimSamim
Samim
 
Apollo : A workshop for the Manakin Research Coordination Network
Apollo: A workshop for the Manakin Research Coordination NetworkApollo: A workshop for the Manakin Research Coordination Network
Apollo : A workshop for the Manakin Research Coordination Network
 
Physical mapping
Physical mappingPhysical mapping
Physical mapping
 

Más de Dr. Naveen Gaurav srivastava

Polymerase chain reaction andRestriction fragment length polymorphism (RFLP):...
Polymerase chain reaction andRestriction fragment length polymorphism (RFLP):...Polymerase chain reaction andRestriction fragment length polymorphism (RFLP):...
Polymerase chain reaction andRestriction fragment length polymorphism (RFLP):...Dr. Naveen Gaurav srivastava
 
Intracellular Compartments / Intracellular fluid
Intracellular Compartments / Intracellular fluidIntracellular Compartments / Intracellular fluid
Intracellular Compartments / Intracellular fluidDr. Naveen Gaurav srivastava
 
Biotechnology for the livestock improvements and phb degradation
Biotechnology for the livestock improvements and phb degradationBiotechnology for the livestock improvements and phb degradation
Biotechnology for the livestock improvements and phb degradationDr. Naveen Gaurav srivastava
 
Environmental biotech and plant tissue culture protocols
Environmental biotech and plant tissue culture protocolsEnvironmental biotech and plant tissue culture protocols
Environmental biotech and plant tissue culture protocolsDr. Naveen Gaurav srivastava
 
Web based servers and softwares for genome analysis
Web based servers and softwares for genome analysisWeb based servers and softwares for genome analysis
Web based servers and softwares for genome analysisDr. Naveen Gaurav srivastava
 
Biotechnology for the livestock improvements and phb degradation
Biotechnology for the livestock improvements and phb degradationBiotechnology for the livestock improvements and phb degradation
Biotechnology for the livestock improvements and phb degradationDr. Naveen Gaurav srivastava
 
Treatment of municipal waste and industrial effluents
Treatment of municipal waste and industrial effluentsTreatment of municipal waste and industrial effluents
Treatment of municipal waste and industrial effluentsDr. Naveen Gaurav srivastava
 

Más de Dr. Naveen Gaurav srivastava (20)

Global environmental problems
Global environmental problemsGlobal environmental problems
Global environmental problems
 
Polymerase chain reaction andRestriction fragment length polymorphism (RFLP):...
Polymerase chain reaction andRestriction fragment length polymorphism (RFLP):...Polymerase chain reaction andRestriction fragment length polymorphism (RFLP):...
Polymerase chain reaction andRestriction fragment length polymorphism (RFLP):...
 
Protocols for genomics and proteomics
Protocols for genomics and proteomics Protocols for genomics and proteomics
Protocols for genomics and proteomics
 
Intracellular Compartments / Intracellular fluid
Intracellular Compartments / Intracellular fluidIntracellular Compartments / Intracellular fluid
Intracellular Compartments / Intracellular fluid
 
Types of receptors
Types of receptors Types of receptors
Types of receptors
 
Hydrogen production by microbes
Hydrogen production by microbesHydrogen production by microbes
Hydrogen production by microbes
 
Permanent Tissues of Plants
Permanent Tissues of PlantsPermanent Tissues of Plants
Permanent Tissues of Plants
 
Biotechnology for the livestock improvements and phb degradation
Biotechnology for the livestock improvements and phb degradationBiotechnology for the livestock improvements and phb degradation
Biotechnology for the livestock improvements and phb degradation
 
Environmental biotech and plant tissue culture protocols
Environmental biotech and plant tissue culture protocolsEnvironmental biotech and plant tissue culture protocols
Environmental biotech and plant tissue culture protocols
 
Monoclonal and polyclonal in diagnostics
Monoclonal and polyclonal in diagnosticsMonoclonal and polyclonal in diagnostics
Monoclonal and polyclonal in diagnostics
 
Web based servers and softwares for genome analysis
Web based servers and softwares for genome analysisWeb based servers and softwares for genome analysis
Web based servers and softwares for genome analysis
 
Enzyme immuno assay and radioimmunoassay
Enzyme immuno assay and radioimmunoassayEnzyme immuno assay and radioimmunoassay
Enzyme immuno assay and radioimmunoassay
 
Biotechnology for the livestock improvements and phb degradation
Biotechnology for the livestock improvements and phb degradationBiotechnology for the livestock improvements and phb degradation
Biotechnology for the livestock improvements and phb degradation
 
Sequence assembly
Sequence assemblySequence assembly
Sequence assembly
 
Vaccine production in plants
Vaccine production in plantsVaccine production in plants
Vaccine production in plants
 
Treatment of municipal waste and industrial effluents
Treatment of municipal waste and industrial effluentsTreatment of municipal waste and industrial effluents
Treatment of municipal waste and industrial effluents
 
Somaclonal variations
Somaclonal variationsSomaclonal variations
Somaclonal variations
 
Solid waste management
Solid waste managementSolid waste management
Solid waste management
 
Secondary metabolites
Secondary metabolitesSecondary metabolites
Secondary metabolites
 
Resistance to biotic stresses
Resistance to biotic stressesResistance to biotic stresses
Resistance to biotic stresses
 

Último

This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docxPoojaSen20
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfSanaAli374401
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docxPoojaSen20
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxnegromaestrong
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterMateoGardella
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.MateoGardella
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 

Último (20)

This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdf
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch Letter
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 

Shotgun and clone contig method

  • 1. Genome Sequencing: Shotgun and Clone Contig Method Dr. Naveen Gaurav Associate Professor and Head Department of Biotechnology Shri Guru Ram Rai University Dehradun
  • 2. Genome Sequencing: Shotgun Method In shotgun sequencing the genome is broken randomly into short fragments (1 to 2 kbp long) suitable for sequencing. The fragments are ligated into a suitable vector and then partially sequenced. Around 400–500 bp of sequence can be generated from each fragment in a single sequencing run. In some cases, both ends of a fragment are sequenced. Computerized searching for overlaps between individual sequences then assembles the complete sequence. Overlapping sequences are assembled to generate contigs (Fig. 1). The term contig refers to a known DNA sequence that is contiguous and lacks gaps. Since fragments are cloned at random, duplicates will quite often be sequenced. To get full coverage the total amount of sequence obtained must therefore be several times that of the genome to allow for duplications. For example, 99.8% coverage requires a total amount of sequence that is 6- to 8-fold the genome size. In principle, all that is required to assemble a genome, however large, from small sequences is a sufficiently powerful computer. No genetic map or prior information is needed about the organism whose genome is to be sequenced. The original limitation to shotgun sequencing was the massive data handling that is required. The development of faster computers overcame this problem. The first bacterial genome to be sequenced was Haemophilus influenza. The sequence was deduced from just under 25,000 sequences averaging 480 bp each. This gave a total of almost 12 million bp of sequence—six times the genome size. Computerized assembly using overlaps resulted in 140 regions of contiguous sequence—that is, 140 contigs. The bacterium Haemophilus had the honor of being the first organism to be totally sequenced.
  • 3. Figure 1. Shotgun Sequencing The first step in shotgun sequencing an entire genome is to digest the genome into a large number of small fragments suitable for sequencing. All the small fragments are then cloned and sequenced. Computers analyze the sequence data for overlapping regions and assemble the sequences into several large contigs. Since some regions of the genome are unstable when cloned, some gaps may remain even after this procedure is repeated several times.
  • 4. The gaps between the contigs may be closed by more individualistic procedures. The easiest method is to re-screen the original set of clones with pairs of probes corresponding to sequences on the two sides of each gap. Clones that hybridize to both members of such a pair of probes presumably carry DNA that bridges the gap between two contigs. Such clones are then sequenced in full to close the gaps between contigs. However, many of the gaps between contigs are due to regions of DNA that are unstable when cloned, especially in a multicopy vector. Therefore, a second library in a different vector, often a single copy vector such as a lambda phage, is often used during the later stages of shotgun cloning. Pairs of end-of-contig probes are used to screen the new library for clones that hybridize to both probes and carry DNA that bridges the gap between the two contigs (Fig. 2A). A third approach, which avoids cloning altogether, is to run PCR reactions on whole genomic DNA using random pairs of PCR primers corresponding to contig ends. A PCR product will result only if the two contig ends are within a few kb of each other (Fig. 2B).
  • 5. Figure 2. Closing Gaps between Contigs To identify gaps between contigs, probes or primers are made that correspond to the ends of the contigs (pink). In (A) a new library of clones (green) is screened with end-of-contig probes. Clones that hybridize to probes from two sides of a gap are isolated. In this example, a probe for the end of contig #3 (3b) and the beginning of contig #4 (4a) hybridize to the fragment shown. Therefore, the sequence of this clone should close the gap between contig #3 and #4. (B) The second approach uses PCR primers that correspond to the ends of contigs to amplify genomic DNA. If the primer pair is within a few kilobases of each other, a PCR product is made and can be sequenced.
  • 6. Hierarchical shotgun sequencing Although shotgun sequencing can in theory be applied to a genome of any size, its direct application to the sequencing of large genomes (for instance, the human genome) was limited until the late 1990s, when technological advances made practical the handling of the vast quantities of complex data involved in the process. Historically, full-genome shotgun sequencing was believed to be limited by both the sheer size of large genomes and by the complexity added by the high percentage of repetitive DNA (greater than 50% for the human genome) present in large genomes. It was not widely accepted that a full-genome shotgun sequence of a large genome would provide reliable data. For these reasons, other strategies that lowered the computational load of sequence assembly had to be utilized before shotgun sequencing was performed. In hierarchical sequencing, also known as top- down sequencing, a low-resolution physical map of the genome is made prior to actual sequencing. From this map, a minimal number of fragments that cover the entire chromosome are selected for sequencing. In this way, the minimum amount of high- throughput sequencing and assembly is required. The amplified genome is first sheared into larger pieces (50-200kb) and cloned into a bacterial host using BACs or P1-derived artificial chromosomes (PAC). Because multiple genome copies have been sheared at random, the fragments contained in these clones have different ends, and with enough coverage (see section above) finding a scaffold of BAC contigs that covers the entire genome is theoretically possible. This scaffold is called a tiling path. Once a tiling path has been found, the BACs that form this path are sheared at random into smaller fragments and can be sequenced using the shotgun method on a smaller scale.
  • 7. Although the full sequences of the BAC contigs is not known, their orientations relative to one another are known. There are several methods for deducing this order and selecting the BACs that make up a tiling path. The general strategy involves identifying the positions of the clones relative to one another and then selecting the fewest clones required to form a contiguous scaffold that covers the entire area of interest. The order of the clones is deduced by determining the way in which they overlap. Overlapping clones can be identified in several ways. A small radioactively or chemically labeled probe containing a sequence-tagged site (STS) can be hybridized onto a microarray upon which the clones are printed. In this way, all the clones that contain a particular sequence in the genome are identified. The end of one of these clones can then be sequenced to yield a new probe and the process repeated in a method called chromosome walking. Alternatively, the BAC library can be restriction-digested. Two clones that have several fragment sizes in common are inferred to overlap because they contain multiple similarly spaced restriction sites in common. This method of genomic mapping is called restriction fingerprinting because it identifies a set of restriction sites contained in each clone. Once the overlap between the clones has been found and their order relative to the genome known, a scaffold of a minimal subset of these contigs that covers the entire genome is shotgun- sequenced. Because it involves first creating a low-resolution map of the genome, hierarchical shotgun sequencing is slower than whole-genome shotgun sequencing, but relies less heavily on computer algorithms than whole-genome shotgun sequencing. The process of extensive BAC library creation and tiling path selection, however, make hierarchical shotgun sequencing slow and labor-intensive. Now that the technology is available and the reliability of the data demonstrated, the speed and cost efficiency of whole-genome shotgun sequencing has made it the primary method for genome sequencing.
  • 8. Thank you References: Online notes, notes from research papers and Books by google search Engine