SlideShare a Scribd company logo
1 of 25
C. elegans Genetics

 C.elegans has 2 sexes, self fertilizing hermaphrodites and males.

 Sex determined chromosomally - XX-hermaphrodite, X-male.

 Diploid for 5 autosomes.

 Standard classical genetic techniques can be applied.

 Life cycle – Zygote to adult ~3 days.

 Grow on petri dish – they eat bacteria.

 Can store them frozen in liquid nitrogen indefinately.




   Why might the hermaphrodite sex be useful for genetics?
Chromosome I            Genetic mapping.

Left arm    m.u.   bli-3
                                    m.u. = map unit.
            -15    egl-30
                                    Genetic mapping – recombination.
                   mab-20
            -10
                                    1 m.u. is 1% recombination per meiosis.
             -5     fog-1
                    unc-73 unc-57
Central      0      dpy-5
                           dpy-14
cluster             fer-1
             5      lin-11 unc-29

                    unc-75                Parent              Recombinant
            10
                    unc-101
            15

            20      glp-4             fog-1     +       fog-1           +
            25
                    unc-54            glp-4     +         +             glp-4
Right arm
We want to understand how life works – at the molecular level.

We had mutant genes with informative phenotypes.

The mutated genes were mapped onto linkage groups – chromosomes.

What kinds of proteins do these genes encode and how do these proteins
function?



   In 1983, identifying the molecular sequence of a gene
   defined by mutation was a complicated and time
   consuming business, even in the worm.

   If we only new the sequence of the genome!
As the term applies to recombinant DNA, what is a clone?




                            Starting with DNA extracted from
                            any organism,
             Vector
                            How can you take that and get one
                            single fragment into a vector and
                            grow billions of copies of that
                            single “cloned” molecule?


        Cloned DNA insert
C. elegans Genome Project




                                                                             unc-101
                                                                    unc-75




                                                                                                    unc-54
                                                  unc-73
                                         mab-20




                                                           lin-11
                                                  dpy-5




                                                                                       glp-4
                                                  fog-1
                                egl-30




                                                           fer-1
 Mutants - function



                       bli-3
     Genetic map




                                                                                               25
                                                                      10

                                                                                15

                                                                                       20
                                                       0

                                                              5
                               -15

                                           -10

                                                  -5
    Chromosomes        AACGTTCCACG.......

DNA sequence – genes                                                                                         Cloned DNA
and proteins                                                                                                 fragments




 Identify DNA sequences corresponding to genes defined by mutation.
If you wanted to clone sections of chromosomes for
     sequencing, how many copies of each chromosome
     would you start with?




  DNA




Of the order of millions – millions of copies of each chromosome
Purified genomic DNA




Fragment the chromosomal
DNA – either restriction
enzyme or mechanical shear.
Cloning methods used by the C. elegans genome project
                 Cosmid clones – ~ 40 Kb insert size – Genomic Library.


   Cosmid cloning vector                                  Linearised cosmid vector



                                                           Random fragments of
                                                           genomic DNA –
   Drug resistance marker
   E. coli origin of replication                           millions of them.
   cos site
   Useful restriction sites             DNA Ligase




Long concatenates of cosmid
vectors interspaced with
random fragments of
genomic DNA.
Mixed population “inserts”         In vitro lambda packaging
                                           extracts

                                                Lambda Terminase

                                                Other phage proteins

     COS sites in
     cosmid vector


                                 E. coli


             Critical step




Phage “transfects” single
cosmid into an E. coli cell.
CLONING
 This is a clone
                               Cells are plated onto medium with antibiotic selection.

                               Cells grown up to form bacterial colonies.
   Insert X
                               Each colony is derived from a single transfected cell.

                               Each colony is a clonal population.

                                          E. coli - clonal population with a single
                                          cosmid clone – single genomic DNA
                                          fragment.

                                                Billions of copies of one cloned
                                                insert.
                                                Freeze it for storage.
                                                Purify cosmid DNA.
                                                Sequence the insert.
Solid medium on plates   Liquid culture         Sub-clone fragments etc.
Started with many millions of different fragments of chromosomal DNA in
one tube.

End up with potentially millions of CLONED fragments, each in a different
E.coli colony – or culture.
We have got as far as random cloned fragments of genomic DNA.

What next?



Average cosmid insert size – 40 Kb

C.elegans genome ~100.3 Mb = 100,300 Kb

100,300/40 = 2,507.5

i.e. ~2,500 cosmid clones could contain the entire C. elegans
genome – but WOULD they?
In principle, 2500 cosmid clones could contain all the DNA of the C. elegans
    genome.

    Why not just start sequencing ~2500 clones picked at random?


  Imagine this:

  I give you a large and awkwardly shaped dice with 2500 faces, with a single
  number on each face, the numbers 1-2500.

  Roll the dice and write down the number on top.
  Repeat this – again and again and…….

  How many times would you have to roll the dice so that every face of the dice
  would have been on top at least once?


~ 4x2500 will give ~95% probability of any one side or DNA fragment, appearing.
~10x2500 raises probability to ~99%
The Golden Path
What if you could identify clones that overlapped slightly with ones another?



                                                How can we get these clones?


    Cloned DNA fragments – moderate overlaps.



With this approach you could sequence the entire genome by
sequencing less than 5000 cosmid clones (2x2500)
Cosmid fingerprinting

          1. Restriction digest of cosmid DNA.
          2. Separate fragments according to size by gel electrophoresis.
          3. Digitise the ladder of different sized DNA fragments obtained.

        Multiple common fragments – clones probably overlap.

        C. elegans genome project, ~17,000 cosmid clones fingerprinted.
A B C
        Assembled into “contigs” – overlapping clones.



                   “Contig”              ~17,000 random cosmid clones
          A                              Fingerprinting ~700 contigs
              B
                    C
                          D                C.elegans genome 100 Mb
                                           ~2,500 cosmid clones
700 contigs.

What is the minimum number of contigs the C. elegans genome could be
contained in?

Or – how would we know when we had succeeded in joining all the contigs?




 A method of filling the gaps – joining the contigs – was needed.
YACs – Yeast Artificial Chromosomes

DNA inserts of ~100 kb – 2 Mb.

Grown in yeast.

Clonal growth of yeast colonies, much like cosmids in E. coli.

YAC DNA separated by pulsed-field gel electrophoresis.




        C. elegans genome is ~100 Mb.

        Cosmid clones – approximately 40 kb inserts.
        YAC clones – select average 500 kb inserts.

        ~2500 cosmid clones would permit 1x coverage of the genome.
        ~200 YAC clones would permit 1x coverage of the genome.
~17,000 fingerprinted cosmid clones – ~700 unlinked contigs.


Cosmid clone
contigs


                                        ?                                                ?



6 Chromosomes      AACGTTCCACG.......




                                                                       unc-101
                                                              unc-75




                                                                                              unc-54
                                            unc-73
                                   mab-20




                                                     lin-11
                                            dpy-5




                                                                                 glp-4
                                            fog-1
                          egl-30




                                                     fer-1
                 bli-3




Genetic map

                                                                          15
                                                                10



                                                                                 20

                                                                                         25
                                                 0

                                                        5
                         -15

                                     -10

                                            -5
Joining up the contigs

              Contig X                                  Contig Y



                                  YAC clone


~700 contigs – grids of
representative cosmid clones.          • Large YAC clones (> 1Mb).
                                       • Purify YAC DNA – (PFGE).
                                       • Radio-label YAC DNA.
                                       • Hybridise to cosmid grid.
                                       • Expose to X-ray film.



                                 Linked cosmid clones
unc-101
                                                                   unc-75




                                                                                                   unc-54
                                                 unc-73
                                        mab-20




                                                          lin-11
                                                 dpy-5




                                                                                      glp-4
                                                 fog-1
                               egl-30




                                                          fer-1
                      bli-3
       Genetic map




                                                                     10

                                                                               15

                                                                                      20

                                                                                              25
                                                      0

                                                             5
                              -15

                                          -10

                                                 -5
A physical map of the genome - the “Golden Path” – chromosomes represented in ordered
overlapping clones or “clone contigs”.

YACs
    Cosmids




                     The Sequence of The Genome
Sequencing the C. elegans Genome




                              Individual cosmid clone.
                                  Randomly fragmented and shotgun cloned into
                                  sequencing vectors.
                                  Generally smaller insert size is best for primary
                                  sequence determination – 2-10 Kb.



Sequence of cosmid or YAC etc, determined and compiled in silico.
Finishing – directed cloning to fill in any gaps.

Check for overlap of sequence with overlapping cosmids.
Gaps between cosmid contigs ~20% of genome.

Most of these gaps were not random. They contained regions that could not be
cloned in cosmids.




YAC clones covering most of the gaps.

YAC DNA shotgun cloned into M13 or plasmid vectors.

Most of the DNA contained in these awkward regions was successfully sub-cloned
into small insert size vectors, and sequenced.

The sequence as published in December 1998 was generated from:
2527 cosmids, 257 YACs, 113 fosmids, 44 PCR products.
C. elegans cosmid K06A5, 24323 bp.
Flat sequence file –3955 bp shown.

>CEK06A5
acaagagagggcgcctcggccgtatgttgaatgggagatcgatggaaccgagacaacgagaaaaggaatagagacggagaaagagagagagagcgcgcgttgttggaaggatg
aaaaagaaaaaagacatgagctgcttcacaagagcttggcgaaagcaaagggcaaagtgttgacagcttagtggtggtagttggatcttctctcctcgttctctgctcacaac
tcgtctatcactcatatcacatttatttcccaatatcattttaacaacatcttccgatgcatgttcgtcaatattgcgcaaccactttgcaatattgtcaaaacttttcgcat
ttgtgatatcgtaaaccagcataattcccattgctccgcggtaatatgatgttgtgattgtgtggaatcgttcttgtccagctgtgtcccagatttgtaatttaatctttttt
ccttttaattcgatagttttaattttgaagtcgattcctgaatgaaaaaagaaaattattttgaaatcactagattctgaataaaaactaaccaatagttgagatgaatgtgg
tgttaaaggcatcatccgaaaatctgtacagaatgcaagtttttccaactcctgagtcgcctattagcagcaatttgaagagcatgtcatacggtcggcgagccatttttctt
ctgaaatgagaaaaagttgagaactaaagttgcacaaaagtaagagaaaagcacttgagtcatggcaaatagaacgaacactttgagatttcgaagaagttatcaagagttga
caattggaagatatttggaagaactttctaatttttttctagttttccaaaattaggtttttgtcataaaatgttgtcaaagaaaaaacaggacaaaatagttaattgttgtt
tccattataacaaaaaaaaatttgaacggagctattaacgcgtgcatgcgcaaatcacatcgattagctgtttctgggaaattctcgggaaaaggtgaacagcagctgctggc
ttcctctgcgggtcacgaaaacacaaagagatcattataattgttatttggaaaggaagcgaatctaaaacgggtacaggtggacgtttattgatcgaaagtgctttttattt
gaaattgaatggtgaactttgcaattttgtaatgcaaagtacgttatcagatggcatgagatgtgtgaagtgataaggaataaaatgtgaacgacatgttcaagaaactgtga
tttttcaataatttgtgatgaaatattttaggaacagaaatgaacatattaattgatataaaaacaataggaacactaactcataattatgataggtgaatatcaaaatgtgc
tagattttttgaagttaaaaaatacatttctaatattttttcaaataataagtttcagctgaaatttcagggtgatttcagaaagctatgttttgataaattgttttgaaaat
taaaagaagctacagcaaaaaaaaattaaagagaacatcgctccctcgtagtgtataatttttgattatcgaaaaaaatgagtcaatgatgaaaaggaagtcgcaatctcaaa
acttcaaaaatcaaaagaagccgttgcctctgtcatcaaaaattcagaagacaaggttgttgacaagggtcaattctcagtggtggagggcattgggcgtggtgaaatttttg
aaggctagtgtggttggacctctactagatagacaaaacccccgaaatagacgtttaatttgatgagatggtggagaaagaaaaggactcattctctagatgatagagagacc
agagatacagacaagagagggcgcctcggccgtatgttgaatgggagatcgatggaaccgagacaacgagaaaaggaatagagacggagaaagagagagagagcgcgcgttgt
tggaaggatgaaaaagaaaaaagacatgagctgcttcacaagagcttggcgaaagcaaagggcaaagtgttgacagcttagtggtggtagttggatcatgtgtttttatgttt
ccggtgggagaaggttcaacaaaaaatgaaaagaaaaagttcaagcggcatgaatcattctgagtttaaaacaaaattattgcgaaaattaatattaaaaccttttcacaaaa
cttcaagctaatctgttcatgaaaatttgaataatagttttttcccacctatttagaattaacttcatattaacgaaattaattaacgaatcgaaaattatgacttttcagaa
tcatctgaagttttttcacattccatgctgcatggaataatttgatcctggaatcgatatgtttttatggtatactttttaaccttcaatttagctggaaaagtatggaataa
ataattcccgaagctatgtacatatatgtagaattattgaatgattgtgagaacaacttgactttagcttgagtaggaatcggaatggctatcgaccgatcaacacttaggat
tgtaagaatggcagtaagaatatattgaagaaagaatgtttgttcataggaagagaaagagtattgcgaaatcatcatcgcccactttagaatggacgggcggtgagcggaca
tagagaattgtgaatgactaatgcttttgcagaatctagggcaaaatcgtaggaacaaacaattgtaatacggagaaaacaatcatatcgatcgatgatcatggagaaaaatg
tgatttaagtgagtagacttggaaaaattaataaaagcatgaattgtcgatatttttcatttattttcattataaagctctttaaaaacaaattaaatattgagaatggcttc
gaagaatattgtttcaaatatgttcaatggtgacaccttgcggataaaattaatgtaaaaatcatggaacacagattcactgatatctcattatctcaagcagtgtaattaga
gattttttggaacaattattttataaaactataaataaaccgtttatactactcaaagccaaatattcaagctattaccattttttttctaactaattcttgagcaattaaag
tattccccagtttttattttgcaacgactccaggcaaacacgctccgttgcacttgccgccaaggcgttgcattcaaatcagagagacatctcattccgatttctgtttttct
tccaataaacggtattttatgcctaatgggtgatacggaaattgttcctcttcgagtacaaaatgtacttgatagcgaaatcattcgtctcaacttgtggtccatgaaggtaa
ctgtctagtttttttaagttttcatgatttcaatatttttacagtttaacgcgaccagtttcaaactcgaaggttttgtgagaaatgaagaaggcactatgatgcagaaagtt
tgttccgaatttatttgtgtaagtcgagaaacatattcgtcaacaattttcattaaatattcagagacgcttcacttctacgttgcttttcgatgtttccggacgtttcttcg
acttggtcggacagattgatcgggaatatcaacaaaaaatgggaatgcctagtagaattattgatgaattttcaaatggaattcctgaaaattgggccgaccttatctattcc
tgcatgtcagccaaccaaagaagcgcacttcgccctatccaacaggctccaaaagaaccaattagaactagaacagaaccaattgttacgttggcagatgaaaccgagctaac
tggaggatgccagaaaaattccgaaaacgagaaagaaaggaacagacgtgagcgtgaagaacagcaaacaaaggaacgtgagagaagattagaagaagaaaaacaacgacgag
atgctgaagctgaggctgaaagaaggcgaaaagaagaggaagagctggaagaagctaattacacccttcgtgctccgaaatctcagaacggcgagccaatcactccgataaga
Genome sequence of C.elegans.

                                Sequence of entire genome.

                                Sequence of cDNA clones.

                                Approximately 19,500 predicted protein coding
                                gene sequences.

                                Large number of various kinds of functional
                                RNAs – not discuss further.

                                For this lecture – focus predicted proteins.




                                     Gene prediction? How?

Science, December 1998.
Computer based predictions

GENEFINDER

Biases in coding sequence - in C. elegans non-coding is AT rich.
Splice site signals, initiator methionines, termination codons.

Likely exons and probable/possible splice patterns.




       • Evidence that a prediction is correct?
       • Homology with genes in other organisms – homologues.
       • Known protein families.

       •Experimental evidence.

More Related Content

What's hot

Bacteriophage T4 and Bacteriophage lambda
Bacteriophage T4 and Bacteriophage lambda Bacteriophage T4 and Bacteriophage lambda
Bacteriophage T4 and Bacteriophage lambda HARINATHA REDDY ASWARTHA
 
Batch (1) final sem (1) molecular biology
Batch (1) final sem (1) molecular biologyBatch (1) final sem (1) molecular biology
Batch (1) final sem (1) molecular biologySalah Abass
 
Batch (1) supplementary exam m.sc - molecular biology
Batch (1) supplementary exam   m.sc - molecular biologyBatch (1) supplementary exam   m.sc - molecular biology
Batch (1) supplementary exam m.sc - molecular biologySalah Abass
 
Pb Stem Cell Engineering
Pb Stem Cell EngineeringPb Stem Cell Engineering
Pb Stem Cell EngineeringJack Crawford
 
Critical role of host factors which recruit replication in positive strand rn...
Critical role of host factors which recruit replication in positive strand rn...Critical role of host factors which recruit replication in positive strand rn...
Critical role of host factors which recruit replication in positive strand rn...SARMAD HASHMI
 
MOLECULAR ORGANIZATION OF EKARYOTIC RNA
 MOLECULAR ORGANIZATION OF EKARYOTIC RNA MOLECULAR ORGANIZATION OF EKARYOTIC RNA
MOLECULAR ORGANIZATION OF EKARYOTIC RNAgohil sanjay bhagvanji
 
Ribozymes
RibozymesRibozymes
RibozymesAlbert
 
Biotechniques v29p146 Admid
Biotechniques v29p146 AdmidBiotechniques v29p146 Admid
Biotechniques v29p146 AdmidMichael Weiner
 
Práctica de Transformación
Práctica de TransformaciónPráctica de Transformación
Práctica de TransformaciónCiberGeneticaUNAM
 
Bairoch ISB closing-talk: CALIPHO
Bairoch ISB closing-talk: CALIPHOBairoch ISB closing-talk: CALIPHO
Bairoch ISB closing-talk: CALIPHOPascale Gaudet
 
Dna Modifying Enzymes by Arijit Pani
Dna Modifying Enzymes by Arijit PaniDna Modifying Enzymes by Arijit Pani
Dna Modifying Enzymes by Arijit PaniArijit Pani
 

What's hot (19)

Bacteriophage T4 and Bacteriophage lambda
Bacteriophage T4 and Bacteriophage lambda Bacteriophage T4 and Bacteriophage lambda
Bacteriophage T4 and Bacteriophage lambda
 
Batch (1) final sem (1) molecular biology
Batch (1) final sem (1) molecular biologyBatch (1) final sem (1) molecular biology
Batch (1) final sem (1) molecular biology
 
Batch (1) supplementary exam m.sc - molecular biology
Batch (1) supplementary exam   m.sc - molecular biologyBatch (1) supplementary exam   m.sc - molecular biology
Batch (1) supplementary exam m.sc - molecular biology
 
Pb Stem Cell Engineering
Pb Stem Cell EngineeringPb Stem Cell Engineering
Pb Stem Cell Engineering
 
Cancer ppt 2
Cancer ppt 2Cancer ppt 2
Cancer ppt 2
 
rprotein2
rprotein2rprotein2
rprotein2
 
Gateway
GatewayGateway
Gateway
 
Critical role of host factors which recruit replication in positive strand rn...
Critical role of host factors which recruit replication in positive strand rn...Critical role of host factors which recruit replication in positive strand rn...
Critical role of host factors which recruit replication in positive strand rn...
 
dna rekombinan
dna rekombinandna rekombinan
dna rekombinan
 
MOLECULAR ORGANIZATION OF EKARYOTIC RNA
 MOLECULAR ORGANIZATION OF EKARYOTIC RNA MOLECULAR ORGANIZATION OF EKARYOTIC RNA
MOLECULAR ORGANIZATION OF EKARYOTIC RNA
 
Structural studies of GPCRs
Structural studies of GPCRsStructural studies of GPCRs
Structural studies of GPCRs
 
Ribozymes
RibozymesRibozymes
Ribozymes
 
Biotechniques v29p146 Admid
Biotechniques v29p146 AdmidBiotechniques v29p146 Admid
Biotechniques v29p146 Admid
 
rprotein3
rprotein3rprotein3
rprotein3
 
Práctica de Transformación
Práctica de TransformaciónPráctica de Transformación
Práctica de Transformación
 
Comparative Genomics for Marker Development in Cassava
Comparative Genomics for Marker Development in CassavaComparative Genomics for Marker Development in Cassava
Comparative Genomics for Marker Development in Cassava
 
Bairoch ISB closing-talk: CALIPHO
Bairoch ISB closing-talk: CALIPHOBairoch ISB closing-talk: CALIPHO
Bairoch ISB closing-talk: CALIPHO
 
Heinrich et al., 2010
Heinrich et al., 2010Heinrich et al., 2010
Heinrich et al., 2010
 
Dna Modifying Enzymes by Arijit Pani
Dna Modifying Enzymes by Arijit PaniDna Modifying Enzymes by Arijit Pani
Dna Modifying Enzymes by Arijit Pani
 

Viewers also liked

molecular biology techniques -jaypee university of information technology- ra...
molecular biology techniques -jaypee university of information technology- ra...molecular biology techniques -jaypee university of information technology- ra...
molecular biology techniques -jaypee university of information technology- ra...RAVI RANJAN
 
DNA microarray
DNA microarrayDNA microarray
DNA microarrayS Rasouli
 
Probe labeling
Probe labelingProbe labeling
Probe labelingAman Ullah
 
NetBioSIG2014-Keynote by Marian Walhout
NetBioSIG2014-Keynote by Marian WalhoutNetBioSIG2014-Keynote by Marian Walhout
NetBioSIG2014-Keynote by Marian WalhoutAlexander Pico
 
Back to basics: Fundamental Concepts and Special Considerations in RNA Isolation
Back to basics: Fundamental Concepts and Special Considerations in RNA IsolationBack to basics: Fundamental Concepts and Special Considerations in RNA Isolation
Back to basics: Fundamental Concepts and Special Considerations in RNA IsolationQIAGEN
 
molecular biology techniques -jaypee university of information technology- ra...
molecular biology techniques -jaypee university of information technology- ra...molecular biology techniques -jaypee university of information technology- ra...
molecular biology techniques -jaypee university of information technology- ra...RAVI RANJAN
 
Lectut btn-202-ppt-l22. hybridization procedures
Lectut btn-202-ppt-l22. hybridization proceduresLectut btn-202-ppt-l22. hybridization procedures
Lectut btn-202-ppt-l22. hybridization proceduresRishabh Jain
 
RNA isolation
RNA isolationRNA isolation
RNA isolationzara khan
 
281 lec30 mol_tech2
281 lec30 mol_tech2281 lec30 mol_tech2
281 lec30 mol_tech2hhalhaddad
 
Lec16 Realtime PCR
Lec16 Realtime PCRLec16 Realtime PCR
Lec16 Realtime PCRsr320
 
B.Tech Biotechnology II Elements of Biotechnology Unit 4 DNA Fingerprinting
B.Tech Biotechnology II Elements of Biotechnology Unit 4 DNA FingerprintingB.Tech Biotechnology II Elements of Biotechnology Unit 4 DNA Fingerprinting
B.Tech Biotechnology II Elements of Biotechnology Unit 4 DNA FingerprintingRai University
 
Molecular probes kashmeera n.a.
Molecular probes   kashmeera n.a.Molecular probes   kashmeera n.a.
Molecular probes kashmeera n.a.Kashmeera N.A.
 
281 lec29 mol_tech1
281 lec29 mol_tech1281 lec29 mol_tech1
281 lec29 mol_tech1hhalhaddad
 
Preparation and isolation of genomic
Preparation and isolation of genomicPreparation and isolation of genomic
Preparation and isolation of genomicMubashar Ali
 
Genome organisation in eukaryotes...........!!!!!!!!!!!
Genome organisation in eukaryotes...........!!!!!!!!!!!Genome organisation in eukaryotes...........!!!!!!!!!!!
Genome organisation in eukaryotes...........!!!!!!!!!!!manish chovatiya
 
Dna isolation Principle
Dna isolation PrincipleDna isolation Principle
Dna isolation PrincipleAnkita Gurao
 

Viewers also liked (20)

molecular biology techniques -jaypee university of information technology- ra...
molecular biology techniques -jaypee university of information technology- ra...molecular biology techniques -jaypee university of information technology- ra...
molecular biology techniques -jaypee university of information technology- ra...
 
DNA microarray
DNA microarrayDNA microarray
DNA microarray
 
Probe labeling
Probe labelingProbe labeling
Probe labeling
 
RT PCR
RT PCRRT PCR
RT PCR
 
NetBioSIG2014-Keynote by Marian Walhout
NetBioSIG2014-Keynote by Marian WalhoutNetBioSIG2014-Keynote by Marian Walhout
NetBioSIG2014-Keynote by Marian Walhout
 
Back to basics: Fundamental Concepts and Special Considerations in RNA Isolation
Back to basics: Fundamental Concepts and Special Considerations in RNA IsolationBack to basics: Fundamental Concepts and Special Considerations in RNA Isolation
Back to basics: Fundamental Concepts and Special Considerations in RNA Isolation
 
molecular biology techniques -jaypee university of information technology- ra...
molecular biology techniques -jaypee university of information technology- ra...molecular biology techniques -jaypee university of information technology- ra...
molecular biology techniques -jaypee university of information technology- ra...
 
Lectut btn-202-ppt-l22. hybridization procedures
Lectut btn-202-ppt-l22. hybridization proceduresLectut btn-202-ppt-l22. hybridization procedures
Lectut btn-202-ppt-l22. hybridization procedures
 
RNA isolation
RNA isolationRNA isolation
RNA isolation
 
281 lec30 mol_tech2
281 lec30 mol_tech2281 lec30 mol_tech2
281 lec30 mol_tech2
 
Lec16 Realtime PCR
Lec16 Realtime PCRLec16 Realtime PCR
Lec16 Realtime PCR
 
B.Tech Biotechnology II Elements of Biotechnology Unit 4 DNA Fingerprinting
B.Tech Biotechnology II Elements of Biotechnology Unit 4 DNA FingerprintingB.Tech Biotechnology II Elements of Biotechnology Unit 4 DNA Fingerprinting
B.Tech Biotechnology II Elements of Biotechnology Unit 4 DNA Fingerprinting
 
Molecular probes kashmeera n.a.
Molecular probes   kashmeera n.a.Molecular probes   kashmeera n.a.
Molecular probes kashmeera n.a.
 
281 lec29 mol_tech1
281 lec29 mol_tech1281 lec29 mol_tech1
281 lec29 mol_tech1
 
Pseudomonas
PseudomonasPseudomonas
Pseudomonas
 
Preparation and isolation of genomic
Preparation and isolation of genomicPreparation and isolation of genomic
Preparation and isolation of genomic
 
Genome organisation in eukaryotes...........!!!!!!!!!!!
Genome organisation in eukaryotes...........!!!!!!!!!!!Genome organisation in eukaryotes...........!!!!!!!!!!!
Genome organisation in eukaryotes...........!!!!!!!!!!!
 
Plasmid isolation
Plasmid isolationPlasmid isolation
Plasmid isolation
 
Dna isolation Principle
Dna isolation PrincipleDna isolation Principle
Dna isolation Principle
 
Real time PCR
Real time PCRReal time PCR
Real time PCR
 

Similar to C. elegans Genetics: Understanding Life at the Molecular Level

LECTURE 5_Principle of Cloning.pdf
LECTURE 5_Principle of Cloning.pdfLECTURE 5_Principle of Cloning.pdf
LECTURE 5_Principle of Cloning.pdffadzilah omar
 
Recombinant DNA Technology- Study of cloning vectors.pptx
Recombinant DNA  Technology- Study of cloning vectors.pptxRecombinant DNA  Technology- Study of cloning vectors.pptx
Recombinant DNA Technology- Study of cloning vectors.pptxPoonam Patil
 
Vectors part 1 | molecular biology | biotechnology
Vectors  part 1 | molecular biology | biotechnologyVectors  part 1 | molecular biology | biotechnology
Vectors part 1 | molecular biology | biotechnologyatul azad
 
Molecular Cloning - Vectors: Types & Characteristics
Molecular Cloning -  Vectors: Types & CharacteristicsMolecular Cloning -  Vectors: Types & Characteristics
Molecular Cloning - Vectors: Types & CharacteristicsSruthy Chandran
 
cloning vectors in RDT.pptx
cloning vectors in RDT.pptxcloning vectors in RDT.pptx
cloning vectors in RDT.pptxvaishaliarora56
 
Bacteriophage by reshma
Bacteriophage by reshma Bacteriophage by reshma
Bacteriophage by reshma resgmasheikh
 
Glossary For Biotechnology
Glossary For BiotechnologyGlossary For Biotechnology
Glossary For Biotechnologyallyjer
 
RECOMBINANT DNA TECHNOLOGY
RECOMBINANT DNA TECHNOLOGYRECOMBINANT DNA TECHNOLOGY
RECOMBINANT DNA TECHNOLOGYYESANNA
 
Recombinant DNA Technology -2015
Recombinant DNA Technology -2015Recombinant DNA Technology -2015
Recombinant DNA Technology -2015ARPUTHA SELVARAJ A
 
Vectors a.k.saha
Vectors a.k.sahaVectors a.k.saha
Vectors a.k.sahaAnanda Saha
 

Similar to C. elegans Genetics: Understanding Life at the Molecular Level (20)

LECTURE 5_Principle of Cloning.pdf
LECTURE 5_Principle of Cloning.pdfLECTURE 5_Principle of Cloning.pdf
LECTURE 5_Principle of Cloning.pdf
 
Vector
VectorVector
Vector
 
Recombinant DNA Technology- Study of cloning vectors.pptx
Recombinant DNA  Technology- Study of cloning vectors.pptxRecombinant DNA  Technology- Study of cloning vectors.pptx
Recombinant DNA Technology- Study of cloning vectors.pptx
 
Vectors part 1 | molecular biology | biotechnology
Vectors  part 1 | molecular biology | biotechnologyVectors  part 1 | molecular biology | biotechnology
Vectors part 1 | molecular biology | biotechnology
 
Recombinant DNA Technology
Recombinant DNA TechnologyRecombinant DNA Technology
Recombinant DNA Technology
 
Molecular Cloning - Vectors: Types & Characteristics
Molecular Cloning -  Vectors: Types & CharacteristicsMolecular Cloning -  Vectors: Types & Characteristics
Molecular Cloning - Vectors: Types & Characteristics
 
cloning vectors in RDT.pptx
cloning vectors in RDT.pptxcloning vectors in RDT.pptx
cloning vectors in RDT.pptx
 
Gene Cloning
Gene CloningGene Cloning
Gene Cloning
 
Application1
Application1Application1
Application1
 
Bacteriophage by reshma
Bacteriophage by reshma Bacteriophage by reshma
Bacteriophage by reshma
 
Recombinant dna technology.pptx mona
Recombinant dna technology.pptx monaRecombinant dna technology.pptx mona
Recombinant dna technology.pptx mona
 
Dna cloning intro
Dna cloning introDna cloning intro
Dna cloning intro
 
Glossary For Biotechnology
Glossary For BiotechnologyGlossary For Biotechnology
Glossary For Biotechnology
 
dna cloning.pdf
dna cloning.pdfdna cloning.pdf
dna cloning.pdf
 
RECOMBINANT DNA TECHNOLOGY
RECOMBINANT DNA TECHNOLOGYRECOMBINANT DNA TECHNOLOGY
RECOMBINANT DNA TECHNOLOGY
 
Vectors-A.K.Saha_.ppt
Vectors-A.K.Saha_.pptVectors-A.K.Saha_.ppt
Vectors-A.K.Saha_.ppt
 
Vectors.pptx
 Vectors.pptx Vectors.pptx
Vectors.pptx
 
Recombinant DNA Technology -2015
Recombinant DNA Technology -2015Recombinant DNA Technology -2015
Recombinant DNA Technology -2015
 
Vectors a.k.saha
Vectors a.k.sahaVectors a.k.saha
Vectors a.k.saha
 
choice of vectors
 choice of vectors choice of vectors
choice of vectors
 

Recently uploaded

The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 

Recently uploaded (20)

The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 

C. elegans Genetics: Understanding Life at the Molecular Level

  • 1. C. elegans Genetics  C.elegans has 2 sexes, self fertilizing hermaphrodites and males.  Sex determined chromosomally - XX-hermaphrodite, X-male.  Diploid for 5 autosomes.  Standard classical genetic techniques can be applied.  Life cycle – Zygote to adult ~3 days.  Grow on petri dish – they eat bacteria.  Can store them frozen in liquid nitrogen indefinately. Why might the hermaphrodite sex be useful for genetics?
  • 2. Chromosome I Genetic mapping. Left arm m.u. bli-3 m.u. = map unit. -15 egl-30 Genetic mapping – recombination. mab-20 -10 1 m.u. is 1% recombination per meiosis. -5 fog-1 unc-73 unc-57 Central 0 dpy-5 dpy-14 cluster fer-1 5 lin-11 unc-29 unc-75 Parent Recombinant 10 unc-101 15 20 glp-4 fog-1 + fog-1 + 25 unc-54 glp-4 + + glp-4 Right arm
  • 3. We want to understand how life works – at the molecular level. We had mutant genes with informative phenotypes. The mutated genes were mapped onto linkage groups – chromosomes. What kinds of proteins do these genes encode and how do these proteins function? In 1983, identifying the molecular sequence of a gene defined by mutation was a complicated and time consuming business, even in the worm. If we only new the sequence of the genome!
  • 4. As the term applies to recombinant DNA, what is a clone? Starting with DNA extracted from any organism, Vector How can you take that and get one single fragment into a vector and grow billions of copies of that single “cloned” molecule? Cloned DNA insert
  • 5. C. elegans Genome Project unc-101 unc-75 unc-54 unc-73 mab-20 lin-11 dpy-5 glp-4 fog-1 egl-30 fer-1 Mutants - function bli-3 Genetic map 25 10 15 20 0 5 -15 -10 -5 Chromosomes AACGTTCCACG....... DNA sequence – genes Cloned DNA and proteins fragments Identify DNA sequences corresponding to genes defined by mutation.
  • 6. If you wanted to clone sections of chromosomes for sequencing, how many copies of each chromosome would you start with? DNA Of the order of millions – millions of copies of each chromosome
  • 7. Purified genomic DNA Fragment the chromosomal DNA – either restriction enzyme or mechanical shear.
  • 8. Cloning methods used by the C. elegans genome project Cosmid clones – ~ 40 Kb insert size – Genomic Library. Cosmid cloning vector Linearised cosmid vector Random fragments of genomic DNA – Drug resistance marker E. coli origin of replication millions of them. cos site Useful restriction sites DNA Ligase Long concatenates of cosmid vectors interspaced with random fragments of genomic DNA.
  • 9. Mixed population “inserts” In vitro lambda packaging extracts Lambda Terminase Other phage proteins COS sites in cosmid vector E. coli Critical step Phage “transfects” single cosmid into an E. coli cell.
  • 10. CLONING This is a clone Cells are plated onto medium with antibiotic selection. Cells grown up to form bacterial colonies. Insert X Each colony is derived from a single transfected cell. Each colony is a clonal population. E. coli - clonal population with a single cosmid clone – single genomic DNA fragment. Billions of copies of one cloned insert. Freeze it for storage. Purify cosmid DNA. Sequence the insert. Solid medium on plates Liquid culture Sub-clone fragments etc.
  • 11. Started with many millions of different fragments of chromosomal DNA in one tube. End up with potentially millions of CLONED fragments, each in a different E.coli colony – or culture.
  • 12. We have got as far as random cloned fragments of genomic DNA. What next? Average cosmid insert size – 40 Kb C.elegans genome ~100.3 Mb = 100,300 Kb 100,300/40 = 2,507.5 i.e. ~2,500 cosmid clones could contain the entire C. elegans genome – but WOULD they?
  • 13. In principle, 2500 cosmid clones could contain all the DNA of the C. elegans genome. Why not just start sequencing ~2500 clones picked at random? Imagine this: I give you a large and awkwardly shaped dice with 2500 faces, with a single number on each face, the numbers 1-2500. Roll the dice and write down the number on top. Repeat this – again and again and……. How many times would you have to roll the dice so that every face of the dice would have been on top at least once? ~ 4x2500 will give ~95% probability of any one side or DNA fragment, appearing. ~10x2500 raises probability to ~99%
  • 14. The Golden Path What if you could identify clones that overlapped slightly with ones another? How can we get these clones? Cloned DNA fragments – moderate overlaps. With this approach you could sequence the entire genome by sequencing less than 5000 cosmid clones (2x2500)
  • 15. Cosmid fingerprinting 1. Restriction digest of cosmid DNA. 2. Separate fragments according to size by gel electrophoresis. 3. Digitise the ladder of different sized DNA fragments obtained. Multiple common fragments – clones probably overlap. C. elegans genome project, ~17,000 cosmid clones fingerprinted. A B C Assembled into “contigs” – overlapping clones. “Contig” ~17,000 random cosmid clones A Fingerprinting ~700 contigs B C D C.elegans genome 100 Mb ~2,500 cosmid clones
  • 16. 700 contigs. What is the minimum number of contigs the C. elegans genome could be contained in? Or – how would we know when we had succeeded in joining all the contigs? A method of filling the gaps – joining the contigs – was needed.
  • 17. YACs – Yeast Artificial Chromosomes DNA inserts of ~100 kb – 2 Mb. Grown in yeast. Clonal growth of yeast colonies, much like cosmids in E. coli. YAC DNA separated by pulsed-field gel electrophoresis. C. elegans genome is ~100 Mb. Cosmid clones – approximately 40 kb inserts. YAC clones – select average 500 kb inserts. ~2500 cosmid clones would permit 1x coverage of the genome. ~200 YAC clones would permit 1x coverage of the genome.
  • 18. ~17,000 fingerprinted cosmid clones – ~700 unlinked contigs. Cosmid clone contigs ? ? 6 Chromosomes AACGTTCCACG....... unc-101 unc-75 unc-54 unc-73 mab-20 lin-11 dpy-5 glp-4 fog-1 egl-30 fer-1 bli-3 Genetic map 15 10 20 25 0 5 -15 -10 -5
  • 19. Joining up the contigs Contig X Contig Y YAC clone ~700 contigs – grids of representative cosmid clones. • Large YAC clones (> 1Mb). • Purify YAC DNA – (PFGE). • Radio-label YAC DNA. • Hybridise to cosmid grid. • Expose to X-ray film. Linked cosmid clones
  • 20. unc-101 unc-75 unc-54 unc-73 mab-20 lin-11 dpy-5 glp-4 fog-1 egl-30 fer-1 bli-3 Genetic map 10 15 20 25 0 5 -15 -10 -5 A physical map of the genome - the “Golden Path” – chromosomes represented in ordered overlapping clones or “clone contigs”. YACs Cosmids The Sequence of The Genome
  • 21. Sequencing the C. elegans Genome Individual cosmid clone. Randomly fragmented and shotgun cloned into sequencing vectors. Generally smaller insert size is best for primary sequence determination – 2-10 Kb. Sequence of cosmid or YAC etc, determined and compiled in silico. Finishing – directed cloning to fill in any gaps. Check for overlap of sequence with overlapping cosmids.
  • 22. Gaps between cosmid contigs ~20% of genome. Most of these gaps were not random. They contained regions that could not be cloned in cosmids. YAC clones covering most of the gaps. YAC DNA shotgun cloned into M13 or plasmid vectors. Most of the DNA contained in these awkward regions was successfully sub-cloned into small insert size vectors, and sequenced. The sequence as published in December 1998 was generated from: 2527 cosmids, 257 YACs, 113 fosmids, 44 PCR products.
  • 23. C. elegans cosmid K06A5, 24323 bp. Flat sequence file –3955 bp shown. >CEK06A5 acaagagagggcgcctcggccgtatgttgaatgggagatcgatggaaccgagacaacgagaaaaggaatagagacggagaaagagagagagagcgcgcgttgttggaaggatg aaaaagaaaaaagacatgagctgcttcacaagagcttggcgaaagcaaagggcaaagtgttgacagcttagtggtggtagttggatcttctctcctcgttctctgctcacaac tcgtctatcactcatatcacatttatttcccaatatcattttaacaacatcttccgatgcatgttcgtcaatattgcgcaaccactttgcaatattgtcaaaacttttcgcat ttgtgatatcgtaaaccagcataattcccattgctccgcggtaatatgatgttgtgattgtgtggaatcgttcttgtccagctgtgtcccagatttgtaatttaatctttttt ccttttaattcgatagttttaattttgaagtcgattcctgaatgaaaaaagaaaattattttgaaatcactagattctgaataaaaactaaccaatagttgagatgaatgtgg tgttaaaggcatcatccgaaaatctgtacagaatgcaagtttttccaactcctgagtcgcctattagcagcaatttgaagagcatgtcatacggtcggcgagccatttttctt ctgaaatgagaaaaagttgagaactaaagttgcacaaaagtaagagaaaagcacttgagtcatggcaaatagaacgaacactttgagatttcgaagaagttatcaagagttga caattggaagatatttggaagaactttctaatttttttctagttttccaaaattaggtttttgtcataaaatgttgtcaaagaaaaaacaggacaaaatagttaattgttgtt tccattataacaaaaaaaaatttgaacggagctattaacgcgtgcatgcgcaaatcacatcgattagctgtttctgggaaattctcgggaaaaggtgaacagcagctgctggc ttcctctgcgggtcacgaaaacacaaagagatcattataattgttatttggaaaggaagcgaatctaaaacgggtacaggtggacgtttattgatcgaaagtgctttttattt gaaattgaatggtgaactttgcaattttgtaatgcaaagtacgttatcagatggcatgagatgtgtgaagtgataaggaataaaatgtgaacgacatgttcaagaaactgtga tttttcaataatttgtgatgaaatattttaggaacagaaatgaacatattaattgatataaaaacaataggaacactaactcataattatgataggtgaatatcaaaatgtgc tagattttttgaagttaaaaaatacatttctaatattttttcaaataataagtttcagctgaaatttcagggtgatttcagaaagctatgttttgataaattgttttgaaaat taaaagaagctacagcaaaaaaaaattaaagagaacatcgctccctcgtagtgtataatttttgattatcgaaaaaaatgagtcaatgatgaaaaggaagtcgcaatctcaaa acttcaaaaatcaaaagaagccgttgcctctgtcatcaaaaattcagaagacaaggttgttgacaagggtcaattctcagtggtggagggcattgggcgtggtgaaatttttg aaggctagtgtggttggacctctactagatagacaaaacccccgaaatagacgtttaatttgatgagatggtggagaaagaaaaggactcattctctagatgatagagagacc agagatacagacaagagagggcgcctcggccgtatgttgaatgggagatcgatggaaccgagacaacgagaaaaggaatagagacggagaaagagagagagagcgcgcgttgt tggaaggatgaaaaagaaaaaagacatgagctgcttcacaagagcttggcgaaagcaaagggcaaagtgttgacagcttagtggtggtagttggatcatgtgtttttatgttt ccggtgggagaaggttcaacaaaaaatgaaaagaaaaagttcaagcggcatgaatcattctgagtttaaaacaaaattattgcgaaaattaatattaaaaccttttcacaaaa cttcaagctaatctgttcatgaaaatttgaataatagttttttcccacctatttagaattaacttcatattaacgaaattaattaacgaatcgaaaattatgacttttcagaa tcatctgaagttttttcacattccatgctgcatggaataatttgatcctggaatcgatatgtttttatggtatactttttaaccttcaatttagctggaaaagtatggaataa ataattcccgaagctatgtacatatatgtagaattattgaatgattgtgagaacaacttgactttagcttgagtaggaatcggaatggctatcgaccgatcaacacttaggat tgtaagaatggcagtaagaatatattgaagaaagaatgtttgttcataggaagagaaagagtattgcgaaatcatcatcgcccactttagaatggacgggcggtgagcggaca tagagaattgtgaatgactaatgcttttgcagaatctagggcaaaatcgtaggaacaaacaattgtaatacggagaaaacaatcatatcgatcgatgatcatggagaaaaatg tgatttaagtgagtagacttggaaaaattaataaaagcatgaattgtcgatatttttcatttattttcattataaagctctttaaaaacaaattaaatattgagaatggcttc gaagaatattgtttcaaatatgttcaatggtgacaccttgcggataaaattaatgtaaaaatcatggaacacagattcactgatatctcattatctcaagcagtgtaattaga gattttttggaacaattattttataaaactataaataaaccgtttatactactcaaagccaaatattcaagctattaccattttttttctaactaattcttgagcaattaaag tattccccagtttttattttgcaacgactccaggcaaacacgctccgttgcacttgccgccaaggcgttgcattcaaatcagagagacatctcattccgatttctgtttttct tccaataaacggtattttatgcctaatgggtgatacggaaattgttcctcttcgagtacaaaatgtacttgatagcgaaatcattcgtctcaacttgtggtccatgaaggtaa ctgtctagtttttttaagttttcatgatttcaatatttttacagtttaacgcgaccagtttcaaactcgaaggttttgtgagaaatgaagaaggcactatgatgcagaaagtt tgttccgaatttatttgtgtaagtcgagaaacatattcgtcaacaattttcattaaatattcagagacgcttcacttctacgttgcttttcgatgtttccggacgtttcttcg acttggtcggacagattgatcgggaatatcaacaaaaaatgggaatgcctagtagaattattgatgaattttcaaatggaattcctgaaaattgggccgaccttatctattcc tgcatgtcagccaaccaaagaagcgcacttcgccctatccaacaggctccaaaagaaccaattagaactagaacagaaccaattgttacgttggcagatgaaaccgagctaac tggaggatgccagaaaaattccgaaaacgagaaagaaaggaacagacgtgagcgtgaagaacagcaaacaaaggaacgtgagagaagattagaagaagaaaaacaacgacgag atgctgaagctgaggctgaaagaaggcgaaaagaagaggaagagctggaagaagctaattacacccttcgtgctccgaaatctcagaacggcgagccaatcactccgataaga
  • 24. Genome sequence of C.elegans. Sequence of entire genome. Sequence of cDNA clones. Approximately 19,500 predicted protein coding gene sequences. Large number of various kinds of functional RNAs – not discuss further. For this lecture – focus predicted proteins. Gene prediction? How? Science, December 1998.
  • 25. Computer based predictions GENEFINDER Biases in coding sequence - in C. elegans non-coding is AT rich. Splice site signals, initiator methionines, termination codons. Likely exons and probable/possible splice patterns. • Evidence that a prediction is correct? • Homology with genes in other organisms – homologues. • Known protein families. •Experimental evidence.