SlideShare una empresa de Scribd logo
1 de 9
Descargar para leer sin conexión
http://genomebiology.com/2000/1/6/research/0011.1




Research
Evidence for symmetric chromosomal inversions around the




                                                                                                                                                 comment
replication origin in bacteria
Jonathan A Eisen, John F Heidelberg, Owen White and Steven L Salzberg
Address: The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA.

Correspondence: Jonathan A Eisen. E-mail: jeisen@tigr.org




                                                                                                                                                 reviews
Published: 4 December 2000                                              Received: 7 August 2000
                                                                        Revised: 25 September 2000
Genome Biology 2000, 1(6):research0011.1–0011.9
                                                                        Accepted: 19 October 2000
The electronic version of this article is the complete one and can be
found online at http://genomebiology.com/2000/1/6/research/0011
© GenomeBiology.com (Print ISSN 1465-6906; Online ISSN 1465-6914)




                                                                                                                                                 reports
                 Abstract

                 Background: Whole-genome comparisons can provide great insight into many aspects of biology.
                 Until recently, however, comparisons were mainly possible only between distantly related species.
                 Complete genome sequences are now becoming available from multiple sets of closely related
                 strains or species.




                                                                                                                                                 deposited research
                 Results: By comparing the recently completed genome sequences of Vibrio cholerae, Streptococcus
                 pneumoniae and Mycobacterium tuberculosis to those of closely related species - Escherichia coli,
                 Streptococcus pyogenes and Mycobacterium leprae, respectively - we have identified an unusual and
                 previously unobserved feature of bacterial genome structure. Scatterplots of the conserved
                 sequences (both DNA and protein) between each pair of species produce a distinct X-shaped
                 pattern, which we call an X-alignment. The key feature of these alignments is that they have
                 symmetry around the replication origin and terminus; that is, the distance of a particular conserved
                 feature (DNA or protein) from the replication origin (or terminus) is conserved between closely




                                                                                                                                                 refereed research
                 related pairs of species. Statistically significant X-alignments are also found within some genomes,
                 indicating that there is symmetry about the replication origin for paralogous features as well.

                 Conclusions: The most likely mechanism of generation of X-alignments involves large chromosomal
                 inversions that reverse the genomic sequence symmetrically around the origin of replication. The
                 finding of these X-alignments between many pairs of species suggests that chromosomal inversions
                 around the origin are a common feature of bacterial genome evolution.


                                                                                                                                                 interactions
Background                                                              ability to detect large-scale genomic changes is dependent
Large-scale genomic rearrangements and duplications are                 in large part on which genomes are available. Such studies
important in the evolution of species. Previously, these                in bacteria, for example, have been limited by the availabil-
large-scale genome-changing events were studied through                 ity of genomes only from distantly related sets of species.
genetic or cytological studies. With the availability of many           Recently, however, the genomes of sets of closely related
complete genome sequences it is now possible to study such              bacterial species have become available. We have compared
                                                                                                                                                 information




events through comparative genomics. The publication of                 these closely related bacterial genomes and have discovered
the yeast genome has led to much better insight into the                an unusual phenomenon - alignments of whole genomes
duplication events that have occurred in fungal and eukary-             that show an X-shaped pattern (which we refer to as
otic evolution (for example, see [1]). Large chromosomal                X-alignments). Here we present the evidence for these
duplications have also been found from analysis of com-                 X-alignments and discuss mechanisms that might have pro-
pleted chromosomes of Arabidopsis thaliana [2,3]. The                   duced them.
2 Genome Biology                  Vol 1 No 6                     Eisen et al.




  Results and discussion                                                                                                                                                  at the same distance from the origin but not necessarily on
  Whole-genome X-alignments between species at the                                                                                                                        the same side of the origin. The X-alignment between
  DNA level                                                                                                                                                               V. cholerae and E. coli was found to be statistically signifi-
  We compared the DNA sequences of the two chromosomes                                                                                                                    cant using a test based on the number of matches found in
  of Vibrio cholerae [4] with the sequence of the Escherichia                                                                                                             diagonal strips in the alignment (see the Materials and
  coli chromosome [5] using a suffix tree alignment algorithm                                                                                                             methods section). Specifically, when V. cholerae chrI is
  [6]. The analysis revealed a significant alignment at the DNA                                                                                                           aligned in the forward direction against E. coli, there are 459
  level between the V. cholerae large chromosome (chrI) [4]                                                                                                               maximal unique matching subsequences (MUMs; see the
  and the E. coli chromosome [5] spanning the entire length of                                                                                                            Materials and methods section), of which 177 occurred in a
  these chromosomes (Figure 1a). Analysis of the reverse com-                                                                                                             diagonal strip covering 10% of the total area (compared to
  plement of V. cholerae chrI with E. coli also produced a sig-                                                                                                           the expected value of 46). The probability of observing this
  nificant alignment (Figure 1b). When superimposed, the two                                                                                                              high a number of MUMs by chance is 4.7 x 10–59. The align-
  alignments produce a clear ‘X’ shape (Figure 1c) that is sym-                                                                                                           ment of V. cholerae chrI in the reverse direction against E. coli
  metric about the origin of replication of both genomes. This                                                                                                            (which corresponds to the MUMs on the anti-diagonal) has a
  symmetry indicates that matching sequences tend to occur                                                                                                                probability of 1.8 x 10–90. As a control, we compared the



   (a)                                                                                                         (b)                                                                                                             (c)
                  5,000,000                                                                                                      5,000,000                                                                                               5,000,000


                  4,000,000                                                                                                      4,000,000                                                                                               4,000,000


                  3,000,000                                                                                                      3,000,000                                                                                               3,000,000
    E. coli




                                                                                                               E. coli




                                                                                                                                                                                                                               E. coli
                  2,000,000                                                                                                      2,000,000                                                                                               2,000,000


                  1,000,000                                                                                                      1,000,000                                                                                               1,000,000


                         0                                                                                                               0                                                                                                      0
                              0




                                                 1,000,000




                                                                         2,000,000




                                                                                                 3,000,000




                                                                                                                                             0




                                                                                                                                                              1,000,000




                                                                                                                                                                                           2,000,000




                                                                                                                                                                                                                   3,000,000




                                                                                                                                                                                                                                                     0




                                                                                                                                                                                                                                                               1,000,000




                                                                                                                                                                                                                                                                           2,000,000




                                                                                                                                                                                                                                                                                       3,000,000
                                  V. cholerae chromosome I                                                                                       V. cholerae chromosome I reverse                                                                        V. cholerae chromosome I


   (d)                                                                                                         (e)
                  2,000,000                                                                                                      4,000,000


                  1,500,000                                                                                                      3,000,000
                                                                                                               M. tuberculosis
    S. pyogenes




                  1,000,000                                                                                                      2,000,000


                    500,000                                                                                                      1,000,000


                          0                                                                                                             0
                                                                                                                                             0




                                                                                                                                                        1,000,000



                                                                                                                                                                               2,000,000



                                                                                                                                                                                                       3,000,000
                              0



                                      5,000,00



                                                             1,000,000



                                                                                     1,500,000



                                                                                                   2,000,000




                                                 S. pneumoniae                                                                                                       M. leprae



  Figure 1
  Between-species whole-genome DNA alignments. Plots of maximally unique matching subsequences (MUMs) between
  genomes as identified by the MUMmer program. (a) V. cholerae chrI forward strand versus E. coli forward strand. (b) E. coli
  forward versus V. cholerae chrI reverse. (c) V. cholerae chrI versus E. coli, forward and reverse overlaid. (d) S. pneumoniae
  forward versus S. pyogenes forward and reverse overlaid. (e) M. tuberculosis forward versus M. leprae forward and reverse
  overlaid. A point ( x,y) indicates a DNA sequence that occurs once within each genome, at location x in one genome and at
  location y in the other genome. The matching sequences may occur on either the forward or the reverse strand; in either case,
  the locations indicate the 5’ end of the sequences. The point (0,0) corresponds to the origin of replication for each genome.
http://genomebiology.com/2000/1/6/research/0011.3




Table 1                                                                          species (see the Materials and methods section). Figure 2a
                                                                                 shows a scatterplot of chromosome positions of all proteins
Whole-genome DNA alignments using MUMmer
                                                                                 homologous between V. cholerae chrI and E. coli. The pres-




                                                                                                                                                               comment
Organism 1              Organism 2        Total      Matches      Probability*   ence of many large gene families causes a great deal of noise
                                          MUMs        within                     in this comparison. This noise can be reduced by considering
                                                     diagonal                    only the best matching homolog for each open reading frame
                                                      (10%)
                                                                                 (ORF), rather than all protein homologs (Figure 2b). This fil-
V. cholerae             E. coli             459         177        4.7 x 10-59   tered protein comparison results in an X-alignment that is
                                                                                 statistically significant (Table 2).
V. cholerae (rev)       E. coli             467         217        1.8 x 10-90
V. cholerae (rev)       V. cholerae         342          86        8.2 x 10-16   Whole-genome X-alignments within species




                                                                                                                                                               reviews
E. coli (rev)           E. coli            1,128        225        1.5 x 10-23   The finding of the X-alignment pattern between species led
S. pyogenes             S. pneumoniae       706         259        4.5 x 10-80   us to search for similar patterns within species; that is, global
                                                                                 alignments of a genome with its own reverse complement. Of
S. pyogenes (rev)       S. pneumoniae       626         255        2.3 x 10-90
                                                                                 the genomes for which we found between-species X-align-
S. pyogenes (rev)       S. pyogenes         367          96        1.1 x10-18    ments (M. tuberculosis, M. leprae, S. pyogenes, S. pneumo-
S. pneumoniae (rev)     S. pneumoniae      1,054        154         1.5 x 10-6   niae, E. coli and V. cholerae), statistically significant
                                                                                 self-alignments are detected for all except M. tuberculosis
M. leprae (rev)         M. leprae           449          89        3.5 x 10-10
                                                                                 (Figure 3; probabilities shown in Table 1). Interestingly, these




                                                                                                                                                               reports
M. tuberculosis (rev)   M. tuberculosis    2,476        268              0.092   self-alignments are not as strong as those between species.
E. coli                 M. tuberculosis       81         13              0.06    Proteome analysis also shows an X-alignment within species
E. coli (rev)           M. tuberculosis       70          5              0.84    (shown for V. cholerae chrI in Figure 2d; probabilities shown
                                                                                 in Table 2). The X-alignment of proteins within V. cholerae
*Statistical significance was estimated as described in the text; rev,           chrI is statistically significant only for recently duplicated-
reverse complement sequence.                                                     genes, but disappears when all paralogs are included. The
                                                                                 importance of filtering for recent duplications is discussed




                                                                                                                                                               deposited research
genomes of distantly related species, such as E. coli and                        below.
Mycobacterium tuberculosis. These do not show a signifi-
cant X-alignment (Table 1).                                                      Model I: whole-genome inverted duplications
                                                                                 One possible explanation for an X-alignment within and
We have found that X-alignments of whole genomes are not                         between species is an ancestral inverted duplication of the
limited to the V. cholerae versus E. coli comparison. For                        whole genome, as has been suggested for E. coli [10]. The
example, a whole-genome comparison of two bacteria in the                        weak or missing X-alignment within species could be
genus Streptococcus - S. pyogenes [7] and S. pneumoniae




                                                                                                                                                               refereed research
(H. Tettelin, personal communication) - reveals a global                         Table 2
X-alignment similar to that of V. cholerae versus E. coli                        Whole-genome protein-level comparisons
(Figure 1d) which is also statistically significant (Table 1). In
addition, an X-alignment is found between two species in                         Organism 1             Organism 2     Total     Matches      Probability*
the genus Mycobacterium - M. tuberculosis [8] and                                                                     matches     within
                                                                                                                                   10%
M. leprae [9] (Figure 1e) - as well as between two strains of
                                                                                                                                 diagonal
Helicobacter pylori (data not shown). The X-alignments
observed between any two pairs of genomes are not identical                      Top matches
in every aspect. For example, in the alignment between the
                                                                                                                                                               interactions
                                                                                  V. cholerae           E. coli        1,797       369         3.2 x 10-40
two Mycobacterium species, each conserved region is much
                                                                                  V. cholerae (rev)     E. coli        1,797       441         2.3 x 10-70
longer than in the other genome pairs. We believe this is due
to different numbers of evolutionary events between the                           V. cholerae           V. cholerae    701         145         3.6 x 10-17
species (see below). Whole-genome X-alignments were not                           V. cholerae (rev)     V. cholerae    701         70             0.52
found between any other pairs of species, although a related
                                                                                  E. coli               E. coli        1,985       286         3.6 x 10-10
pattern was seen between some of the chlamydial species
(see below).                                                                      E. coli (rev)         E. coli        1,985       210            0.20
                                                                                                                                                               information




                                                                                 Recent duplications†
Whole-genome X-alignments between species are                                     V. cholerae           V. cholerae    195         60          1.0 x 10-15
also found at the proteome level
                                                                                  V. cholerae (rev)     V. cholerae    195         26          8.4 x 10-4
To test whether the X-alignments found in the DNA analysis
could also be found at the level of whole proteomes, we con-                     *Statistical significance was estimated as described in the text. †Best
ducted comparisons of homologous proteins between                                match to another V. cholerae ORF versus any other complete genome.
4 Genome Biology              Vol 1 No 6   Eisen et al.




   (a)                                                                                                (b)
                      4,000,000                                                                                        4,000,000


                      3,000,000                                                                                        3,000,000
      E. coli




                                                                                                          E. coli
                      2,000,000                                                                                        2,000,000


                      1,000,000                                                                                        1,000,000


                             0                                                                                                0
                                  0




                                            1,000,000




                                                                  2,000,000




                                                                                          3,000,000




                                                                                                                                   0




                                                                                                                                       1,000,000




                                                                                                                                                            2,000,000




                                                                                                                                                                        3,000,000
                                            V. cholerae                                                                                            V. cholerae


   (c)                                                                                                (d)              3,000,000


                      1,000,000
                                                                                                                       2,000,000
      C. pneumoniae




                                                                                                         V. cholerae




                                       T                                 R
                       500,000
                                                                                                                       1,000,000




                             0                                                                                                 0
                                  0




                                                        500,000




                                                                              1,000,000




                                                                                                                                   0




                                                                                                                                       1,000,000




                                                                                                                                                            2,000,000




                                                                                                                                                                        3,000,000
                                           C. trachomatis                                                                                           V. cholerae


  Figure 2
  Whole-genome proteome alignments. Plots show the chromosome locations of pairs of predicted proteins that have
  significant similarity (on the basis of fasta3 comparisons). (a) V. cholerae chrI versus E. coli. All significant matches for each
  V. cholerae ORF are shown. (b) V. cholerae chrI versus E. coli, top matches. Only the best match for each V. cholerae ORF is
  shown. The filtering for top matches for each ORF removes noise due to the presence of many large multigene families. This
  X-alignment is highly statistically significant (Table 2). (c) Chlamydia trachomatis versus C. pneumoniae, top matches. Only the
  best match for each C. trachomatis ORF is shown. The position of the origins (R) and termini (T) of replication are slightly
  shifted to see the inversions better. This pattern is consistent with the occurrence a small number of inversions around the
  origin and terminus in the two lineages since their divergence from a common ancestor (see Figure 4). (d) V. cholerae chrI
  versus V. cholerae chrI, self-alignment. Only recently duplicated pairs of genes are shown. Recent duplications were
  operationally defined as those genes that were more similar to another gene in V. cholerae than to any gene in any other
  complete genome sequence. The faint X-alignment is statistically significant (Table 2). No significant X-alignment was detected
  when all pairs of paralogs were included.


  explained by gene loss of one of the two duplicates of many                                         between species, the member of the gene pair lost in a par-
  of the pairs of genes in the different lineages. Gene loss has                                      ticular lineage should be essentially random. If an ancient
  been found to follow large chromosomal or genome duplica-                                           inverted duplication followed by differential gene loss is the
  tions [11-13]. This gene loss is thought to stabilize large                                         correct explanation for the observed X-alignments, one
  duplications by preventing recombination events between                                             would expect the genes along one diagonal to be orthologous
  duplicate genes. If gene loss is responsible for the weak                                           between species (related to each other by the speciation
  X-alignment within species, then to maintain the X-alignments                                       event), while the genes along the other diagonal should be
http://genomebiology.com/2000/1/6/research/0011.5




 (a)              3,000,000                                                                               (b)
                                                                                                                             4,000,000




                                                                                                                                                                                                                                            comment
                  2,000,000
                                                                                                                             3,000,000
    V. cholerae




                                                                                                             E. coli
                                                                                                                             2,000,000
                  1,000,000
                                                                                                                             1,000,000




                                                                                                                                                                                                                                            reviews
                          0                                                                                                         0
                              0




                                             1,000,000




                                                                     2,000,000




                                                                                              3,000,000




                                                                                                                                         0


                                                                                                                                                  1,000,000


                                                                                                                                                                    2,000,000


                                                                                                                                                                                            3,000,000


                                                                                                                                                                                                                    4,000,000
                                  V. cholerae chromosome I                                                                                                          E. coli


 (c)              2,000,000
                                                                                                          (d)




                                                                                                                                                                                                                                            reports
                                                                                                                              2,000,000

                  1,500,000
                                                                                                                              1,500,000
                                                                                                             S. pneumoniae
    S. pyogenes




                  1,000,000
                                                                                                                              1,000,000




                                                                                                                                                                                                                                            deposited research
                   500,000                                                                                                     500,000


                         0                                                                                                           0
                                                                                                                                          0


                                                                                                                                                          500,000


                                                                                                                                                                                1,000,000


                                                                                                                                                                                                        1,500,000


                                                                                                                                                                                                                                2,000,000
                              0



                                   500,000



                                                         1,000,000



                                                                                 1,500,000



                                                                                             2,000,000




                                                                                                                                                                                                                                            refereed research
                                         S. pyogenes                                                                                                            S. pneumoniae


Figure 3
Within-genome DNA alignments. Plots of exactly matching sequences within four genomes as identified by the MUMmer
program: (a) V. cholerae; (b) E. coli; (c) S. pyogenes; (d) S. pneumoniae. A point (x,y) indicates a DNA sequence that is
repeated within the genome, occurring once at location x and again at location y. Points near the diagonal (y = x) correspond
to tandem repeats. Points near the anti-diagonal (y = L -x, where L is genome length) correspond to repeats that occur at
symmetric locations about the origin of location. The point (0,0) corresponds to the origin of replication in each plot.
Statistically significant X-alignments occur in all four species (Table 1).

                                                                                                                                                                                                                                            interactions
paralogous (related to each other by the genome duplication                                               possible mechanisms for such movement, but we believe the
event before the speciation of the two lineages). However,                                                most likely explanation is the occurrence of large chromoso-
the evidence appears to contradict this model: likely ortholo-                                            mal inversions that pivot around the replication origin
gous gene pairs are equally distributed on each diagonal                                                  and/or terminus. Large chromosomal inversions, including
(data not shown).                                                                                         those that occur around the replication origin and terminus,
                                                                                                          have been shown to occur in E. coli and Salmonella
                                                                                                                                                                                                                                            information




Model II: chromosomal inversions about the origin                                                         typhimurium in the laboratory (see, for example, [14-18]).
and/or terminus                                                                                           The occurrence of such inversions over evolutionary time
A second possible explanation for the X-alignments is that an                                             scales was first suggested by comparative analysis of the com-
underlying mechanism allows sections of DNA to move                                                       plete genomes of four strains in the genus Chlamydia [19]. In
within the genome but maintains the distance of these sec-                                                that study, we found that the major chromosomal differ-
tions from the origin and/or terminus. There are a variety of                                             ences between C. pneumoniae and C. trachomatis (shown in
6 Genome Biology             Vol 1 No 6    Eisen et al.




                                                                   A2                                                       A3

                                                          31 32 1 2                                          31 32 1 2                                                 2 1 32 31
                                                       30            3                                    30            3                                         3                30
                                                    29                 4                               29                 4                                    4                     29
                                                  28                      5                          28                      5                               5                          28
                                                27                          6                      27                          6
                                                                                                                                            A2            *
                                                                                                                                                          27                              *
                                                                                                                                                                                          6
                                               26                            7         A1         26                            7                        26                                7
                                               25
                                               24           A1                8
                                                                              9
                                                                                                  25
                                                                                                  24
                                                                                                                  A2             8
                                                                                                                                 9
                                                                                                                                                         25
                                                                                                                                                         24              A3                 8
                                                                                                                                                                                            9
                                               23                           10                    23                           10                        23                                10
                                                22                         11                      22                         11                          22                              11
                                                  21
                                                   20
                                                                         12
                                                                        13        Inversion          21
                                                                                                      20
                                                                                                                            12
                                                                                                                           13           Inversion           21
                                                                                                                                                              20
                                                                                                                                                                                        12
                                                                                                                                                                                       13
                                                      19             14                                  14             19                                       14                 19
                                                         18 17 16 15
                                                                                  around                 *  15 16 17 18       *         around                        15 16 17 18


                                                                                  terminus (*)                                          origin (*)
                  32 1 2
            30 31         3
         29                 4
       28                      5
     27                          6
    26   Common                   7
    25
    24   ancestor of
                                   8
                                   9      B1                                                B2                                                   B3
    23                           10
     22  A and B                11
       21                     12
        20                   13
           19             14
              18 17 16 15


                                                              A1                                                 A2                                                    A3

                                                       30 31
                                                             32 1 2
                                                                     3
                                                                                  Inversion                  30 31
                                                                                                                   32 1 2
                                                                                                                           3
                                                                                                                                        Inversion             3 2
                                                                                                                                                                    1 32 31
                                                                                                                                                                            30
                                                    29                 4                                  29                 4                             *
                                                                                                                                                           29                  4*
                                                  28                      5       around                28                      5       around           28                       5
                                                27                          6                         27                          6                    27                           6
                                               26                            7    terminus (*)       26                            7    origin (*)    26                             7
                                               25                             8                  *   8                              *
                                                                                                                                   25                 8                              25
                                               24
                                               23
                                                            B1              10
                                                                              9                      9
                                                                                                     10
                                                                                                                 B2                24
                                                                                                                                  23
                                                                                                                                                      9
                                                                                                                                                      10
                                                                                                                                                                      B3             24
                                                                                                                                                                                    23
                                                22                         11                         11                         22                    11                          22
                                                  21                     12                            12                      21                       12                       21
                                                   20                   13                               13                   20                          13                    20
                                                      19             14                                     14             19                                14              19
                                                         18 17 16 15                                           15 16 17 18                                      15 16 17 18




                                                                    B2                                                      B3



                                                                                       B1                                                  B2

  Figure 4
  Schematic model of genome inversions. The model shows an initial speciation event, followed by a series of inversions in the
  different lineages (A and B). Inversions occur between the asterisks (*). Numbers on the chromosome refer to hypothetical
  genes 1-32. At time point 1, the genomes of the two species are still co-linear (as indicated in the scatterplot of A1 versus
  B1). Between time point 1 and time point 2, each species (A and B) undergoes a large inversion about the terminus (as
  indicated in the scatterplots of A1 versus A2 and B1 versus B2). This results in the between-species scatterplot looking as if
  there have been two nested inversions (A2 versus B2), similar to that seen for C. trachomatis versus C. pneumoniae (see
  Figure 2). Between time point 2 and time point 3 each species undergoes an additional inversion (as indicated in the
  scatterplots of B2 versus B3 and A2 versus A3). This results in the between-species scatterplots beginning to resemble an
  X-alignment, similar to that seen in M. tuberculosis versus M. leprae (see Figure 2).



  Figure 2c) were consistent with the occurrence of large                                        could produce patterns very similar to those seen in the
  inversions that pivoted around the origin and terminus                                         Chlamydia, Mycobacterium and Helicobacter comparisons.
  (including multiple inversions of different sizes). In Figure 4                                The continued occurrence of such inversion over longer time
  we present a hypothetical model showing how a small                                            scales would result in an X-alignment similar to that seen in
  number of inversions centered around the origin or terminus                                    the V. cholerae versus E. coli and S. pneumoniae versus
http://genomebiology.com/2000/1/6/research/0011.7




S. pyogenes comparisons. Thus the different between-               within the bacterial chromosome that inverts the sequence
species X-alignments could be the result of different              from the origin up to that point. As with inversion events,
numbers of inversions between particular pairs of species.         recombination and replication have been found to be tightly




                                                                                                                                          comment
                                                                   coupled [25].
Inversions about the origin and terminus could also produce
an X-alignment within species, through the splitting of
tandemly duplicated sequence. Many sets of tandemly dupli-         Conclusions
cated genes are found in most bacterial genomes [19,20]            We present here a novel observation regarding the conserva-
(also see Figure 3a,c). As tandem duplications are inherently      tion between bacterial species of the distance of particular
unstable (one of the duplicates can be rapidly eliminated by       genes from the replication origin or terminus. The initial
slippage and/or recombination events [21]), the fact that          observation was only possible due to the availability of com-




                                                                                                                                          reviews
many tandem pairs are present within each genome suggests          plete genome sequences from pairs of moderately closely
that tandem duplications occur frequently. Thus, it is rea-        related species (for example, V. cholerae and E. coli). This
sonable to assume that occasionally a large inversion will         shows the importance of having genome pairs from many
split a pair of tandemly duplicated genes. An inversion that       levels of evolutionary relatedness. Comparisons of distantly
pivots about the origin and also splits a tandem duplication       related species enable the determination of universal fea-
will result in a pair of paralogous genes spaced symmetri-         tures of life as well as of events that occur very rarely. Com-
cally on opposite sides of the origin.                             parison of very closely related species allows the
                                                                   identification of frequent events such as transitional changes




                                                                                                                                          reports
If our inversion model is correct, then the genes along both       at third codon positions or tandem duplications. To eluci-
diagonals in the between-species alignments should be              date all other events in the history of life, genome pairs cov-
orthologous, which is the case (see above). In contrast, genes     ering all the intermediate levels of evolutionary relatedness
along the anti-diagonal in the within-species X-alignments         will be needed.
should be recent tandem duplicates that have been sepa-
rated by inversions. This also appears to be the case - in the
within-species analysis of V. cholerae chrI ORFs, the




                                                                                                                                          deposited research
                                                                   Materials and methods
X-alignment shows up best when only recent duplicates are          Genomes analyzed
analyzed (Figure 2d). The splitting of tandem duplicates by        Complete published genome sequences were obtained from the
inversions may be a general mechanism to stabilize the coex-       National Center for Biotechnology Information website [26] or
istence of duplicated genes, as it will prevent their elimina-     from the TIGR Comprehensive Microbial Resource [27]. These
tion by unequal crossing-over or replication slippage events.      included Aeropyrum pernix [28], Aquifex aeolicus [29],
                                                                   Archaeoglobus fulgidus [30], Bacillus subtilis [31], Borrelia
What could cause inversions that pivot around the origin           burgdorferi [32], Campylobacter jejuni [33], Chlamydia
and terminus of the genome to occur more frequently than           pneumoniae AR39 [19], Chlamydia pneumoniae CWL029




                                                                                                                                          refereed research
other inversions? One possibility is that many inversions          [34], Chlamydia trachomatis (D/UW-3/Cx) [35], Chlamydia
occur, but there is selection against those that change the        trachomatis MoPn [19], Deinococcus radiodurans [36],
distance of a gene from the origin or terminus. Such a possi-      Escherichia coli [5], Haemophilus influenzae [37], Helicobac-
bility has been suggested by experimental work in E. coli          ter pylori [38], Helicobacter pylori J99 [39], Methanobac-
[14,15]. Additional studies have, however, suggested that          terium thermoautotrophicum [40], Methanococcus jannaschii
there is little selective difference between inversions and that   [41], Mycobacterium tuberculosis [8], Mycoplasma genital-
instead there may be certain regions that are more prone to        ium [42], Mycoplasma pneumoniae [43], Neisseria meningi-
inversion than others [16-18,22,23]. Alternatively, the inver-     tidis MC58 [20], Neisseria meningitidis serogroup A strain
sion events could be linked to replication, as has been sug-       Z2491 [44], Pyrococcus horikoshii [45], Rickettsia prowazekii
gested for small local inversion events [24]. Whatever the         [46], Synechocystis sp. [47], Thermotoga maritima [48], Tre-           interactions
mechanisms, the fact that we find evidence for such inver-         ponema pallidum [49], and Vibrio cholerae [4]. In addition, a
sions between many pairs of species suggests that they are a       few unpublished genomes were analyzed: Streptococcus pyo-
common feature of bacterial evolution. Many aspects of the         genes (obtained from the Oklahoma University Genome Center
X-alignments require further exploration. For example, to          website [7]), Streptococcus pneumoniae (H. Tettelin, personal
split a tandem duplication, an inversion must fall precisely       communication), and Mycobacterium leprae (obtained from
on the boundary between two duplicated genes. This would           the Sanger Centre Pathogen Sequencing Group website [9]).
                                                                                                                                          information




appear to be unlikely, requiring a large number of inversions
in order to generate a sufficient number of split gene pairs. If   Whole-genome DNA alignments
the mechanisms of gene duplication are somehow related to          DNA alignments of the complete genomic sequences of all
the mechanisms of inversion, however, then this model is           bacteria used in this study were accomplished with the
more plausible. The process of duplicating a gene, if it occurs    MUMmer program [6]. This program uses an efficient suffix
during replication, might promote a recombination event            tree construction algorithm to rapidly compute alignments
8 Genome Biology    Vol 1 No 6   Eisen et al.




  of entire genomes. The algorithm identifies all exact matches     termini of S. pneumoniae and S. pyogenes were determined
  of nucleotide subsequences that are contained in both input       by the authors of the present study using GC-skew analysis
  sequences; these exact matches must be longer than a speci-       and the locations of characteristic genes, particularly the
  fied minimum length, which was set to 20 base pairs for this      chromosome replication initiator gene dnaA.
  comparison. To search for genome-scale alignments within
  species, complete bacterial and archaeal genomes (25 in
  total including all published genomes) were aligned with          Acknowledgements
  their own reverse complements. To search for between-             We thank S. Eddy, M.A. Riley, T. Read, A. Stoltzfus, M-I Benito and I.
                                                                    Paulsen for helpful comments, suggestions and discussions. S.L.S. was sup-
  species alignments, all genomes were aligned against all          ported in part by NSF grant IIS-9902923 and NIH grant R01 LM06845. S.L.S.
  others in both orientations.                                      and J.A.E were supported in part by NSF grant KDI-9980088. Data for all
                                                                    published complete genome sequences were obtained from the NCBI
                                                                    genomes database [26] or from The Institute for Genomic Research (TIGR)
  Whole-genome protein comparisons                                  Microbial Genome Database [27]. The sequences of V. cholerae, S. pneumo-
  The predicted proteome of each complete genome                    niae, and M. tuberculosis (CDC 1551) were determined at TIGR with
                                                                    support from NIH and the NIAID. The M. leprae sequence data were pro-
  sequence (all predicted proteins in the genome) was com-          duced by the Pathogen Sequencing Group at the Sanger Centre. Sequencing
  pared to the proteomes of all complete genome sequences           of M. leprae is funded by the Heiser Program for Research in Leprosy and
  (including itself) using the fasta3 program [50]. Matches         Tuberculosis of The New York Community Trust and by L’Association
                                                                    Raoul Follereau. The M. tuberculosis CDC 1551 genome sequence was
  with an expected score (e-value) of 10–5 or less were con-        obtained from TIGR. The source of the S. pyogenes genome sequence was
  sidered significant.                                              the Streptococcal Genome Sequencing Project funded by USPHS/NIH grant
                                                                    AI38406, and was kindly made available by B. A. Roe, S.P. Linn, L. Song, X.
                                                                    Yuan, S. Clifton, R.E. McLaughlin, M. McShan and J. Ferretti, and can be
  Statistical significance of X-alignments                          obtained from the website of the Oklahoma University Genome Center [7].
  To calculate the statistical significance of the X-alignments,
  the maximal unique matching subsequences (MUMs) for
  unrelated genomes were examined and found to be uni-              References
  formly distributed [6]. With a uniform background, the            1.    Seoighe C, Wolfe KH: Updated map of duplicated regions in
                                                                          the yeast genome. Gene 1999, 238:253-261.
  expected density of MUMs in any region of an alignment            2.    Lin X, Kaul S, Rounsley S, Shea TP, Benito MI, Town CD, Fujii CY,
  plot is a simple proportion of the area of that region to the           Mason T, Bowman CL, Barnstead M, et al.: Sequence and analysis
                                                                          of chromosome 2 of the plant Arabidopsis thaliana. Nature
  entire plot. In particular, in an alignment with N total                1999, 402:761-768.
  MUMs, the probability (Pr) of observing at least m matches        3.    Mayer K, Schuller C, Wambutt R, Murphy G, Volckaert G, Pohl T,
  in a region with area p can be computed using the binomial              Dusterhoft A, Stiekema W, Entian KD, Terryn N, et al.: Sequence
                                                                          and analysis of chromosome 4 of the plant Arabidopsis
  distribution in Equation 1:                                             thaliana. Nature 1999, 402:769-777.
                     N                                              4.    Heidelberg JF, Eisen JA, Nelson WC, Clayton RA, Gwinn ML,
                             N                                            Dodson RJ, Haft DH, Hickey EK, Peterson JD, Umayam L, et al.: The
             Pr =                 px (1 - p) (N - x)                      genome sequence of Vibrio cholerae, the aetiologic agent of
                                                             (1)
                    x=m      x                                            cholera. Nature 2000, 406:477-484.
                                                                    5.    Blattner FR, Plunkett GI, Bloch CA, Perna NT, Burland V, Riley M,
                                                                          Collado-Vides J, Glasner JD, Rode CK, Mayhew GF, et al.: The com-
  The alignment of V. cholerae chrI (both forward and reverse             plete genome sequence of Escherichia coli K-12. Science 1997,
  strands) versus E. coli contains 926 MUMs. The MUMs                     277:1453-1462.
                                                                    6.    Delcher AL, Kasif S, Fleischmann RD, Peterson J, White O, Salzberg
  forming X-alignments appear along the diagonal (y = x) and              SL: Alignment of whole genomes. Nucleic Acids Res 1999,
  the anti-diagonal (y = L -x, where L is the genome length).             27:2369-2376.
  To estimate the significance of the alignments in both direc-     7.    Oklahoma University Genome Center
                                                                          [http://www.genome.ou.edu/strep.html]
  tions, diagonal strips were sampled along each of the diago-      8.    Cole ST, Brosch R, Parkhill J, Garnier T, Churcher C, Harris D,
  nals. The width of each strip was set at 10% of the plot area           Gordon SV, Eiglmeier K, Gas S, Barry III CE, et al.: Deciphering the
                                                                          biology of Mycobacterium tuberculosis from the complete
  and significance values were calculated (Table 1).                      genome sequence. Nature 1998, 393:537-544.
                                                                    9.    Sanger Centre Pathogen Sequencing Group
  Identification of origins of replication                                [ftp://ftp.sanger.ac.uk/pub/pathogens/leprae]
                                                                    10.   Zipkas D, Riley M: Proposal concerning mechanism of evolu-
  The origins of replication for the bacterial genomes have been          tion of the genome of Escherichia coli. Proc Natl Acad Sci USA
  characterized by a variety of methods. For E. coli, M. tubercu-         1975, 72:1354-1358.
                                                                    11.   Wagner A: The fate of duplicated genes: loss or new func-
  losis and M. leprae, the origins have been well-characterized           tion? BioEssays 1998, 20:785-788.
  by laboratory studies [51,52]. The origins and termini of C.      12.   Lynch M, Force A: The probability of duplicate gene preserva-
  trachomatis, C. pneumoniae and V. cholerae were identified              tion by subfunctionalization. Genetics 2000, 154:459-473.
                                                                    13.   Nadeau JH, Sankoff D: Comparable rates of gene loss and func-
  by GC-skew [53] and by characteristic genes in the region of            tional divergence after genome duplications early in verte-
  the origin [4,19]. GC-skew uses the function (G-     C)/(G+C)           brate evolution. Genetics 1997, 147:1259-1266.
                                                                    14.   Francois V, Louarn J, Patte J, Rebollo JE, Louarn JM: Constraints in
  computed on 2,000 bp windows across the genome, which                   chromosomal inversions in Escherichia coli are not explained
  exhibits a clear tendency in many bacterial genomes to be               by replication pausing at inverted terminator-like
  positive for the leading strand and negative for the lagging            sequences. Mol Microbiol 1990, 4:537-542.
                                                                    15.   Rebollo JE, Francois V, Louarn JM: Detection and possible role of
  strand. The origin of H. pylori was determined by oligomer              two large nondivisible zones on the Escherichia coli chromo-
  skew [54] and confirmed by GC-skew. The origins and                     some. Proc Natl Acad Sci USA 1988, 85:9391-9395.
http://genomebiology.com/2000/1/6/research/0011.9




16. Segall A, Mahan MJ, Roth JR: Rearrangement of the bacterial            39. Alm RA, Ling LS, Moir DT, King BL, Brown ED, Doig PC, Smith DR,
    chromosome: forbidden inversions. Science 1988, 241:1314-1318.             Noonan B, Guild BC, deJonge BL, et al.: Genomic-sequence com-
17. Mahan MJ, Roth JR: Ability of a bacterial chromosome                       parison of two unrelated isolates of the human gastric
    segment to invert is dictated by included material rather                  pathogen Helicobacter pylori. Nature 1999, 397:176-180.




                                                                                                                                                         comment
    than flanking sequence. Genetics 1991, 129:1021-1032.                  40. Smith DR, Doucette-Stamm LA, Deloughery C, Lee H, Dubois J,
18. Segall AM, Roth JR: Recombination between homologies in                    Aldredge T, Bashirzadeh R, Blakely D, Cook R, Gilbert K, et al.:
    direct and inverse orientation in the chromosome of Salmo-                 Complete genome sequence of Methanobacterium ther-
    nella: intervals which are nonpermissive for inversion for-                moautotrophicum DH: functional analysis and comparative
    mation. Genetics 1989, 122:737-747.                                        genomics. J Bacteriol 1996, 179:7135-7155.
19. Read TD, Brunham RC, Shen C, Gill SR, Heidelberg JF, White O,          41. Bult CJ, White O, Olsen GJ, Zhou L, Fleischmann RD, Sutton GG,
    Hickey EK, Peterson J, Utterback T, Berry K, et al.: Genome                Blake JA, Fitzgerald LM, Clayton RA, Gocayne JD, et al.: Complete
    sequences of Chlamydia trachomatis MoPn and Chlamydia                      genome sequence of the methanogenic archaeon,
    pneumoniae AR39. Nucleic Acids Res 2000, 28:1397-1406.                     Methanococcus jannaschii. Science 1996, 273:1058-1073.
20. Tettelin H, Saunders NJ, Heidelberg J, Jeffries AC, Nelson KE, Eisen   42. Fraser CM, Gocayne JD, White O, Adams MD, Clayton RA, Fleis-
    JA, Ketchum KA, Hood DW, Peden JF, Dodson RJ, et al.: Complete             chmann RD, Bult CJ, Kerlavage AR, Sutton G, Kelley JM, et al.: The
    genome sequence of Neisseria meningitidis serogroup B




                                                                                                                                                         reviews
                                                                               minimal gene complement of Mycoplasma genitalium. Science
    strain MC58. Science 2000, 287:1809-1815.                                  1995, 270:397-403.
21. Force A, Lynch M, Pickett FB, Amores A, Yan YL, Postlethwait J:        43. Himmelreich R, Hilbert H, Plagens H, Pirkl E, Li BC, Herrmann R:
    Preservation of duplicate genes by complementary, degen-                   Complete sequence analysis of the genome of the bac-
    erative mutations. Genetics 1999, 151:1531-1545.                           terium Mycoplasma pneumoniae. Nucleic Acids Res 1996,
22. Schmid MB, Roth JR: Selection and endpoint distribution of                 24:4420-4449.
    bacterial inversion mutations. Genetics 1983, 105:539-557.             44. Parkhill J, Achtman M, James KD, Bentley SD, Churcher C, Klee SR,
23. Mahan MJ, Roth JR: Reciprocality of recombination events that              Morelli G, Basham D, Brown D, Chillingworth T, et al.: Complete
    rearrange the chromosome. Genetics 1988, 120:23-35.                        DNA sequence of a serogroup A strain of Neisseria meningi-
24. Gordon AJ, Halliday JA: Inversions with deletions and duplica-             tidis Z2491. Nature 2000, 404:502-506.
    tions. Genetics 1995, 140:411-414.                                     45. Kawarabayasi Y, Sawada M, Horikawa H, Haikawa Y, Hino Y,
25. Valencia-Morales E, Romero D: Recombination enhancement




                                                                                                                                                         reports
                                                                               Yamamoto S, Sekine M, Baba S, Kosugi H, Hosoyama A, et al.: Com-
    by replication (RER) in Rhizobium etli. Genetics 2000, 154:971-983.        plete sequence and gene organization of the genome of a
26. National Center for Biotechnology Information, Entrez                      hyperthermophilic archaebacterium, Pyrococcus horikoshii
    Genomes [http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Genome]          OT3. DNA Res 1998, 5:55-76.
27. The Institute for Genomic Research Microbial Genome                    46. Andersson SG, Zomorodipour A, Andersson JO, Sicheritz-Ponten T,
    Database [http://www.tigr.org/tdb/mdb/mdb.html]                            Alsmark UC, Podowski RM, Naslund AK, Eriksson AS, Winkler HH,
28. Kawarabayasi Y, Hino Y, Horikawa H, Yamazaki S, Haikawa Y, Jin-no          Kurland CG: The genome sequence of Rickettsia prowazekii
    K, Takahashi M, Sekine M, Baba S, Ankai A, et al.: Complete                and the origin of mitochondria. Nature 1998, 396:133-140.
    genome sequence of an aerobic hyper-thermophilic crenar-               47. Kaneko T, Sato S, Kotani H, Tanaka A, Asamizu E, Nakamura Y,




                                                                                                                                                         deposited research
    chaeon, Aeropyrum pernix K1. DNA Res 1999, 6:83-101, 145-52.               Miyajima N, Hirosawa M, Sugiura M, Sasamoto S, et al.: Sequence
29. Deckert G, Warren PV, Gaasterland T, Young WG, Lenox AL,                   analysis of the genome of the unicellular cyanobacterium
    Grahams DE, Overbeek R, Snead MA, Keller M, Aujay M, et al.: The           Synechocystis sp. strain PCC6803. II. Sequence determina-
    complete genome of the hyperthemophilic bacterium                          tion of the entire genome and assignment of potential
    Aquifex aeolicus. Nature 1998, 392:353-358.                                protein-coding regions. DNA Res 1996, 3:109-136.
30. Klenk H-P, Clayton RA, Tomb J-F, White O, Nelsen KE, Ketchum
                                                                           48. Nelson KE, Clayton RA, Gill SR, Gwinn ML, Dodson RJ, Haft DH,
    KA, Dodson RJ, Gwinn M, Hickey EK, Peterson JD, et al.: The com-
                                                                               Hickey EK, Peterson JD, Nelson WC, Ketchum KA, et al.: Evidence
    plete genomic sequence of the hyperthermophilic, sulfate-
                                                                               for lateral gene transfer between Archaea and bacteria
    reducing archaeon Archaeoglobus fulgidus. Nature 1997,
                                                                               from genome sequence of Thermotoga maritima. Nature 1999,
    390:364-370.
31. Kunst A, Ogasawara N, Moszer I, Albertini A, Alloni G, Azevedo V,          399:323-329.
    Bertero M, Bessieres P, Bolotin A, Borchert S, et al.: The complete    49. Fraser CM, Norris SJ, Weinstock GM, White O, Sutton GG, Dodson




                                                                                                                                                         refereed research
    genome sequence of the Gram-positive bacterium Bacillus                    R, Gwinn M, Hickey EK, Clayton R, Ketchum KA, et al.: Complete
    subtilis. Nature 1997, 390:249-256.                                        genome sequence of Treponema pallidum, the syphilis spiro-
32. Fraser CM, Norris SJ, Weinstock GM, White O, Sutton GG, Dodson             chete. Science 1998, 281:375-388.
    R, Gwinn M, Hickey EK, Clayton R, Ketchum KA, et al.: Genomic          50. Pearson WR: Flexible sequence similarity searching with the
    sequence of a Lyme disease spirochaete, Borrelia burgdorferi.              FASTA3 program package. Methods Mol Biol 2000, 132:185-219.
    Nature 1997, 390:580-586.                                              51. Marsh RC, Worcel A: A DNA fragment containing the origin
33. Parkhill J, Wren BW, Mungall K, Ketley JM, Churcher C, Basham D,           of replication of the Escherichia coli chromosome. Proc Natl
    Chillingworth T, Davies RM, Feltwell T, Holroyd S, et al.: The             Acad Sci USA 1977, 74:2720-2724.
    genome sequence of the food-borne pathogen Campylobac-                 52. Salazar L, Fsihi H, de Rossi E, Riccardi G, Rios C, Cole ST, Takiff HE:
    ter jejuni reveals hypervariable sequences. Nature 2000,                   Organization of the origins of replication of the chromo-
    403:665-668.                                                               somes of Mycobacterium smegmatis, Mycobacterium leprae
34. Kalman S, Mitchell W, Marathe R, Lammel C, Fan J, Hyman RW,                and Mycobacterium tuberculosis and isolation of a functional
                                                                               origin from M. smegmatis. Mol Microbiol 1996, 20:283-293.
                                                                                                                                                         interactions
    Olinger L, Grimwood J, Davis RW, Stephens RS: Comparative
    genomes of Chlamydia pneumoniae and C. trachomatis. Nat                53. Lobry JR: Asymmetric substitution patterns in the two DNA
    Genet 1999, 21:385-389.                                                    strands of bacteria. Mol Biol Evol 1996, 13:660-665.
35. Stephens RS, Kalman S, Lammel C, Fan J, Marathe R, Aravind L,          54. Salzberg SL, Salzberg AJ, Kerlavage AR, Tomb JF: Skewed
    Mitchell W, Olinger L, Tatusov RL, Zhao Q, et al.: Genome                  oligomers and origins of replication. Gene 1998, 217:57-67.
    sequence of an obligate intracellular pathogen of humans:
    Chlamydia trachomatis. Science 1998, 282:754-759.
36. White O, Eisen JA, Heidelberg JF, Hickey EK, Peterson JD, Dodson
    RJ, Haft DH, Gwinn ML, Nelson WC, Richardson DL, et al.:
    Genome sequence of the radioresistant bacterium
    Deinococcus radiodurans R1. Science 1999, 286:1571-1577.
                                                                                                                                                         information




37. Fleischmann RD, Adams MD, White O, Clayton RA, Kirkness EF,
    Kerlavage AR, Bult CJ, Tomb JF, Dougherty BA, Merrick JM, et al.:
    Whole-genome random sequencing and assembly of
    Haemophilus influenzae Rd. Science 1995, 269:496-512.
38. Tomb JF, White O, Kerlavage AR, Clayton RA, Sutton GG, Fleis-
    chmann RD, Ketchum KA, Klenk HP, Gill S, Dougherty BA, et al.:
    The complete genome sequence of the gastric pathogen
    Helicobacter pylori. Nature 1997, 388:539-547.

Más contenido relacionado

La actualidad más candente

EVE 161 Winter 2018 Class 14
EVE 161 Winter 2018 Class 14EVE 161 Winter 2018 Class 14
EVE 161 Winter 2018 Class 14Jonathan Eisen
 
R. Redfield's SMBE talk slides
R. Redfield's SMBE talk slidesR. Redfield's SMBE talk slides
R. Redfield's SMBE talk slidesRosie Redfield
 
Mapping the bacteriophage genome
Mapping the bacteriophage genomeMapping the bacteriophage genome
Mapping the bacteriophage genomevibhakhanna1
 
Microbial Phylogenomics (EVE161) Class 15: Shotgun Metagenomics
Microbial Phylogenomics (EVE161) Class 15: Shotgun Metagenomics Microbial Phylogenomics (EVE161) Class 15: Shotgun Metagenomics
Microbial Phylogenomics (EVE161) Class 15: Shotgun Metagenomics Jonathan Eisen
 
Macromolecule evolution
Macromolecule  evolutionMacromolecule  evolution
Macromolecule evolutionPaula Mills
 
Characterization of CesA6 in Oat copy
Characterization of CesA6 in Oat copyCharacterization of CesA6 in Oat copy
Characterization of CesA6 in Oat copyCassie Dohse
 
Introduction of Animal Genetics & History of Genetics
Introduction of Animal Genetics & History of GeneticsIntroduction of Animal Genetics & History of Genetics
Introduction of Animal Genetics & History of GeneticsAashish Patel
 
Lets Make a Mammoth
Lets Make a Mammoth  Lets Make a Mammoth
Lets Make a Mammoth Cheche Salas
 
Microbial Phylogenomics (EVE161) Class 10-11: Genome Sequencing
Microbial Phylogenomics (EVE161) Class 10-11: Genome SequencingMicrobial Phylogenomics (EVE161) Class 10-11: Genome Sequencing
Microbial Phylogenomics (EVE161) Class 10-11: Genome SequencingJonathan Eisen
 
UC Davis EVE161 Lecture 8 - rRNA ecology - by Jonathan Eisen @phylogenomics
UC Davis EVE161 Lecture 8 - rRNA ecology - by Jonathan Eisen @phylogenomicsUC Davis EVE161 Lecture 8 - rRNA ecology - by Jonathan Eisen @phylogenomics
UC Davis EVE161 Lecture 8 - rRNA ecology - by Jonathan Eisen @phylogenomicsJonathan Eisen
 
DNA and the hidden world of microbes
DNA and the hidden world of microbesDNA and the hidden world of microbes
DNA and the hidden world of microbesJonathan Eisen
 
Chapter 2 classical genetics
Chapter 2 classical geneticsChapter 2 classical genetics
Chapter 2 classical geneticsMosesPackiaraj2
 
Introduction to genetics
Introduction to geneticsIntroduction to genetics
Introduction to geneticsSekho Science
 
Historical development of genetics final
Historical development of genetics finalHistorical development of genetics final
Historical development of genetics finalHotaru Imai
 

La actualidad más candente (20)

EVE 161 Winter 2018 Class 14
EVE 161 Winter 2018 Class 14EVE 161 Winter 2018 Class 14
EVE 161 Winter 2018 Class 14
 
R. Redfield's SMBE talk slides
R. Redfield's SMBE talk slidesR. Redfield's SMBE talk slides
R. Redfield's SMBE talk slides
 
Mapping the bacteriophage genome
Mapping the bacteriophage genomeMapping the bacteriophage genome
Mapping the bacteriophage genome
 
Bacteriology
BacteriologyBacteriology
Bacteriology
 
Microbial Phylogenomics (EVE161) Class 15: Shotgun Metagenomics
Microbial Phylogenomics (EVE161) Class 15: Shotgun Metagenomics Microbial Phylogenomics (EVE161) Class 15: Shotgun Metagenomics
Microbial Phylogenomics (EVE161) Class 15: Shotgun Metagenomics
 
Macromolecule evolution
Macromolecule  evolutionMacromolecule  evolution
Macromolecule evolution
 
RODRIGUEZMENDOZ-THESIS-2014
RODRIGUEZMENDOZ-THESIS-2014RODRIGUEZMENDOZ-THESIS-2014
RODRIGUEZMENDOZ-THESIS-2014
 
Characterization of CesA6 in Oat copy
Characterization of CesA6 in Oat copyCharacterization of CesA6 in Oat copy
Characterization of CesA6 in Oat copy
 
Introduction of Animal Genetics & History of Genetics
Introduction of Animal Genetics & History of GeneticsIntroduction of Animal Genetics & History of Genetics
Introduction of Animal Genetics & History of Genetics
 
Applied virology converted
Applied virology convertedApplied virology converted
Applied virology converted
 
Lets Make a Mammoth
Lets Make a Mammoth  Lets Make a Mammoth
Lets Make a Mammoth
 
Microbial Phylogenomics (EVE161) Class 10-11: Genome Sequencing
Microbial Phylogenomics (EVE161) Class 10-11: Genome SequencingMicrobial Phylogenomics (EVE161) Class 10-11: Genome Sequencing
Microbial Phylogenomics (EVE161) Class 10-11: Genome Sequencing
 
UC Davis EVE161 Lecture 8 - rRNA ecology - by Jonathan Eisen @phylogenomics
UC Davis EVE161 Lecture 8 - rRNA ecology - by Jonathan Eisen @phylogenomicsUC Davis EVE161 Lecture 8 - rRNA ecology - by Jonathan Eisen @phylogenomics
UC Davis EVE161 Lecture 8 - rRNA ecology - by Jonathan Eisen @phylogenomics
 
e. coli
e. colie. coli
e. coli
 
DNA and the hidden world of microbes
DNA and the hidden world of microbesDNA and the hidden world of microbes
DNA and the hidden world of microbes
 
Chapter 2 classical genetics
Chapter 2 classical geneticsChapter 2 classical genetics
Chapter 2 classical genetics
 
Introduction to genetics
Introduction to geneticsIntroduction to genetics
Introduction to genetics
 
Historical development of genetics final
Historical development of genetics finalHistorical development of genetics final
Historical development of genetics final
 
Chloroplast dna
Chloroplast dnaChloroplast dna
Chloroplast dna
 
General Genetics Lec 1
General Genetics Lec 1General Genetics Lec 1
General Genetics Lec 1
 

Similar a Evidence for symmetric chromosomal inversions around the replication origin in bacteria

BITS - Introduction to comparative genomics
BITS - Introduction to comparative genomicsBITS - Introduction to comparative genomics
BITS - Introduction to comparative genomicsBITS
 
Introduction to Genetics
Introduction to GeneticsIntroduction to Genetics
Introduction to GeneticsCEU
 
Cytotaxonomy(Its History,Background, Chromosome Evolution in Primates.pptx
Cytotaxonomy(Its History,Background, Chromosome Evolution in Primates.pptxCytotaxonomy(Its History,Background, Chromosome Evolution in Primates.pptx
Cytotaxonomy(Its History,Background, Chromosome Evolution in Primates.pptxIshtiyaqMir3
 
Introduction to Genetics
Introduction to GeneticsIntroduction to Genetics
Introduction to GeneticsCEU
 
presentation on E.coli
presentation on E.colipresentation on E.coli
presentation on E.coliVandana singh
 
Comparative genomics.pdf
Comparative genomics.pdfComparative genomics.pdf
Comparative genomics.pdfshinycthomas
 
Detection of genomic homology in eukaryotic genomes
Detection of genomic homology in eukaryotic genomesDetection of genomic homology in eukaryotic genomes
Detection of genomic homology in eukaryotic genomesKlaas Vandepoele
 
Mapping the genome of bacteria
Mapping the genome of bacteriaMapping the genome of bacteria
Mapping the genome of bacteriaMeisam Ruzbahani
 
Chapter 3 chromosomal basis of inheritance
Chapter 3 chromosomal basis of inheritanceChapter 3 chromosomal basis of inheritance
Chapter 3 chromosomal basis of inheritanceMosesPackiaraj2
 
Microbial Phylogenomics (EVE161) Class 16: Shotgun Metagenomics
Microbial Phylogenomics (EVE161) Class 16: Shotgun MetagenomicsMicrobial Phylogenomics (EVE161) Class 16: Shotgun Metagenomics
Microbial Phylogenomics (EVE161) Class 16: Shotgun MetagenomicsJonathan Eisen
 
EVE 161 Winter 2018 Class 15
EVE 161 Winter 2018 Class 15EVE 161 Winter 2018 Class 15
EVE 161 Winter 2018 Class 15Jonathan Eisen
 

Similar a Evidence for symmetric chromosomal inversions around the replication origin in bacteria (20)

BITS - Introduction to comparative genomics
BITS - Introduction to comparative genomicsBITS - Introduction to comparative genomics
BITS - Introduction to comparative genomics
 
Introduction to Genetics
Introduction to GeneticsIntroduction to Genetics
Introduction to Genetics
 
Cytotaxonomy(Its History,Background, Chromosome Evolution in Primates.pptx
Cytotaxonomy(Its History,Background, Chromosome Evolution in Primates.pptxCytotaxonomy(Its History,Background, Chromosome Evolution in Primates.pptx
Cytotaxonomy(Its History,Background, Chromosome Evolution in Primates.pptx
 
Genomic variation
Genomic variationGenomic variation
Genomic variation
 
Introduction to Genetics
Introduction to GeneticsIntroduction to Genetics
Introduction to Genetics
 
presentation on E.coli
presentation on E.colipresentation on E.coli
presentation on E.coli
 
Genome <imran>
Genome <imran>Genome <imran>
Genome <imran>
 
Gene order
Gene orderGene order
Gene order
 
Comparative genomics.pdf
Comparative genomics.pdfComparative genomics.pdf
Comparative genomics.pdf
 
Nature Of Gene.pdf
Nature Of Gene.pdfNature Of Gene.pdf
Nature Of Gene.pdf
 
Nature Of Gene.pdf
Nature Of Gene.pdfNature Of Gene.pdf
Nature Of Gene.pdf
 
Detection of genomic homology in eukaryotic genomes
Detection of genomic homology in eukaryotic genomesDetection of genomic homology in eukaryotic genomes
Detection of genomic homology in eukaryotic genomes
 
Mapping the genome of bacteria
Mapping the genome of bacteriaMapping the genome of bacteria
Mapping the genome of bacteria
 
Chapter 3 chromosomal basis of inheritance
Chapter 3 chromosomal basis of inheritanceChapter 3 chromosomal basis of inheritance
Chapter 3 chromosomal basis of inheritance
 
B27 the human genome
 B27 the human genome B27 the human genome
B27 the human genome
 
Microbial Phylogenomics (EVE161) Class 16: Shotgun Metagenomics
Microbial Phylogenomics (EVE161) Class 16: Shotgun MetagenomicsMicrobial Phylogenomics (EVE161) Class 16: Shotgun Metagenomics
Microbial Phylogenomics (EVE161) Class 16: Shotgun Metagenomics
 
EVE 161 Winter 2018 Class 15
EVE 161 Winter 2018 Class 15EVE 161 Winter 2018 Class 15
EVE 161 Winter 2018 Class 15
 
linkage and crossing over
linkage and crossing overlinkage and crossing over
linkage and crossing over
 
Biology Literature Review Example
Biology Literature Review ExampleBiology Literature Review Example
Biology Literature Review Example
 
Biology Literature Review Example
Biology Literature Review ExampleBiology Literature Review Example
Biology Literature Review Example
 

Más de Jonathan Eisen

Eisen.CentralValley2024.pdf
Eisen.CentralValley2024.pdfEisen.CentralValley2024.pdf
Eisen.CentralValley2024.pdfJonathan Eisen
 
Phylogenomics and the Diversity and Diversification of Microbes
Phylogenomics and the Diversity and Diversification of MicrobesPhylogenomics and the Diversity and Diversification of Microbes
Phylogenomics and the Diversity and Diversification of MicrobesJonathan Eisen
 
Talk by Jonathan Eisen for LAMG2022 meeting
Talk by Jonathan Eisen for LAMG2022 meetingTalk by Jonathan Eisen for LAMG2022 meeting
Talk by Jonathan Eisen for LAMG2022 meetingJonathan Eisen
 
Thoughts on UC Davis' COVID Current Actions
Thoughts on UC Davis' COVID Current ActionsThoughts on UC Davis' COVID Current Actions
Thoughts on UC Davis' COVID Current ActionsJonathan Eisen
 
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...Jonathan Eisen
 
A Field Guide to Sars-CoV-2
A Field Guide to Sars-CoV-2A Field Guide to Sars-CoV-2
A Field Guide to Sars-CoV-2Jonathan Eisen
 
EVE198 Summer Session Class 4
EVE198 Summer Session Class 4EVE198 Summer Session Class 4
EVE198 Summer Session Class 4Jonathan Eisen
 
EVE198 Summer Session 2 Class 1
EVE198 Summer Session 2 Class 1 EVE198 Summer Session 2 Class 1
EVE198 Summer Session 2 Class 1 Jonathan Eisen
 
EVE198 Summer Session 2 Class 2 Vaccines
EVE198 Summer Session 2 Class 2 Vaccines EVE198 Summer Session 2 Class 2 Vaccines
EVE198 Summer Session 2 Class 2 Vaccines Jonathan Eisen
 
EVE198 Spring2021 Class1 Introduction
EVE198 Spring2021 Class1 IntroductionEVE198 Spring2021 Class1 Introduction
EVE198 Spring2021 Class1 IntroductionJonathan Eisen
 
EVE198 Spring2021 Class2
EVE198 Spring2021 Class2EVE198 Spring2021 Class2
EVE198 Spring2021 Class2Jonathan Eisen
 
EVE198 Spring2021 Class5 Vaccines
EVE198 Spring2021 Class5 VaccinesEVE198 Spring2021 Class5 Vaccines
EVE198 Spring2021 Class5 VaccinesJonathan Eisen
 
EVE198 Winter2020 Class 8 - COVID RNA Detection
EVE198 Winter2020 Class 8 - COVID RNA DetectionEVE198 Winter2020 Class 8 - COVID RNA Detection
EVE198 Winter2020 Class 8 - COVID RNA DetectionJonathan Eisen
 
EVE198 Winter2020 Class 1 Introduction
EVE198 Winter2020 Class 1 IntroductionEVE198 Winter2020 Class 1 Introduction
EVE198 Winter2020 Class 1 IntroductionJonathan Eisen
 
EVE198 Winter2020 Class 3 - COVID Testing
EVE198 Winter2020 Class 3 - COVID TestingEVE198 Winter2020 Class 3 - COVID Testing
EVE198 Winter2020 Class 3 - COVID TestingJonathan Eisen
 
EVE198 Winter2020 Class 5 - COVID Vaccines
EVE198 Winter2020 Class 5 - COVID VaccinesEVE198 Winter2020 Class 5 - COVID Vaccines
EVE198 Winter2020 Class 5 - COVID VaccinesJonathan Eisen
 
EVE198 Winter2020 Class 9 - COVID Transmission
EVE198 Winter2020 Class 9 - COVID TransmissionEVE198 Winter2020 Class 9 - COVID Transmission
EVE198 Winter2020 Class 9 - COVID TransmissionJonathan Eisen
 
EVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
EVE198 Fall2020 "Covid Mass Testing" Class 8 VaccinesEVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
EVE198 Fall2020 "Covid Mass Testing" Class 8 VaccinesJonathan Eisen
 
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and TestingEVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and TestingJonathan Eisen
 
EVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
EVE198 Fall2020 "Covid Mass Testing" Class 1 IntroductionEVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
EVE198 Fall2020 "Covid Mass Testing" Class 1 IntroductionJonathan Eisen
 

Más de Jonathan Eisen (20)

Eisen.CentralValley2024.pdf
Eisen.CentralValley2024.pdfEisen.CentralValley2024.pdf
Eisen.CentralValley2024.pdf
 
Phylogenomics and the Diversity and Diversification of Microbes
Phylogenomics and the Diversity and Diversification of MicrobesPhylogenomics and the Diversity and Diversification of Microbes
Phylogenomics and the Diversity and Diversification of Microbes
 
Talk by Jonathan Eisen for LAMG2022 meeting
Talk by Jonathan Eisen for LAMG2022 meetingTalk by Jonathan Eisen for LAMG2022 meeting
Talk by Jonathan Eisen for LAMG2022 meeting
 
Thoughts on UC Davis' COVID Current Actions
Thoughts on UC Davis' COVID Current ActionsThoughts on UC Davis' COVID Current Actions
Thoughts on UC Davis' COVID Current Actions
 
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
 
A Field Guide to Sars-CoV-2
A Field Guide to Sars-CoV-2A Field Guide to Sars-CoV-2
A Field Guide to Sars-CoV-2
 
EVE198 Summer Session Class 4
EVE198 Summer Session Class 4EVE198 Summer Session Class 4
EVE198 Summer Session Class 4
 
EVE198 Summer Session 2 Class 1
EVE198 Summer Session 2 Class 1 EVE198 Summer Session 2 Class 1
EVE198 Summer Session 2 Class 1
 
EVE198 Summer Session 2 Class 2 Vaccines
EVE198 Summer Session 2 Class 2 Vaccines EVE198 Summer Session 2 Class 2 Vaccines
EVE198 Summer Session 2 Class 2 Vaccines
 
EVE198 Spring2021 Class1 Introduction
EVE198 Spring2021 Class1 IntroductionEVE198 Spring2021 Class1 Introduction
EVE198 Spring2021 Class1 Introduction
 
EVE198 Spring2021 Class2
EVE198 Spring2021 Class2EVE198 Spring2021 Class2
EVE198 Spring2021 Class2
 
EVE198 Spring2021 Class5 Vaccines
EVE198 Spring2021 Class5 VaccinesEVE198 Spring2021 Class5 Vaccines
EVE198 Spring2021 Class5 Vaccines
 
EVE198 Winter2020 Class 8 - COVID RNA Detection
EVE198 Winter2020 Class 8 - COVID RNA DetectionEVE198 Winter2020 Class 8 - COVID RNA Detection
EVE198 Winter2020 Class 8 - COVID RNA Detection
 
EVE198 Winter2020 Class 1 Introduction
EVE198 Winter2020 Class 1 IntroductionEVE198 Winter2020 Class 1 Introduction
EVE198 Winter2020 Class 1 Introduction
 
EVE198 Winter2020 Class 3 - COVID Testing
EVE198 Winter2020 Class 3 - COVID TestingEVE198 Winter2020 Class 3 - COVID Testing
EVE198 Winter2020 Class 3 - COVID Testing
 
EVE198 Winter2020 Class 5 - COVID Vaccines
EVE198 Winter2020 Class 5 - COVID VaccinesEVE198 Winter2020 Class 5 - COVID Vaccines
EVE198 Winter2020 Class 5 - COVID Vaccines
 
EVE198 Winter2020 Class 9 - COVID Transmission
EVE198 Winter2020 Class 9 - COVID TransmissionEVE198 Winter2020 Class 9 - COVID Transmission
EVE198 Winter2020 Class 9 - COVID Transmission
 
EVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
EVE198 Fall2020 "Covid Mass Testing" Class 8 VaccinesEVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
EVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
 
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and TestingEVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
 
EVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
EVE198 Fall2020 "Covid Mass Testing" Class 1 IntroductionEVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
EVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
 

Último

(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...parulsinha
 
Coimbatore Call Girls in Coimbatore 7427069034 genuine Escort Service Girl 10...
Coimbatore Call Girls in Coimbatore 7427069034 genuine Escort Service Girl 10...Coimbatore Call Girls in Coimbatore 7427069034 genuine Escort Service Girl 10...
Coimbatore Call Girls in Coimbatore 7427069034 genuine Escort Service Girl 10...chennailover
 
Call Girls Hosur Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Hosur Just Call 9630942363 Top Class Call Girl Service AvailableCall Girls Hosur Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Hosur Just Call 9630942363 Top Class Call Girl Service AvailableGENUINE ESCORT AGENCY
 
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...Sheetaleventcompany
 
Most Beautiful Call Girl in Bangalore Contact on Whatsapp
Most Beautiful Call Girl in Bangalore Contact on WhatsappMost Beautiful Call Girl in Bangalore Contact on Whatsapp
Most Beautiful Call Girl in Bangalore Contact on WhatsappInaaya Sharma
 
Call Girls Service Jaipur {8445551418} ❤️VVIP BHAWNA Call Girl in Jaipur Raja...
Call Girls Service Jaipur {8445551418} ❤️VVIP BHAWNA Call Girl in Jaipur Raja...Call Girls Service Jaipur {8445551418} ❤️VVIP BHAWNA Call Girl in Jaipur Raja...
Call Girls Service Jaipur {8445551418} ❤️VVIP BHAWNA Call Girl in Jaipur Raja...parulsinha
 
Top Rated Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...
Top Rated  Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...Top Rated  Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...
Top Rated Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...chandars293
 
Call Girls Rishikesh Just Call 9667172968 Top Class Call Girl Service Available
Call Girls Rishikesh Just Call 9667172968 Top Class Call Girl Service AvailableCall Girls Rishikesh Just Call 9667172968 Top Class Call Girl Service Available
Call Girls Rishikesh Just Call 9667172968 Top Class Call Girl Service Availableperfect solution
 
Mumbai ] (Call Girls) in Mumbai 10k @ I'm VIP Independent Escorts Girls 98333...
Mumbai ] (Call Girls) in Mumbai 10k @ I'm VIP Independent Escorts Girls 98333...Mumbai ] (Call Girls) in Mumbai 10k @ I'm VIP Independent Escorts Girls 98333...
Mumbai ] (Call Girls) in Mumbai 10k @ I'm VIP Independent Escorts Girls 98333...Ishani Gupta
 
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...mahaiklolahd
 
Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...
Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...
Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...GENUINE ESCORT AGENCY
 
Call Girls Hyderabad Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Hyderabad Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Hyderabad Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Hyderabad Just Call 8250077686 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426jennyeacort
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...aartirawatdelhi
 
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...chandars293
 
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...Arohi Goyal
 
Call Girls Raipur Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Raipur Just Call 9630942363 Top Class Call Girl Service AvailableCall Girls Raipur Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Raipur Just Call 9630942363 Top Class Call Girl Service AvailableGENUINE ESCORT AGENCY
 
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...GENUINE ESCORT AGENCY
 
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...Anamika Rawat
 

Último (20)

(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
 
Coimbatore Call Girls in Coimbatore 7427069034 genuine Escort Service Girl 10...
Coimbatore Call Girls in Coimbatore 7427069034 genuine Escort Service Girl 10...Coimbatore Call Girls in Coimbatore 7427069034 genuine Escort Service Girl 10...
Coimbatore Call Girls in Coimbatore 7427069034 genuine Escort Service Girl 10...
 
Call Girls Hosur Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Hosur Just Call 9630942363 Top Class Call Girl Service AvailableCall Girls Hosur Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Hosur Just Call 9630942363 Top Class Call Girl Service Available
 
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
 
Most Beautiful Call Girl in Bangalore Contact on Whatsapp
Most Beautiful Call Girl in Bangalore Contact on WhatsappMost Beautiful Call Girl in Bangalore Contact on Whatsapp
Most Beautiful Call Girl in Bangalore Contact on Whatsapp
 
Call Girls Service Jaipur {8445551418} ❤️VVIP BHAWNA Call Girl in Jaipur Raja...
Call Girls Service Jaipur {8445551418} ❤️VVIP BHAWNA Call Girl in Jaipur Raja...Call Girls Service Jaipur {8445551418} ❤️VVIP BHAWNA Call Girl in Jaipur Raja...
Call Girls Service Jaipur {8445551418} ❤️VVIP BHAWNA Call Girl in Jaipur Raja...
 
Top Rated Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...
Top Rated  Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...Top Rated  Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...
Top Rated Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...
 
Call Girls Rishikesh Just Call 9667172968 Top Class Call Girl Service Available
Call Girls Rishikesh Just Call 9667172968 Top Class Call Girl Service AvailableCall Girls Rishikesh Just Call 9667172968 Top Class Call Girl Service Available
Call Girls Rishikesh Just Call 9667172968 Top Class Call Girl Service Available
 
Mumbai ] (Call Girls) in Mumbai 10k @ I'm VIP Independent Escorts Girls 98333...
Mumbai ] (Call Girls) in Mumbai 10k @ I'm VIP Independent Escorts Girls 98333...Mumbai ] (Call Girls) in Mumbai 10k @ I'm VIP Independent Escorts Girls 98333...
Mumbai ] (Call Girls) in Mumbai 10k @ I'm VIP Independent Escorts Girls 98333...
 
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
 
Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...
Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...
Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...
 
Call Girls Hyderabad Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Hyderabad Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Hyderabad Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Hyderabad Just Call 8250077686 Top Class Call Girl Service Available
 
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
 
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...
 
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
 
🌹Attapur⬅️ Vip Call Girls Hyderabad 📱9352852248 Book Well Trand Call Girls In...
🌹Attapur⬅️ Vip Call Girls Hyderabad 📱9352852248 Book Well Trand Call Girls In...🌹Attapur⬅️ Vip Call Girls Hyderabad 📱9352852248 Book Well Trand Call Girls In...
🌹Attapur⬅️ Vip Call Girls Hyderabad 📱9352852248 Book Well Trand Call Girls In...
 
Call Girls Raipur Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Raipur Just Call 9630942363 Top Class Call Girl Service AvailableCall Girls Raipur Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Raipur Just Call 9630942363 Top Class Call Girl Service Available
 
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
 
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
 

Evidence for symmetric chromosomal inversions around the replication origin in bacteria

  • 1. http://genomebiology.com/2000/1/6/research/0011.1 Research Evidence for symmetric chromosomal inversions around the comment replication origin in bacteria Jonathan A Eisen, John F Heidelberg, Owen White and Steven L Salzberg Address: The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA. Correspondence: Jonathan A Eisen. E-mail: jeisen@tigr.org reviews Published: 4 December 2000 Received: 7 August 2000 Revised: 25 September 2000 Genome Biology 2000, 1(6):research0011.1–0011.9 Accepted: 19 October 2000 The electronic version of this article is the complete one and can be found online at http://genomebiology.com/2000/1/6/research/0011 © GenomeBiology.com (Print ISSN 1465-6906; Online ISSN 1465-6914) reports Abstract Background: Whole-genome comparisons can provide great insight into many aspects of biology. Until recently, however, comparisons were mainly possible only between distantly related species. Complete genome sequences are now becoming available from multiple sets of closely related strains or species. deposited research Results: By comparing the recently completed genome sequences of Vibrio cholerae, Streptococcus pneumoniae and Mycobacterium tuberculosis to those of closely related species - Escherichia coli, Streptococcus pyogenes and Mycobacterium leprae, respectively - we have identified an unusual and previously unobserved feature of bacterial genome structure. Scatterplots of the conserved sequences (both DNA and protein) between each pair of species produce a distinct X-shaped pattern, which we call an X-alignment. The key feature of these alignments is that they have symmetry around the replication origin and terminus; that is, the distance of a particular conserved feature (DNA or protein) from the replication origin (or terminus) is conserved between closely refereed research related pairs of species. Statistically significant X-alignments are also found within some genomes, indicating that there is symmetry about the replication origin for paralogous features as well. Conclusions: The most likely mechanism of generation of X-alignments involves large chromosomal inversions that reverse the genomic sequence symmetrically around the origin of replication. The finding of these X-alignments between many pairs of species suggests that chromosomal inversions around the origin are a common feature of bacterial genome evolution. interactions Background ability to detect large-scale genomic changes is dependent Large-scale genomic rearrangements and duplications are in large part on which genomes are available. Such studies important in the evolution of species. Previously, these in bacteria, for example, have been limited by the availabil- large-scale genome-changing events were studied through ity of genomes only from distantly related sets of species. genetic or cytological studies. With the availability of many Recently, however, the genomes of sets of closely related complete genome sequences it is now possible to study such bacterial species have become available. We have compared information events through comparative genomics. The publication of these closely related bacterial genomes and have discovered the yeast genome has led to much better insight into the an unusual phenomenon - alignments of whole genomes duplication events that have occurred in fungal and eukary- that show an X-shaped pattern (which we refer to as otic evolution (for example, see [1]). Large chromosomal X-alignments). Here we present the evidence for these duplications have also been found from analysis of com- X-alignments and discuss mechanisms that might have pro- pleted chromosomes of Arabidopsis thaliana [2,3]. The duced them.
  • 2. 2 Genome Biology Vol 1 No 6 Eisen et al. Results and discussion at the same distance from the origin but not necessarily on Whole-genome X-alignments between species at the the same side of the origin. The X-alignment between DNA level V. cholerae and E. coli was found to be statistically signifi- We compared the DNA sequences of the two chromosomes cant using a test based on the number of matches found in of Vibrio cholerae [4] with the sequence of the Escherichia diagonal strips in the alignment (see the Materials and coli chromosome [5] using a suffix tree alignment algorithm methods section). Specifically, when V. cholerae chrI is [6]. The analysis revealed a significant alignment at the DNA aligned in the forward direction against E. coli, there are 459 level between the V. cholerae large chromosome (chrI) [4] maximal unique matching subsequences (MUMs; see the and the E. coli chromosome [5] spanning the entire length of Materials and methods section), of which 177 occurred in a these chromosomes (Figure 1a). Analysis of the reverse com- diagonal strip covering 10% of the total area (compared to plement of V. cholerae chrI with E. coli also produced a sig- the expected value of 46). The probability of observing this nificant alignment (Figure 1b). When superimposed, the two high a number of MUMs by chance is 4.7 x 10–59. The align- alignments produce a clear ‘X’ shape (Figure 1c) that is sym- ment of V. cholerae chrI in the reverse direction against E. coli metric about the origin of replication of both genomes. This (which corresponds to the MUMs on the anti-diagonal) has a symmetry indicates that matching sequences tend to occur probability of 1.8 x 10–90. As a control, we compared the (a) (b) (c) 5,000,000 5,000,000 5,000,000 4,000,000 4,000,000 4,000,000 3,000,000 3,000,000 3,000,000 E. coli E. coli E. coli 2,000,000 2,000,000 2,000,000 1,000,000 1,000,000 1,000,000 0 0 0 0 1,000,000 2,000,000 3,000,000 0 1,000,000 2,000,000 3,000,000 0 1,000,000 2,000,000 3,000,000 V. cholerae chromosome I V. cholerae chromosome I reverse V. cholerae chromosome I (d) (e) 2,000,000 4,000,000 1,500,000 3,000,000 M. tuberculosis S. pyogenes 1,000,000 2,000,000 500,000 1,000,000 0 0 0 1,000,000 2,000,000 3,000,000 0 5,000,00 1,000,000 1,500,000 2,000,000 S. pneumoniae M. leprae Figure 1 Between-species whole-genome DNA alignments. Plots of maximally unique matching subsequences (MUMs) between genomes as identified by the MUMmer program. (a) V. cholerae chrI forward strand versus E. coli forward strand. (b) E. coli forward versus V. cholerae chrI reverse. (c) V. cholerae chrI versus E. coli, forward and reverse overlaid. (d) S. pneumoniae forward versus S. pyogenes forward and reverse overlaid. (e) M. tuberculosis forward versus M. leprae forward and reverse overlaid. A point ( x,y) indicates a DNA sequence that occurs once within each genome, at location x in one genome and at location y in the other genome. The matching sequences may occur on either the forward or the reverse strand; in either case, the locations indicate the 5’ end of the sequences. The point (0,0) corresponds to the origin of replication for each genome.
  • 3. http://genomebiology.com/2000/1/6/research/0011.3 Table 1 species (see the Materials and methods section). Figure 2a shows a scatterplot of chromosome positions of all proteins Whole-genome DNA alignments using MUMmer homologous between V. cholerae chrI and E. coli. The pres- comment Organism 1 Organism 2 Total Matches Probability* ence of many large gene families causes a great deal of noise MUMs within in this comparison. This noise can be reduced by considering diagonal only the best matching homolog for each open reading frame (10%) (ORF), rather than all protein homologs (Figure 2b). This fil- V. cholerae E. coli 459 177 4.7 x 10-59 tered protein comparison results in an X-alignment that is statistically significant (Table 2). V. cholerae (rev) E. coli 467 217 1.8 x 10-90 V. cholerae (rev) V. cholerae 342 86 8.2 x 10-16 Whole-genome X-alignments within species reviews E. coli (rev) E. coli 1,128 225 1.5 x 10-23 The finding of the X-alignment pattern between species led S. pyogenes S. pneumoniae 706 259 4.5 x 10-80 us to search for similar patterns within species; that is, global alignments of a genome with its own reverse complement. Of S. pyogenes (rev) S. pneumoniae 626 255 2.3 x 10-90 the genomes for which we found between-species X-align- S. pyogenes (rev) S. pyogenes 367 96 1.1 x10-18 ments (M. tuberculosis, M. leprae, S. pyogenes, S. pneumo- S. pneumoniae (rev) S. pneumoniae 1,054 154 1.5 x 10-6 niae, E. coli and V. cholerae), statistically significant self-alignments are detected for all except M. tuberculosis M. leprae (rev) M. leprae 449 89 3.5 x 10-10 (Figure 3; probabilities shown in Table 1). Interestingly, these reports M. tuberculosis (rev) M. tuberculosis 2,476 268 0.092 self-alignments are not as strong as those between species. E. coli M. tuberculosis 81 13 0.06 Proteome analysis also shows an X-alignment within species E. coli (rev) M. tuberculosis 70 5 0.84 (shown for V. cholerae chrI in Figure 2d; probabilities shown in Table 2). The X-alignment of proteins within V. cholerae *Statistical significance was estimated as described in the text; rev, chrI is statistically significant only for recently duplicated- reverse complement sequence. genes, but disappears when all paralogs are included. The importance of filtering for recent duplications is discussed deposited research genomes of distantly related species, such as E. coli and below. Mycobacterium tuberculosis. These do not show a signifi- cant X-alignment (Table 1). Model I: whole-genome inverted duplications One possible explanation for an X-alignment within and We have found that X-alignments of whole genomes are not between species is an ancestral inverted duplication of the limited to the V. cholerae versus E. coli comparison. For whole genome, as has been suggested for E. coli [10]. The example, a whole-genome comparison of two bacteria in the weak or missing X-alignment within species could be genus Streptococcus - S. pyogenes [7] and S. pneumoniae refereed research (H. Tettelin, personal communication) - reveals a global Table 2 X-alignment similar to that of V. cholerae versus E. coli Whole-genome protein-level comparisons (Figure 1d) which is also statistically significant (Table 1). In addition, an X-alignment is found between two species in Organism 1 Organism 2 Total Matches Probability* the genus Mycobacterium - M. tuberculosis [8] and matches within 10% M. leprae [9] (Figure 1e) - as well as between two strains of diagonal Helicobacter pylori (data not shown). The X-alignments observed between any two pairs of genomes are not identical Top matches in every aspect. For example, in the alignment between the interactions V. cholerae E. coli 1,797 369 3.2 x 10-40 two Mycobacterium species, each conserved region is much V. cholerae (rev) E. coli 1,797 441 2.3 x 10-70 longer than in the other genome pairs. We believe this is due to different numbers of evolutionary events between the V. cholerae V. cholerae 701 145 3.6 x 10-17 species (see below). Whole-genome X-alignments were not V. cholerae (rev) V. cholerae 701 70 0.52 found between any other pairs of species, although a related E. coli E. coli 1,985 286 3.6 x 10-10 pattern was seen between some of the chlamydial species (see below). E. coli (rev) E. coli 1,985 210 0.20 information Recent duplications† Whole-genome X-alignments between species are V. cholerae V. cholerae 195 60 1.0 x 10-15 also found at the proteome level V. cholerae (rev) V. cholerae 195 26 8.4 x 10-4 To test whether the X-alignments found in the DNA analysis could also be found at the level of whole proteomes, we con- *Statistical significance was estimated as described in the text. †Best ducted comparisons of homologous proteins between match to another V. cholerae ORF versus any other complete genome.
  • 4. 4 Genome Biology Vol 1 No 6 Eisen et al. (a) (b) 4,000,000 4,000,000 3,000,000 3,000,000 E. coli E. coli 2,000,000 2,000,000 1,000,000 1,000,000 0 0 0 1,000,000 2,000,000 3,000,000 0 1,000,000 2,000,000 3,000,000 V. cholerae V. cholerae (c) (d) 3,000,000 1,000,000 2,000,000 C. pneumoniae V. cholerae T R 500,000 1,000,000 0 0 0 500,000 1,000,000 0 1,000,000 2,000,000 3,000,000 C. trachomatis V. cholerae Figure 2 Whole-genome proteome alignments. Plots show the chromosome locations of pairs of predicted proteins that have significant similarity (on the basis of fasta3 comparisons). (a) V. cholerae chrI versus E. coli. All significant matches for each V. cholerae ORF are shown. (b) V. cholerae chrI versus E. coli, top matches. Only the best match for each V. cholerae ORF is shown. The filtering for top matches for each ORF removes noise due to the presence of many large multigene families. This X-alignment is highly statistically significant (Table 2). (c) Chlamydia trachomatis versus C. pneumoniae, top matches. Only the best match for each C. trachomatis ORF is shown. The position of the origins (R) and termini (T) of replication are slightly shifted to see the inversions better. This pattern is consistent with the occurrence a small number of inversions around the origin and terminus in the two lineages since their divergence from a common ancestor (see Figure 4). (d) V. cholerae chrI versus V. cholerae chrI, self-alignment. Only recently duplicated pairs of genes are shown. Recent duplications were operationally defined as those genes that were more similar to another gene in V. cholerae than to any gene in any other complete genome sequence. The faint X-alignment is statistically significant (Table 2). No significant X-alignment was detected when all pairs of paralogs were included. explained by gene loss of one of the two duplicates of many between species, the member of the gene pair lost in a par- of the pairs of genes in the different lineages. Gene loss has ticular lineage should be essentially random. If an ancient been found to follow large chromosomal or genome duplica- inverted duplication followed by differential gene loss is the tions [11-13]. This gene loss is thought to stabilize large correct explanation for the observed X-alignments, one duplications by preventing recombination events between would expect the genes along one diagonal to be orthologous duplicate genes. If gene loss is responsible for the weak between species (related to each other by the speciation X-alignment within species, then to maintain the X-alignments event), while the genes along the other diagonal should be
  • 5. http://genomebiology.com/2000/1/6/research/0011.5 (a) 3,000,000 (b) 4,000,000 comment 2,000,000 3,000,000 V. cholerae E. coli 2,000,000 1,000,000 1,000,000 reviews 0 0 0 1,000,000 2,000,000 3,000,000 0 1,000,000 2,000,000 3,000,000 4,000,000 V. cholerae chromosome I E. coli (c) 2,000,000 (d) reports 2,000,000 1,500,000 1,500,000 S. pneumoniae S. pyogenes 1,000,000 1,000,000 deposited research 500,000 500,000 0 0 0 500,000 1,000,000 1,500,000 2,000,000 0 500,000 1,000,000 1,500,000 2,000,000 refereed research S. pyogenes S. pneumoniae Figure 3 Within-genome DNA alignments. Plots of exactly matching sequences within four genomes as identified by the MUMmer program: (a) V. cholerae; (b) E. coli; (c) S. pyogenes; (d) S. pneumoniae. A point (x,y) indicates a DNA sequence that is repeated within the genome, occurring once at location x and again at location y. Points near the diagonal (y = x) correspond to tandem repeats. Points near the anti-diagonal (y = L -x, where L is genome length) correspond to repeats that occur at symmetric locations about the origin of location. The point (0,0) corresponds to the origin of replication in each plot. Statistically significant X-alignments occur in all four species (Table 1). interactions paralogous (related to each other by the genome duplication possible mechanisms for such movement, but we believe the event before the speciation of the two lineages). However, most likely explanation is the occurrence of large chromoso- the evidence appears to contradict this model: likely ortholo- mal inversions that pivot around the replication origin gous gene pairs are equally distributed on each diagonal and/or terminus. Large chromosomal inversions, including (data not shown). those that occur around the replication origin and terminus, have been shown to occur in E. coli and Salmonella information Model II: chromosomal inversions about the origin typhimurium in the laboratory (see, for example, [14-18]). and/or terminus The occurrence of such inversions over evolutionary time A second possible explanation for the X-alignments is that an scales was first suggested by comparative analysis of the com- underlying mechanism allows sections of DNA to move plete genomes of four strains in the genus Chlamydia [19]. In within the genome but maintains the distance of these sec- that study, we found that the major chromosomal differ- tions from the origin and/or terminus. There are a variety of ences between C. pneumoniae and C. trachomatis (shown in
  • 6. 6 Genome Biology Vol 1 No 6 Eisen et al. A2 A3 31 32 1 2 31 32 1 2 2 1 32 31 30 3 30 3 3 30 29 4 29 4 4 29 28 5 28 5 5 28 27 6 27 6 A2 * 27 * 6 26 7 A1 26 7 26 7 25 24 A1 8 9 25 24 A2 8 9 25 24 A3 8 9 23 10 23 10 23 10 22 11 22 11 22 11 21 20 12 13 Inversion 21 20 12 13 Inversion 21 20 12 13 19 14 14 19 14 19 18 17 16 15 around * 15 16 17 18 * around 15 16 17 18 terminus (*) origin (*) 32 1 2 30 31 3 29 4 28 5 27 6 26 Common 7 25 24 ancestor of 8 9 B1 B2 B3 23 10 22 A and B 11 21 12 20 13 19 14 18 17 16 15 A1 A2 A3 30 31 32 1 2 3 Inversion 30 31 32 1 2 3 Inversion 3 2 1 32 31 30 29 4 29 4 * 29 4* 28 5 around 28 5 around 28 5 27 6 27 6 27 6 26 7 terminus (*) 26 7 origin (*) 26 7 25 8 * 8 * 25 8 25 24 23 B1 10 9 9 10 B2 24 23 9 10 B3 24 23 22 11 11 22 11 22 21 12 12 21 12 21 20 13 13 20 13 20 19 14 14 19 14 19 18 17 16 15 15 16 17 18 15 16 17 18 B2 B3 B1 B2 Figure 4 Schematic model of genome inversions. The model shows an initial speciation event, followed by a series of inversions in the different lineages (A and B). Inversions occur between the asterisks (*). Numbers on the chromosome refer to hypothetical genes 1-32. At time point 1, the genomes of the two species are still co-linear (as indicated in the scatterplot of A1 versus B1). Between time point 1 and time point 2, each species (A and B) undergoes a large inversion about the terminus (as indicated in the scatterplots of A1 versus A2 and B1 versus B2). This results in the between-species scatterplot looking as if there have been two nested inversions (A2 versus B2), similar to that seen for C. trachomatis versus C. pneumoniae (see Figure 2). Between time point 2 and time point 3 each species undergoes an additional inversion (as indicated in the scatterplots of B2 versus B3 and A2 versus A3). This results in the between-species scatterplots beginning to resemble an X-alignment, similar to that seen in M. tuberculosis versus M. leprae (see Figure 2). Figure 2c) were consistent with the occurrence of large could produce patterns very similar to those seen in the inversions that pivoted around the origin and terminus Chlamydia, Mycobacterium and Helicobacter comparisons. (including multiple inversions of different sizes). In Figure 4 The continued occurrence of such inversion over longer time we present a hypothetical model showing how a small scales would result in an X-alignment similar to that seen in number of inversions centered around the origin or terminus the V. cholerae versus E. coli and S. pneumoniae versus
  • 7. http://genomebiology.com/2000/1/6/research/0011.7 S. pyogenes comparisons. Thus the different between- within the bacterial chromosome that inverts the sequence species X-alignments could be the result of different from the origin up to that point. As with inversion events, numbers of inversions between particular pairs of species. recombination and replication have been found to be tightly comment coupled [25]. Inversions about the origin and terminus could also produce an X-alignment within species, through the splitting of tandemly duplicated sequence. Many sets of tandemly dupli- Conclusions cated genes are found in most bacterial genomes [19,20] We present here a novel observation regarding the conserva- (also see Figure 3a,c). As tandem duplications are inherently tion between bacterial species of the distance of particular unstable (one of the duplicates can be rapidly eliminated by genes from the replication origin or terminus. The initial slippage and/or recombination events [21]), the fact that observation was only possible due to the availability of com- reviews many tandem pairs are present within each genome suggests plete genome sequences from pairs of moderately closely that tandem duplications occur frequently. Thus, it is rea- related species (for example, V. cholerae and E. coli). This sonable to assume that occasionally a large inversion will shows the importance of having genome pairs from many split a pair of tandemly duplicated genes. An inversion that levels of evolutionary relatedness. Comparisons of distantly pivots about the origin and also splits a tandem duplication related species enable the determination of universal fea- will result in a pair of paralogous genes spaced symmetri- tures of life as well as of events that occur very rarely. Com- cally on opposite sides of the origin. parison of very closely related species allows the identification of frequent events such as transitional changes reports If our inversion model is correct, then the genes along both at third codon positions or tandem duplications. To eluci- diagonals in the between-species alignments should be date all other events in the history of life, genome pairs cov- orthologous, which is the case (see above). In contrast, genes ering all the intermediate levels of evolutionary relatedness along the anti-diagonal in the within-species X-alignments will be needed. should be recent tandem duplicates that have been sepa- rated by inversions. This also appears to be the case - in the within-species analysis of V. cholerae chrI ORFs, the deposited research Materials and methods X-alignment shows up best when only recent duplicates are Genomes analyzed analyzed (Figure 2d). The splitting of tandem duplicates by Complete published genome sequences were obtained from the inversions may be a general mechanism to stabilize the coex- National Center for Biotechnology Information website [26] or istence of duplicated genes, as it will prevent their elimina- from the TIGR Comprehensive Microbial Resource [27]. These tion by unequal crossing-over or replication slippage events. included Aeropyrum pernix [28], Aquifex aeolicus [29], Archaeoglobus fulgidus [30], Bacillus subtilis [31], Borrelia What could cause inversions that pivot around the origin burgdorferi [32], Campylobacter jejuni [33], Chlamydia and terminus of the genome to occur more frequently than pneumoniae AR39 [19], Chlamydia pneumoniae CWL029 refereed research other inversions? One possibility is that many inversions [34], Chlamydia trachomatis (D/UW-3/Cx) [35], Chlamydia occur, but there is selection against those that change the trachomatis MoPn [19], Deinococcus radiodurans [36], distance of a gene from the origin or terminus. Such a possi- Escherichia coli [5], Haemophilus influenzae [37], Helicobac- bility has been suggested by experimental work in E. coli ter pylori [38], Helicobacter pylori J99 [39], Methanobac- [14,15]. Additional studies have, however, suggested that terium thermoautotrophicum [40], Methanococcus jannaschii there is little selective difference between inversions and that [41], Mycobacterium tuberculosis [8], Mycoplasma genital- instead there may be certain regions that are more prone to ium [42], Mycoplasma pneumoniae [43], Neisseria meningi- inversion than others [16-18,22,23]. Alternatively, the inver- tidis MC58 [20], Neisseria meningitidis serogroup A strain sion events could be linked to replication, as has been sug- Z2491 [44], Pyrococcus horikoshii [45], Rickettsia prowazekii gested for small local inversion events [24]. Whatever the [46], Synechocystis sp. [47], Thermotoga maritima [48], Tre- interactions mechanisms, the fact that we find evidence for such inver- ponema pallidum [49], and Vibrio cholerae [4]. In addition, a sions between many pairs of species suggests that they are a few unpublished genomes were analyzed: Streptococcus pyo- common feature of bacterial evolution. Many aspects of the genes (obtained from the Oklahoma University Genome Center X-alignments require further exploration. For example, to website [7]), Streptococcus pneumoniae (H. Tettelin, personal split a tandem duplication, an inversion must fall precisely communication), and Mycobacterium leprae (obtained from on the boundary between two duplicated genes. This would the Sanger Centre Pathogen Sequencing Group website [9]). information appear to be unlikely, requiring a large number of inversions in order to generate a sufficient number of split gene pairs. If Whole-genome DNA alignments the mechanisms of gene duplication are somehow related to DNA alignments of the complete genomic sequences of all the mechanisms of inversion, however, then this model is bacteria used in this study were accomplished with the more plausible. The process of duplicating a gene, if it occurs MUMmer program [6]. This program uses an efficient suffix during replication, might promote a recombination event tree construction algorithm to rapidly compute alignments
  • 8. 8 Genome Biology Vol 1 No 6 Eisen et al. of entire genomes. The algorithm identifies all exact matches termini of S. pneumoniae and S. pyogenes were determined of nucleotide subsequences that are contained in both input by the authors of the present study using GC-skew analysis sequences; these exact matches must be longer than a speci- and the locations of characteristic genes, particularly the fied minimum length, which was set to 20 base pairs for this chromosome replication initiator gene dnaA. comparison. To search for genome-scale alignments within species, complete bacterial and archaeal genomes (25 in total including all published genomes) were aligned with Acknowledgements their own reverse complements. To search for between- We thank S. Eddy, M.A. Riley, T. Read, A. Stoltzfus, M-I Benito and I. Paulsen for helpful comments, suggestions and discussions. S.L.S. was sup- species alignments, all genomes were aligned against all ported in part by NSF grant IIS-9902923 and NIH grant R01 LM06845. S.L.S. others in both orientations. and J.A.E were supported in part by NSF grant KDI-9980088. Data for all published complete genome sequences were obtained from the NCBI genomes database [26] or from The Institute for Genomic Research (TIGR) Whole-genome protein comparisons Microbial Genome Database [27]. The sequences of V. cholerae, S. pneumo- The predicted proteome of each complete genome niae, and M. tuberculosis (CDC 1551) were determined at TIGR with support from NIH and the NIAID. The M. leprae sequence data were pro- sequence (all predicted proteins in the genome) was com- duced by the Pathogen Sequencing Group at the Sanger Centre. Sequencing pared to the proteomes of all complete genome sequences of M. leprae is funded by the Heiser Program for Research in Leprosy and (including itself) using the fasta3 program [50]. Matches Tuberculosis of The New York Community Trust and by L’Association Raoul Follereau. The M. tuberculosis CDC 1551 genome sequence was with an expected score (e-value) of 10–5 or less were con- obtained from TIGR. The source of the S. pyogenes genome sequence was sidered significant. the Streptococcal Genome Sequencing Project funded by USPHS/NIH grant AI38406, and was kindly made available by B. A. Roe, S.P. Linn, L. Song, X. Yuan, S. Clifton, R.E. McLaughlin, M. McShan and J. Ferretti, and can be Statistical significance of X-alignments obtained from the website of the Oklahoma University Genome Center [7]. To calculate the statistical significance of the X-alignments, the maximal unique matching subsequences (MUMs) for unrelated genomes were examined and found to be uni- References formly distributed [6]. With a uniform background, the 1. Seoighe C, Wolfe KH: Updated map of duplicated regions in the yeast genome. Gene 1999, 238:253-261. expected density of MUMs in any region of an alignment 2. Lin X, Kaul S, Rounsley S, Shea TP, Benito MI, Town CD, Fujii CY, plot is a simple proportion of the area of that region to the Mason T, Bowman CL, Barnstead M, et al.: Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana. Nature entire plot. In particular, in an alignment with N total 1999, 402:761-768. MUMs, the probability (Pr) of observing at least m matches 3. Mayer K, Schuller C, Wambutt R, Murphy G, Volckaert G, Pohl T, in a region with area p can be computed using the binomial Dusterhoft A, Stiekema W, Entian KD, Terryn N, et al.: Sequence and analysis of chromosome 4 of the plant Arabidopsis distribution in Equation 1: thaliana. Nature 1999, 402:769-777. N 4. Heidelberg JF, Eisen JA, Nelson WC, Clayton RA, Gwinn ML, N Dodson RJ, Haft DH, Hickey EK, Peterson JD, Umayam L, et al.: The Pr = px (1 - p) (N - x) genome sequence of Vibrio cholerae, the aetiologic agent of (1) x=m x cholera. Nature 2000, 406:477-484. 5. Blattner FR, Plunkett GI, Bloch CA, Perna NT, Burland V, Riley M, Collado-Vides J, Glasner JD, Rode CK, Mayhew GF, et al.: The com- The alignment of V. cholerae chrI (both forward and reverse plete genome sequence of Escherichia coli K-12. Science 1997, strands) versus E. coli contains 926 MUMs. The MUMs 277:1453-1462. 6. Delcher AL, Kasif S, Fleischmann RD, Peterson J, White O, Salzberg forming X-alignments appear along the diagonal (y = x) and SL: Alignment of whole genomes. Nucleic Acids Res 1999, the anti-diagonal (y = L -x, where L is the genome length). 27:2369-2376. To estimate the significance of the alignments in both direc- 7. Oklahoma University Genome Center [http://www.genome.ou.edu/strep.html] tions, diagonal strips were sampled along each of the diago- 8. Cole ST, Brosch R, Parkhill J, Garnier T, Churcher C, Harris D, nals. The width of each strip was set at 10% of the plot area Gordon SV, Eiglmeier K, Gas S, Barry III CE, et al.: Deciphering the biology of Mycobacterium tuberculosis from the complete and significance values were calculated (Table 1). genome sequence. Nature 1998, 393:537-544. 9. Sanger Centre Pathogen Sequencing Group Identification of origins of replication [ftp://ftp.sanger.ac.uk/pub/pathogens/leprae] 10. Zipkas D, Riley M: Proposal concerning mechanism of evolu- The origins of replication for the bacterial genomes have been tion of the genome of Escherichia coli. Proc Natl Acad Sci USA characterized by a variety of methods. For E. coli, M. tubercu- 1975, 72:1354-1358. 11. Wagner A: The fate of duplicated genes: loss or new func- losis and M. leprae, the origins have been well-characterized tion? BioEssays 1998, 20:785-788. by laboratory studies [51,52]. The origins and termini of C. 12. Lynch M, Force A: The probability of duplicate gene preserva- trachomatis, C. pneumoniae and V. cholerae were identified tion by subfunctionalization. Genetics 2000, 154:459-473. 13. Nadeau JH, Sankoff D: Comparable rates of gene loss and func- by GC-skew [53] and by characteristic genes in the region of tional divergence after genome duplications early in verte- the origin [4,19]. GC-skew uses the function (G- C)/(G+C) brate evolution. Genetics 1997, 147:1259-1266. 14. Francois V, Louarn J, Patte J, Rebollo JE, Louarn JM: Constraints in computed on 2,000 bp windows across the genome, which chromosomal inversions in Escherichia coli are not explained exhibits a clear tendency in many bacterial genomes to be by replication pausing at inverted terminator-like positive for the leading strand and negative for the lagging sequences. Mol Microbiol 1990, 4:537-542. 15. Rebollo JE, Francois V, Louarn JM: Detection and possible role of strand. The origin of H. pylori was determined by oligomer two large nondivisible zones on the Escherichia coli chromo- skew [54] and confirmed by GC-skew. The origins and some. Proc Natl Acad Sci USA 1988, 85:9391-9395.
  • 9. http://genomebiology.com/2000/1/6/research/0011.9 16. Segall A, Mahan MJ, Roth JR: Rearrangement of the bacterial 39. Alm RA, Ling LS, Moir DT, King BL, Brown ED, Doig PC, Smith DR, chromosome: forbidden inversions. Science 1988, 241:1314-1318. Noonan B, Guild BC, deJonge BL, et al.: Genomic-sequence com- 17. Mahan MJ, Roth JR: Ability of a bacterial chromosome parison of two unrelated isolates of the human gastric segment to invert is dictated by included material rather pathogen Helicobacter pylori. Nature 1999, 397:176-180. comment than flanking sequence. Genetics 1991, 129:1021-1032. 40. Smith DR, Doucette-Stamm LA, Deloughery C, Lee H, Dubois J, 18. Segall AM, Roth JR: Recombination between homologies in Aldredge T, Bashirzadeh R, Blakely D, Cook R, Gilbert K, et al.: direct and inverse orientation in the chromosome of Salmo- Complete genome sequence of Methanobacterium ther- nella: intervals which are nonpermissive for inversion for- moautotrophicum DH: functional analysis and comparative mation. Genetics 1989, 122:737-747. genomics. J Bacteriol 1996, 179:7135-7155. 19. Read TD, Brunham RC, Shen C, Gill SR, Heidelberg JF, White O, 41. Bult CJ, White O, Olsen GJ, Zhou L, Fleischmann RD, Sutton GG, Hickey EK, Peterson J, Utterback T, Berry K, et al.: Genome Blake JA, Fitzgerald LM, Clayton RA, Gocayne JD, et al.: Complete sequences of Chlamydia trachomatis MoPn and Chlamydia genome sequence of the methanogenic archaeon, pneumoniae AR39. Nucleic Acids Res 2000, 28:1397-1406. Methanococcus jannaschii. Science 1996, 273:1058-1073. 20. Tettelin H, Saunders NJ, Heidelberg J, Jeffries AC, Nelson KE, Eisen 42. Fraser CM, Gocayne JD, White O, Adams MD, Clayton RA, Fleis- JA, Ketchum KA, Hood DW, Peden JF, Dodson RJ, et al.: Complete chmann RD, Bult CJ, Kerlavage AR, Sutton G, Kelley JM, et al.: The genome sequence of Neisseria meningitidis serogroup B reviews minimal gene complement of Mycoplasma genitalium. Science strain MC58. Science 2000, 287:1809-1815. 1995, 270:397-403. 21. Force A, Lynch M, Pickett FB, Amores A, Yan YL, Postlethwait J: 43. Himmelreich R, Hilbert H, Plagens H, Pirkl E, Li BC, Herrmann R: Preservation of duplicate genes by complementary, degen- Complete sequence analysis of the genome of the bac- erative mutations. Genetics 1999, 151:1531-1545. terium Mycoplasma pneumoniae. Nucleic Acids Res 1996, 22. Schmid MB, Roth JR: Selection and endpoint distribution of 24:4420-4449. bacterial inversion mutations. Genetics 1983, 105:539-557. 44. Parkhill J, Achtman M, James KD, Bentley SD, Churcher C, Klee SR, 23. Mahan MJ, Roth JR: Reciprocality of recombination events that Morelli G, Basham D, Brown D, Chillingworth T, et al.: Complete rearrange the chromosome. Genetics 1988, 120:23-35. DNA sequence of a serogroup A strain of Neisseria meningi- 24. Gordon AJ, Halliday JA: Inversions with deletions and duplica- tidis Z2491. Nature 2000, 404:502-506. tions. Genetics 1995, 140:411-414. 45. Kawarabayasi Y, Sawada M, Horikawa H, Haikawa Y, Hino Y, 25. Valencia-Morales E, Romero D: Recombination enhancement reports Yamamoto S, Sekine M, Baba S, Kosugi H, Hosoyama A, et al.: Com- by replication (RER) in Rhizobium etli. Genetics 2000, 154:971-983. plete sequence and gene organization of the genome of a 26. National Center for Biotechnology Information, Entrez hyperthermophilic archaebacterium, Pyrococcus horikoshii Genomes [http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Genome] OT3. DNA Res 1998, 5:55-76. 27. The Institute for Genomic Research Microbial Genome 46. Andersson SG, Zomorodipour A, Andersson JO, Sicheritz-Ponten T, Database [http://www.tigr.org/tdb/mdb/mdb.html] Alsmark UC, Podowski RM, Naslund AK, Eriksson AS, Winkler HH, 28. Kawarabayasi Y, Hino Y, Horikawa H, Yamazaki S, Haikawa Y, Jin-no Kurland CG: The genome sequence of Rickettsia prowazekii K, Takahashi M, Sekine M, Baba S, Ankai A, et al.: Complete and the origin of mitochondria. Nature 1998, 396:133-140. genome sequence of an aerobic hyper-thermophilic crenar- 47. Kaneko T, Sato S, Kotani H, Tanaka A, Asamizu E, Nakamura Y, deposited research chaeon, Aeropyrum pernix K1. DNA Res 1999, 6:83-101, 145-52. Miyajima N, Hirosawa M, Sugiura M, Sasamoto S, et al.: Sequence 29. Deckert G, Warren PV, Gaasterland T, Young WG, Lenox AL, analysis of the genome of the unicellular cyanobacterium Grahams DE, Overbeek R, Snead MA, Keller M, Aujay M, et al.: The Synechocystis sp. strain PCC6803. II. Sequence determina- complete genome of the hyperthemophilic bacterium tion of the entire genome and assignment of potential Aquifex aeolicus. Nature 1998, 392:353-358. protein-coding regions. DNA Res 1996, 3:109-136. 30. Klenk H-P, Clayton RA, Tomb J-F, White O, Nelsen KE, Ketchum 48. Nelson KE, Clayton RA, Gill SR, Gwinn ML, Dodson RJ, Haft DH, KA, Dodson RJ, Gwinn M, Hickey EK, Peterson JD, et al.: The com- Hickey EK, Peterson JD, Nelson WC, Ketchum KA, et al.: Evidence plete genomic sequence of the hyperthermophilic, sulfate- for lateral gene transfer between Archaea and bacteria reducing archaeon Archaeoglobus fulgidus. Nature 1997, from genome sequence of Thermotoga maritima. Nature 1999, 390:364-370. 31. Kunst A, Ogasawara N, Moszer I, Albertini A, Alloni G, Azevedo V, 399:323-329. Bertero M, Bessieres P, Bolotin A, Borchert S, et al.: The complete 49. Fraser CM, Norris SJ, Weinstock GM, White O, Sutton GG, Dodson refereed research genome sequence of the Gram-positive bacterium Bacillus R, Gwinn M, Hickey EK, Clayton R, Ketchum KA, et al.: Complete subtilis. Nature 1997, 390:249-256. genome sequence of Treponema pallidum, the syphilis spiro- 32. Fraser CM, Norris SJ, Weinstock GM, White O, Sutton GG, Dodson chete. Science 1998, 281:375-388. R, Gwinn M, Hickey EK, Clayton R, Ketchum KA, et al.: Genomic 50. Pearson WR: Flexible sequence similarity searching with the sequence of a Lyme disease spirochaete, Borrelia burgdorferi. FASTA3 program package. Methods Mol Biol 2000, 132:185-219. Nature 1997, 390:580-586. 51. Marsh RC, Worcel A: A DNA fragment containing the origin 33. Parkhill J, Wren BW, Mungall K, Ketley JM, Churcher C, Basham D, of replication of the Escherichia coli chromosome. Proc Natl Chillingworth T, Davies RM, Feltwell T, Holroyd S, et al.: The Acad Sci USA 1977, 74:2720-2724. genome sequence of the food-borne pathogen Campylobac- 52. Salazar L, Fsihi H, de Rossi E, Riccardi G, Rios C, Cole ST, Takiff HE: ter jejuni reveals hypervariable sequences. Nature 2000, Organization of the origins of replication of the chromo- 403:665-668. somes of Mycobacterium smegmatis, Mycobacterium leprae 34. Kalman S, Mitchell W, Marathe R, Lammel C, Fan J, Hyman RW, and Mycobacterium tuberculosis and isolation of a functional origin from M. smegmatis. Mol Microbiol 1996, 20:283-293. interactions Olinger L, Grimwood J, Davis RW, Stephens RS: Comparative genomes of Chlamydia pneumoniae and C. trachomatis. Nat 53. Lobry JR: Asymmetric substitution patterns in the two DNA Genet 1999, 21:385-389. strands of bacteria. Mol Biol Evol 1996, 13:660-665. 35. Stephens RS, Kalman S, Lammel C, Fan J, Marathe R, Aravind L, 54. Salzberg SL, Salzberg AJ, Kerlavage AR, Tomb JF: Skewed Mitchell W, Olinger L, Tatusov RL, Zhao Q, et al.: Genome oligomers and origins of replication. Gene 1998, 217:57-67. sequence of an obligate intracellular pathogen of humans: Chlamydia trachomatis. Science 1998, 282:754-759. 36. White O, Eisen JA, Heidelberg JF, Hickey EK, Peterson JD, Dodson RJ, Haft DH, Gwinn ML, Nelson WC, Richardson DL, et al.: Genome sequence of the radioresistant bacterium Deinococcus radiodurans R1. Science 1999, 286:1571-1577. information 37. Fleischmann RD, Adams MD, White O, Clayton RA, Kirkness EF, Kerlavage AR, Bult CJ, Tomb JF, Dougherty BA, Merrick JM, et al.: Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science 1995, 269:496-512. 38. Tomb JF, White O, Kerlavage AR, Clayton RA, Sutton GG, Fleis- chmann RD, Ketchum KA, Klenk HP, Gill S, Dougherty BA, et al.: The complete genome sequence of the gastric pathogen Helicobacter pylori. Nature 1997, 388:539-547.