SlideShare a Scribd company logo
1 of 18
Distribution of rRNA Introns in the Three-dimensional
Structure of the Ribosome
Scott A. Jackson1
, Jamie J. Cannone2
, Jung C. Lee3
, Robin R. Gutell2
*
and Sarah A. Woodson4
*
1
Department of Chemistry
and Biochemistry
University of Maryland
College Park, MD 20497-2021
USA
2
The Institute for Cellular
and Molecular Biology
and Section of Integrative
Biology, The University of
Texas at Austin, TX 78712
USA
3
Division of Medicinal
Chemistry, College of
Pharmacy, The University of
Texas at Austin, TX 78712
USA
4
T. C. Jenkins Department
of Biophysics, Johns Hopkins
University, 3400 N. Charles
Street, Baltimore
MD 21218-4118, USA
More than 1200 introns have been documented at over 150 unique sites in
the small and large subunit ribosomal RNA genes (as of February 2002).
Nearly all of these introns are assigned to one of four main types: group
I, group II, archaeal and spliceosomal. This sequence information has
been organized into a relational database that is accessible through the
Comparative RNA Web Site (http://www.rna.icmb.utexas.edu/) While
the rRNA introns are distributed across the entire tree of life, the
majority of introns occur within a few phylogenetic groups. We analyzed
the distributions of rRNA introns within the three-dimensional structures
of the 30 S and 50 S ribosomes. Most sites in rRNA genes that contain
introns contain only one type of intron. While the intron insertion sites
occur at many different coordinates, the majority are clustered near
conserved residues that form tRNA binding sites and the subunit
interface. Contrary to our expectations, many of these positions are not
accessible to solvent in the mature ribosome. The correlation between the
frequency of intron insertions and proximity of the insertion site to
functionally important residues suggests an association between intron
evolution and rRNA function.
q 2002 Elsevier Science Ltd. All rights reserved
Keywords: group I/II introns; ribosomal RNA; intron transposition; reverse
splicing; sequence database*Corresponding authors
Introduction
Introns in ribosomal RNA genes are found pre-
dominantly found within conserved sequences
near tRNA and mRNA binding sites, suggesting a
possible link between intron evolution and rRNA
function.1 – 3
Examples of every major intron class
have been identified in rRNA genes.4,5
These
include group I and group II introns,6,7
tRNA-like
introns in archaeal genomes,8
a newly defined
family of “spliceosomal” introns in eukaryotic
nuclear rDNA with splice sites that resemble the
conserved splice site sequences of nuclear pre-
mRNA introns,9
and a small number of introns
that cannot be assigned to one of these four groups.
The sporadic appearance of group I and group II
introns among the rRNA genes of organisms from
all three phylogenetic kingdoms points to a
complex evolutionary past.10
Although examples
of introns that have descended through ancient
lineages are known,11 – 14
the appearance of similar
introns in different genes or unrelated organisms
suggests that they were inserted into host genomes
many times during their evolution.15 – 19
Conse-
quently, the distribution of known rRNA introns
is the product of multiple insertions and selective
losses.20
Since ribosomal RNAs are excellent chrono-
meters by which to measure phylogenetic
relationships,21
many laboratories are determining
rRNA sequences from organisms spanning the
entire tree of life. Consequently, GenBank contains
nearly 10,000 complete 16 S and 23 S (and 16 S-like
and 23 S-like) sequences.22
From this diverse
collection of rRNA sequences, approximately 1200
introns have been identified, sequenced, and
0022-2836/02/$ - see front matter q 2002 Elsevier Science Ltd. All rights reserved
E-mail addresses of the corresponding authors:
swoodson@jhu.edu; robin.gutell@mail.utexas.edu
Abbreviations used: ASA, accessible surface area;
CRW, Comparative RNA Web.
doi:10.1016/S0022-2836(02)00895-1 available online at http://www.idealibrary.com on
Bw
J. Mol. Biol. (2002) 323, 35–52
Table 1. Number of known intron sequences at each rRNA position
Intron typea
Cell locationb
Positionc
No. intronsd
I II A S U C M N
A. 16 S rRNA
TBD 20 20 – – – – – – 20
40 1 1 – – – – – – 1
114 2 2 – – – – – – 2
156 3 3 – – – – – – 3
170 1 1 – – – – – – 1
263 1 – – 1 – – – – 1
265 3 – – – 3 – – – 3
287 2 2 – – – – – – 2
297 4 – – – 4 – – – 4
298 2 – – – 2 – – – 2
299 11 – – – 11 – – – 11
300 1 – – – 1 – – – 1
322 1 – – 1 – – – – 1
323 3 3 – – – – – – 3
330 21 – – – 21 – – – 21
331 7 – – – 7 – – – 7
332 1 – – – 1 – – – 1
333 1 – – – 1 – – – 1
337 1 – – – 1 – – – 1
374 2 – – 2 – – – – 2
390 1 – – – 1 – – – 1
392 1 1 – – – – – – 1
393 11 – – – 11 – – – 11
400 1 – – – 1 – – – 1
497 2 2 – – – – – – 2
508 1 1 – – – – 1 – –
516 118 116 – – – 2 – – 118
529 1 1 – – – – – – 1
531 2 2 – – – – 1 1 –
532 1 – – 1 – – – – 1
548 1 – – 1 – – – – 1
568 1 1 – – – – – – 1
569 6 6 – – – – – 6 –
570 1 1 – – – – – 1 –
651 1 1 – – – – – – 1
674 3 – – – 3 – – – 3
742 1 – – – 1 – – 1
781 4 – – 4 – – – – 4
788 25 22 2 – – 1 – 2 23
789 1 1 – – – – – – 1
793 6 6 – – – – 1 5 –
879 1 – – – – 1 – – 1
882 1 – – – 1 – – – 1
883 5 – – – 5 – – – 5
891 1 1 – – – – – – 1
896 1 – – – – 1 – – 1
901 2 – – 2 – – – – 2
908 12 – – 12 – – – – 12
911 1 – 1 – – – – 1 –
915 2 – – – – 2 – 2 –
934 1 1 – – – – – – 1
939 8 – – – 8 – – – 8
940 9 9 – – – – – – 9
943 126 125 – – – 1 – – 126
952 5 – 5 – – – – 5 –
956 6 6 – – – – – – 6
966 1 1 – – – – – – 1
989 24 24 – – – – – – 24
1046 10 10 – – – – – – 10
1052 2 2 – – – – – – 2
1062 1 1 – – – – – – 1
1068 8 – – 8 – – – – 8
1071 1 – – – 1 – – – 1
1083 1 – – – 1 – – – 1
1092 1 – – 1 – – – – 1
1139 2 2 – – – – – – 2
1197 1 – – – 1 – – – 1
1199 66 66 – – – – – 66
(continued)
36 rRNA Introns in 3D
Table 1 Continued
Intron typea
Cell locationb
Positionc
No. intronsd
I II A S U C M N
1205 5 – – 5 – – – – 5
1210 9 9 – – – – – 5 4
1213 5 – – 5 – – – – 5
1224 2 2 – – – – – 2 –
1226 2 – – – 2 – – – 2
1229 6 – – – 6 – – – 6
1247 1 1 – – – – – 1 –
1363 1 – – 1 – – – – 1
1389 7 7 – – – – – – 7
1391 1 – – 1 – – – – 1
1502 2 – – – – 2 – – 2
1506 152 152 – – – – – – 152
1508 1 – – – – 1 – – 1
1511 4 4 – – – – – – 4
1512 29 29 – – – – – – 29
1514 1 – – – 1 – – – 1
1516 145 145 – – – – – – 145
Total 950 790 8 45 92 15 3 31 916
B. 23 S rRNA
575 1 – 1 – – – – 1 –
580 1 1 – – – – – 1 –
678 12 – – – 12 – – – 12
681 1 – – – 1 – – – 1
711 1 – – – 1 – – – 1
730 2 2 – – – – 2 – –
775 1 – – – 1 – – – 1
776 4 – – – 4 – – – 4
779 1 1 – – – – – 1 –
780 1 – – – 1 – – – 1
784 1 – – – 1 – – – 1
786 1 – – – 1 – – – 1
787 2 – 1 – 1 – – 1 1
796 1 1 – – – – – 1 –
798 30 30 – – – – – – 30
799 1 1 – – – – – – 1
800 3 3 – – – – – – 3
824 1 – – – 1 – – – 1
858 2 – – – 2 – – – 2
958 3 3 – – – – 3 – –
978 1 – – – 1 – – – 1
1025 3 3 – – – – – – 3
1065 4 4 – – – – 3 1 –
1066 1 1 – – – – – 1 –
1085 9 – – 9 – – – – 9
1090 1 1 – – – – – – 1
1091 1 – – – 1 – – – 1
1094 6 6 – – – – – – 6
1255 3 3 – – – – 2 – –
1685 1 1 – – – – – 1 –
1699 1 1 – – – – – 1 –
1766 2 2 – – – – 2 – –
1787 2 – 2 – – – – 2 –
1809 1 – – 1 – – – – 1
1849 1 – – – 1 – – – 1
1915 1 1 – – – – – 1 –
1917 1 1 – – – – 1 – –
1921 13 13 – – – – – – 13
1923 14 13 – – – 1 8 3 3
1925 9 9 – – – – – – 9
1926 4 4 – – – – – – 4
1927 7 – – 7 – – – – 7
1931 16 16 – – – – 11 4 1
1939 1 1 – – – – – 1 –
1943 1 1 – – – – – 1 –
1949 12 12 – – – – – 7 5
1951 3 3 – – – – 2 1 –
1952 1 – – 1 – – – – 1
1974 2 1 – – – 1 – 2 –
(continued)
rRNA Introns in 3D 37
organized into a relational database.5
Since the
rRNA sequences flanking the introns are con-
served, the intron/exon boundaries have been
mapped unambiguously. The collections of rRNA
and intron sequences are sufficiently large to inde-
pendently determine the phylogenetic relation-
ships of the introns and the host organisms.
The prevalence of introns in regions of the 16 S
and 23 S rRNAs that bind tRNAs and elongation
factors sparked the suggestion in 1993 that this dis-
tribution arises from reverse splicing into rRNA
sites that are solvent-accessible in ribosomes.23
Other experiments showed that the efficiency of
pre-rRNA splicing depends on interactions
between the intron and the surrounding rRNA.24–27
Both of these results suggest that introns are more
frequent in certain regions of rRNA genes than
others, due to differences in the conformations of
the mature rRNAs at those sites.
Here, we revisit this question in light of the sig-
nificant increase in the collection of rRNA introns
and the recent high-resolution structures of the
ribosome.28 – 31
We have determined that intron-
containing sites are strongly clustered around pos-
itions of the rRNA that interact with tRNAs and
mRNA, but these positions are not more accessible
to solvent in the mature ribosome than average.
Although the splicing mechanism of each intron
class is distinct, most types of introns cluster
within the same regions of the rRNA, suggesting
that the movement and retention of introns is
driven by structural features in the rRNA. The
implications of these findings for mechanisms of
intron mobility and retention are discussed.
Results
A total of 1253 rRNA sequences containing
introns were retrieved from GenBank as of
February 2002, representing organisms from all
major phylogenetic groups. The positions of the
introns within the rRNA gene were determined
from alignments of the mature rRNAs, as pre-
viously described,5
and numbered according to
the sequences of the Escherichia coli 16 S and 23 S
rRNAs.32
Sequence entries were sorted according
to the position of the intron in the 16 S or 23 S
rRNA, and the number of rRNA sequences con-
taining an intron at each position was noted (Table
1). The introns were also classified as group I6,33
or
group II7
families, tRNA-like archaeal introns,26
or
nuclear spliceosomal introns,9,34
based on their
conformity to consensus secondary structures
and conserved sequences. Group I and group II
introns were further subdivided into structural
families.6,7,33
The database of rRNA introns is continually
updated as new rRNA sequences are deposited in
GenBank.5
Hence, the precise statistics reported
here are expected to change as the number of
available intron-containing sequences increases.
However, the number of intron-containing pos-
itions in the rRNA is increasing more slowly than
Table 1 Continued
Intron typea
Cell locationb
Positionc
No. intronsd
I II A S U C M N
2004 1 – – – – 1 – – 1
2059 4 – 4 – – – – 4 –
2066 6 6 – – – – – – 6
2067 1 1 – – – – – 1 –
2256 1 1 – – – – – 1 –
2262 5 5 – – – – 5 – –
2449 48 48 – – – – 7 30 11
2451 1 – 1 – – – – 1 –
2455 1 – 1 – – – – 1 –
2499 1 1 – – – – – 1 –
2500 32 32 – – – – 10 22 –
2504 5 5 – – – – – 5 –
2509 2 – 2 – – – – 2 –
2552 1 – – 1 – – – – 1
2563 7 7 – – – – – – 7
2585 2 2 – – – – – 2 –
2593 16 16 – – – – 13 3 –
2596 3 3 – – – – 3 – –
2601 2 – – 2 – – – – 2
2610 1 – 1 – – – – 1 –
Total 335 269 13 21 29 3 76 105 154
a
Intron types are classified as I, group I; II, group II; A, archaeal; S, spliceosomal; U, unknown.
b
Cell compartment of rRNA gene: C, chloroplast; M, mitochondrion; N, nucleus. Introns in archaea and bacteria are defined as
belonging to the nucleus. The only known rRNA intron in a bacterial genome occurs at position 1931 in the 23 S rRNA.
c
Position of nucleotide immediately prior to intron in Escherichia coli reference sequence. TBD, to be determined. These intron
sequences were published without flanking exon sequence and their insertion sequences could not be determined.
d
Number of intron sequences presently known at that position.
38 rRNA Introns in 3D
the total number of intron sequences. Conse-
quently, the general trends in the data are likely to
remain the same.
Non-random distribution of introns in
rRNA genes
Within 16 S (and 16 S-like) rRNA genes, 920
intron sequences are distributed over 84 sites,
among 1542 E. coli positions in the mature rRNA.
Among 23 S (and 23 S-like) rRNA genes, 325
introns are found at 69 sites, out of 2904 E. coli pos-
itions. Although 23 S rRNA is twice the length of
the 16 S rRNA, threefold more introns have been
documented in 16 S rRNA genes. This apparent
bias is largely due to unequal sampling of rRNA
sequences. As of February 2002, the CRW database
contains 7527 complete 16 S rRNA sequences and
960 complete 23 S rRNA sequences. When normal-
ized against the number of reported rRNA
sequences, the 16 S and 23 S rRNAs have approxi-
mately the same number of introns per rRNA
nucleotide, with ,1024
introns per sequence per
nucleotide in each gene.
Comparison of the data with a Poisson distri-
bution based on the mean frequency of rRNA
introns shows that many fewer sites contain
introns than would be expected if introns were
inserted randomly within rRNA genes (Figure 1).
A few sites in the rRNA account for a large fraction
of the intron sequences in the CRW database
(Table 1). These include positions 516, 943, 1506
and 1516 in the 16 S rRNA, with more than 100
known introns each, and positions 798, 2449, and
2500 in the 23 S rRNA, which each have more
than 25 intron sequences. This skewed distribution
persists even when sequence entries from closely
related species or strains are counted only once.
Hence, the presence of many intron sequences at a
few sites in the rRNA is not due to biased
sampling of rRNA sequences.
The tendency of introns to occur in particular
positions of the rRNA can be explained by two
mechanisms; (1) vertical inheritance of introns
from ancestral genes, or (2) preferential horizontal
transfer of intron sequences to certain target sites.
The presence of similar introns at the same
position in the rRNAs of related organisms is
usually attributed to stable inheritance of the
intron within the lineage. Examples of this inheri-
tance include 54 group IE introns that occur at
position 1199 of Ascomycota nuclear 16 S rRNAs35
and 128 group I introns at position 1516 in the
16 S rRNA genes of Lecanorineae (Table 1).
In contrast, different subclasses of introns at the
same rRNA position, or closely related introns
in unrelated organisms, are best explained by
horizontal transfer of the intron sequences. For
example, group IC1 introns with a distinctive
structural motif are inserted after position 516 in
the 16 S rRNA of red and brown algae but are not
found in intermediate relatives on the phylogenetic
tree.19
This suggests that the introns were acquired
independently after the red and brown algae
diverged. Introns have probably been inserted
(and lost) at position 516 many times during
evolution,14
as introns from two structural classes
(IC1 and IE) have been identified at this position
in four phylogenetic groups.5
In the 23 S
rRNA, nine group I introns corresponding to three
different subclasses were found at position 1931
in the mitochondrial, chloroplast and nuclear
rDNA of diverse organisms.16,17,36
Thus, the
phylogenetic data suggest that introns are repeat-
edly inserted at the same positions in rRNA
genes. Therefore, the prevalence of introns at
certain positions in rRNA genes is due not only
to inheritance of a few stable introns but also to
site-selective transposition and retention of introns.
Intron positions at the functional heart of
the ribosome
To determine whether the location of intron sites
depends on the higher-order structure of the
Figure 1. Distribution of introns in rRNAs. Histogram
of the number of positions in the (a) 16 S rRNA or (b)
23 S rRNA with a given number of intron insertions.
Positions are numbered according to the sequence of the
E. coli rRNAs. Shaded bars indicate the actual number
of intron-containing positions (Table 1); lines indicate
the expected number of intron-containing positions
expected from a Poisson distribution, based on the mean
frequency of introns in rRNA sequences. Insets show an
expansion of the y-axis.
rRNA Introns in 3D 39
rRNA, intron-containing positions were mapped
onto secondary structures of the 16 S and 23 S
rRNAs determined from comparative sequence
analysis (Figures 2(a) and 3(a)).3,5,37
The intron
positions were also visualized on three-dimen-
sional structures of the 30 S and 50 S ribosomal
subunits recently determined by X-ray crystal-
lography28,29
(Figures 2(b) and 3(b) and Further
information; see Methods.
Introns are found in every domain of the 16 S
rRNAs. A large number of intron-containing sites
lie near highly conserved regions that function in
decoding and translocation, such as the 530 loop,
the “switch helix” (nucleotides (nt) 885–912), and
the 30
-terminal helix (nt 1506–1529). Other introns
are found near residues that form the ribosome
subunit interface, such as four intron insertions
within the 790 loop (Figure 2(a)). When mapped
onto the structure of the Thermus thermophilus
30 S ribosome29
(Figure 2(b)), intron-containing
positions are concentrated towards the center of
the subunit, as suggested by their locations in the
secondary structure. A third of intron positions
lie on the face of the 30 S particle that contacts the
50 S subunit. Strikingly, positions with large
numbers of introns (516, 788, 943, 1199, 1506 and
Figure 2. Location of introns in
16 S rRNAs. (a) Secondary structure
of 16 S rRNA, showing nucleotides
that are present in more than 95%
of known sequences.5,37
Red dot,
$98% conserved; gray dot, 90–
98% conserved; open circle, ,90%
conserved. Positions for which an
intron is documented are numbered
according to the sequence of the
E. coli 16 S rRNA and colored
according to intron type: green,
group I; red, group II; cyan,
archaeal; blue, spliceosomal;
orange, other; black, multiple intron
types. Nucleotides that are within
10 A˚ of mRNA or tRNA bound
at the A, P or E sites42
are shaded
yellow. Brackets indicate positions
that are not conserved among all
rRNA sequences. The asterisk (p)
indicates an insertion relative to
the E. coli sequence. (b) Three-
dimensional structure of the 30 S
ribosomal subunit (1FJF),29
with
intron-containing positions colored
according to intron type as in (a).
Large spheres represent positions
with more than 100 known intron
sequences. The 16 S rRNA is
shown as light gray ribbon; proteins
as pink ribbons.
40 rRNA Introns in 3D
1516) all lie near functionally important residues at
the subunit interface.
Among 23 S rRNA genes, introns are also found
near functionally important positions in domains
II, IV and V of the 23 S rRNA (Figure 3(a)).
In domain IV of the 23 S rRNA, many introns are
found in sequences that form the interface with
the 30 S subunit, with a large number of insertions
near helix 69 (nt 1905–1920) at the center of the
subunit (Figure 3). This stem–loop interacts with
tRNAs bound in the A and P sites, and makes
contacts with 16 S rRNA residues in the decoding
site of the ribosome.38,39
In three-dimensional
views of the Haloarcula marismortui 50 S ribosome,28
the intron-containing positions in domain IV can
be seen running across the central ridge and
down the face that contacts the 30 S subunit
(Figure 3(b)).
Many introns are inserted adjacent to highly
conserved nucleotides in domain V of the 23 S
rRNA that form the peptidyl transferase center of
the ribosome (Figure 3(b)). These nucleotides line
both sides of the peptidyl transferase cleft (Figure
3(b)), placing the sites of intron insertions in close
proximity to the acceptor stems of bound tRNAs.
Among nucleotides deep in the peptidyl transfer-
ase cleft are several positions that frequently
precede intron sites, such as nt 2449 and 2500
(Table 1).
A smaller number of introns are also found
throughout domain II of 23 S rRNA genes (Figure
3). Several introns are found in the L11-binding
region (nt 1050–1100), which interacts with
elongation factors in the 70 S ribosome.40,41
Surpris-
ingly, no introns were found at rRNA positions
that face the solvent side of the 50 S subunit or in
the bottom surface created by domains III and VI
of the 23 S rRNA. Thus, intron-containing positions
in the 23 S rRNA are concentrated in the center of
the 50 S particle and on the surface that contacts
the 30 S subunit.
Three-dimensional clusters of intron sites
To quantify the extent to which intron insertion
sites are clustered within the three-dimensional
structure of the 70 S ribosome, we computed a
center of mass from the coordinates of the 30
phos-
phate group of each intron insertion site. The
standard deviations of the x, y and z coordinates
were summed to create a “cluster score,” with a
lower number representing a tighter group (see
Methods). As illustrated in Figure 4(a), nucleotides
that precede introns are closer to each other than
the same number of randomly selected phosphate
groups in the rRNA. This is particularly true of
positions that contain large numbers of introns.
Hence, intron-containing positions are more tightly
clustered in space when each coordinate is
weighted by the number of rRNA genes containing
an intron at that position (Figure 4(a)).
Proximity to conserved rRNA nucleotides
To analyze the extent to which introns are
present at conserved rRNA sites, the conservation
of nucleotides 50
and 30
of each intron insertion
site was determined. The level of conservation
was extracted from alignments of rRNA sequences
representing organisms from the three primary
phylogenetic domains, plus chloroplasts and
mitochondria,5
and is available at the CRW Site
(see Methods). Nucleotides that are more than
98% identical (category A) or 90–98% identical
(category B) are highlighted in red and gray in the
secondary structure diagrams in Figures 2(a) and
3(a), and in space-filling representations of the
ribosomal subunits in Figure 5.
Only 12% and 5% of all 16 S and 23 S rRNA
nucleotides, respectively, are conserved in more
than 98% of all known rRNA sequences (category
A). In contrast, 21% of 16 S and 25% of 23 S rRNA
nucleotides that precede an intron-insertion site
are conserved at this level (Table 2(A)). We observe
a similar bias in the number of category B nucleo-
tides (90–98% identical) preceding intron positions
(Table 2(A)). Alternatively, we counted the number
of intron-containing positions that occur near a
category A nucleotide (Table 2(B)). About a third
(36%) of 16 S rRNA intron positions were adjacent
to a category A nucleotide, compared with 18% of
all 16 S rRNA nucleotides. In the 23 S rRNA, 32%
of intron-containing positions were adjacent to a
category A nucleotide, compared with 8% overall.
Positions with many introns (16 S positions 516,
788, 943, 1199, 1506, 1512, 1516; 23 S positions 798,
1921, 2449, 2500) are even more likely to be next to
highly conserved nucleotides than positions that
contain only one or a few known introns. This
bias is quite strong, particularly among 16 S rRNA
sequences. Thus, introns are much more likely to
lie within conserved regions of the rRNA than
expected if they were randomly inserted in rRNA
genes.
Although introns tend to occur in conserved
regions of the rRNA, not all highly conserved
regions contain known introns. The most con-
served regions of the 16 S and 23 S rRNAs that do
not contain any known introns as of February
2002 are the 250 region in domain I and the alpha-
sarcin/ricin loop in domain VI (2650–2670) of the
23 S rRNA, and the 1400 region of the 16 S rRNA
that forms part of the mRNA decoding center.
In fact, the lack of introns in domain VI of the
23 S rRNA is striking, given its conservation and
functional importance.
Proximity to tRNA binding sites
Most introns tend to be inserted near residues
that form tRNA binding sites (Figures 2 and 3).
This trend was evaluated by computing the frac-
tion of intron sites within a certain distance of the
A, P or E sites of the 70 S ribosome (see Methods).
As shown in Figure 6, intron-containing positions
rRNA Introns in 3D 41
Figure 3 (legend opposite)
42 rRNA Introns in 3D
are closer in space to bound tRNAs than average
positions in the 16 S or 23 S rRNAs. Among 153
positions that contain introns, 26 are within 10 A˚
of a tRNA site and include rRNA nucleotides that
make direct interactions with bound tRNAs.31
Roughly 60% of introns lie within 20 A˚ of the A, P
or E sites, and 90% are within 40 A˚ . In contrast,
the mean distance of all rRNA residues from the
tRNA sites is 50 A˚ . Nearly all of the rRNA sites
that contain a large number of introns are within
20 A˚ of a bound tRNA. These include positions
516, 943 and 1506 in the 16 S rRNA and 2449 and
2500 in the 23 S rRNA. Position 1506 in the 16 S
rRNA is even closer (10 A˚ ) to the mRNA channel.42
The proximity of intron-containing positions
to tRNA binding sites explains the clustering of
introns within the three-dimensional structure of
the 70 S ribosome and is congruent with their
proximity to highly conserved rRNA residues.
A corollary of the association between introns
and functional sites in the ribosome is that intron-
containing positions occur in RNA-rich areas of
the ribosome. Qualitative examination of the 30 S
ribosome structure showed that very few intron-
containing positions interact directly with riboso-
mal proteins. Almost no intron-containing
positions in the 23 S rRNA contact proteins, as
most proteins bind the solvent face of the large
subunit,28
which is devoid of introns.
Comparison of intron types
Introns belonging to different classes (group I, II,
archaeal and spliceosomal) are rarely inserted at
precisely the same position within the rRNA.5
Two exceptions are position 788 in the 16 S rRNA,
which contains group I and group II introns
(Figure 2(a) and Table 1A), and position 787 in the
Figure 4. Three-dimensional
clustering of rRNA intron sites
within the 70 S ribosome. Cluster
scores were computed as described
in Methods; a lower score indicates
a narrower distribution of x, y and
z coordinates. (a) Comparison of
rRNA positions in the 16 S and
23 S rRNAs (70 S), 23 S rRNA only
(50 S), or 16 S rRNA only (30 S).
Dark gray, all rRNA positions; light
gray, intron-containing positions;
white, intron-containing sequences;
hatched, P-site tRNA. (b) Compari-
son of intron types. Dark gray, 70 S
ribosome; light gray, 50 S ribosome;
white, 30 S subunit. All positions
corresponds to all rRNA residues;
reverse splicing refers to positions
that were experimentally found to
react with the Tetrahymena group I
intron.47,48
Figure 3. Location of introns in 23 S rRNAs. (a) Secondary structure of 23 S rRNA, showing nucleotides that are
present in more than 95% of known sequences.3,5
Nucleotides are represented as in Figure 2(a). Positions that contain
a documented intron are numbered according to the sequence of the E. coli 23 S rRNA and colored according to intron
type as in Figure 2(a). (b) Three-dimensional structure of the 50 S ribosome (1JJ2),76
colored as in Figure 2(b). Spheres
represent intron-containing positions; large spheres indicate positions with more than 29 known intron sequences.
rRNA Introns in 3D 43
23 S rRNA, which contains group II and
spliceosomal introns (Figure 3(b) and Table 1B). A
few sites contain both group I and unclassified
introns (16 S rRNA positions 516 and 943; 23 S
positions 1923 and 1974). As each intron type has
a different splicing mechanism, distinct insertion
sites may reflect different constraints on splice-site
recognition. For example, the 50
splice site of
group I introns is nearly always preceded by a
U,43,44
whereas group II and archaeal exon bound-
aries are variable in sequence.26,45
Although their precise insertion sites do not
overlap, most introns are inserted near rRNA resi-
dues that form the center of the 70 S ribosome
(Figures 2(b) and 3(b)). Nucleotides near the center
of the ribosome tend to be conserved, and group I,
group II and archaeal introns are particularly likely
to be inserted at locations within three positions of
a category A nucleotide (Table 2B). These data
suggest that the evolutionary forces that shape the
distribution of group I, group II and archaeal
introns within rRNA genes originate from the
sequence and structure of the rRNA itself rather
than the type of intron.
In contrast, spliceosomal introns are less
clustered with respect to the three-dimensional
structure of the ribosome than group I and group
II introns (Figure 4(b)). Moreover, they are less
likely to be near a highly conserved nucleotide
than other types of introns (Table 2B), and more
likely to be found in residues far from tRNA
binding sites. For example, only one spliceosomal
intron is found in 23 S rRNA domain IV (1849),
although this region contains many group I and
group II intron insertions (Figure 3(a)). As spliceo-
somal introns do not have a conserved secondary
structure, their splicing may be less sensitive to
the structure of the rRNA exons than group I,
group II and archaeal introns, which must fold
into a precise structure in order to be spliced.
Solvent accessibility of intron positions
Introns may preferentially exist near rRNA
residues that interact with tRNAs, because these
residues are on the surface of the ribosome. If
introns are inserted into the rRNA by reverse
splicing, we would expect them to integrate more
frequently into accessible positions of the
rRNA.23,46
Alternatively, introns may be retained
more frequently at exposed positions where they
might not interfere with ribosome assembly.
To address this question, the solvent-accessible
surface area (ASA) of each atom was computed
from the coordinates for the T. thermophilus
30 S ribosome and the 50 S subunit from
H. marismortui, as described in Methods.
The 30
oxygen atom was used as a proxy for
intron insertion sites, although similar results
were obtained with other backbone atoms. As
expected, many 30
oxygen atoms are fully buried
(ASA ¼ 0 A˚ 2
), while others are at least partially
exposed to solvent (ASA . 4 A˚ 2
) (Figure 7). The
latter correspond to residues that are on the surface
Figure 5. Conservation of rRNA. Intron positions (yellow spheres) are indicated on a space-filling model of the ribo-
some. Red, rRNA nucleotides that are .98% conserved among all sequences (category A); pink, nucleotides that are
90–98% conserved (category B).5
Atoms are rendered at twice the van der Waal’s radius; intron positions are rendered
four times larger. (a) 30 S (front view). (b) 50 S (crown view). The figure was prepared with Insight II.
44 rRNA Introns in 3D
of the ribosomal subunit. Contrary to our expec-
tations, 30
oxygen atoms preceding intron-contain-
ing sites were exposed to solvent in the same
proportion at all rRNA positions (Figure 7). From
these data, we conclude that intron-containing
positions are not systematically biased towards
rRNA positions that are solvent-accessible in
mature subunits.
Visual inspection of the three-dimensional struc-
tures of the 30 S and 50 S subunits (Figures 2 and
3) and the data in Figure 7 show that introns are
inserted in both accessible and buried positions.
Some regions of the 23 S rRNA that contain many
introns are highly exposed to solvent in the 50 S
ribosome, such as 1080 and 1915. Others, such as
position 2449, have an average accessibility, while
a few, such as 2500, are buried deeply within the
50 S subunit. The 30 S subunit is quite flat, so that
even residues that are buried in the static structure
of the ribosome are not far from the surface of the
Table 2. Conservation of rRNA near intron sites
16 S 23 S
nt %Total nt %Total
A. Conservation of rRNA residues prior to intron insertion sitesa
All rRNA residues
.98% 178 11.5 150 5.2
90–98% 175 11.3 203 7.0
80–90% 116 7.5 168 5.8
Total 1542 100.0 2904 100.0
nt prior to:
All introns
.98% 18 21.4 17 24.7
90–98% 14 16.7 12 17.4
80–90% 12 14.3 5 7.2
Total 84 100.0 69 100.0
Group I
.98% 11 28.2 10 24.4
90–98% 4 10.3 4 9.8
80–90% 4 10.3 4 9.8
Total 39 100.0 41 100.0
Group II
.98% 1 33.3 3 37.5
90–98% 1 33.3 2 25.0
80–90% 0 0 0 0
Total 3 100.0 8 100.0
Archaeal
.98% 4 28.6 3 50.0
90–98% 7 50.0 2 33.3
80–90% 1 7.1 0 0
Total 14 100.0 6 100.0
Spliceosomal
.98% 0 0 1 7.1
90–98% 2 9.1 4 28.6
80–90% 7 31.8 1 7.1
Total 22 100.0 14 100.0
B. Number of intron-containing positions adjacent to highly conserved
(.98%) rRNA residues
Distance to nearest conserved rRNA position:
All rRNA residues
Adjacent to at least one 274 17.8 240 8.3
Within 2 nt 351 22.8 308 10.6
Within 3 nt 417 27.1 359 12.4
All introns
Adjacent to at least one 30 35.7 22 31.9
Within two nt 41 48.8 33 47.8
Within three nt 50 59.5 40 57.9
Flanked by two 6 7.1 9 13.0
Group I
Adjacent to at least one 19 48.7 12 29.3
Within 2 nt 24 61.5 19 46.4
Within 3 nt 29 74.4 22 53.7
Flanked by two 4 10.3 4 9.8
Group II
Adjacent to at least one 1 33.3 4 50.0
Within 2 nt 3 100.0 6 75.0
Within 3 nt 3 100.0 7 87.5
Flanked by two 0 0 2 25.0
Archaeal
Adjacent to at least one 7 50.0 4 66.7
Within 2 nt 8 57.1 4 66.7
Within 3 nt 8 57.1 4 66.7
Flanked by two 1 7.1 3 50.0
(continued)
Table 2 Continued
16 S 23 S
nt %Total nt %Total
Spliceosomal
Adjacent to at least one 1 4.5 2 14.3
Within 2 nt 4 18.2 4 28.6
Within 3 nt 7 31.8 7 50.0
Flanked by two 0 0 0 0
a
Conservation levels were calculated based on a large
sample of archaeal, bacterial, and eukaryotic nuclear, mito-
chondrial and chloroplast rRNA sequences.5
The three conserva-
tion ranges (.98%, 90–98% and 80–90%) were determined for
positions that are present in more than 95% of all sequences.
Figure 6. Proxmity of intron sites to tRNA bound in
the 70 S ribosome. The total number of rRNA positions
within a given cutoff distance of tRNA bound in the A,
P or E site of the 70 S ribosome was calculated as
described in Methods. Intron-containing positions were
weighted by the number of rRNA sequences with an
intron at that position. Continuous line, all 16 S rRNA
positions; long dash, 16 S introns; medium dash, 23 S
introns; short dash, all 23 S rRNA positions.
rRNA Introns in 3D 45
particle. All the same, introns are found in 16 S
rRNA positions that are among the least accessible
in the 30 S crystal structure.
The similarity between the ASA values for all
rRNA positions and those containing introns
remained constant when spheres of radius 1.4 A˚
and 5 A˚ were used to compute the ASA. We also
observed no difference between the accessibilities
of intron insertion sites and average rRNA nucleo-
tides in the absence of ribosomal proteins,
although the mean accessibility of all residues
increased. Taken together, these data suggest that
intron insertion sites are not exclusively found in
residues that are highly exposed in the static
structures of the 30 S and 50 S ribosomes. It is pos-
sible, however, that many intron-containing
positions are exposed by conformational changes
during translation or during assembly of ribosomal
particles.
Reverse splicing target sites
Previous work showed that the Tetrahymena
group I intron can integrate partially or fully into
many sites in the 23 S rRNA by reverse splicing
when overexpressed in E. coli cells.47,48
An interest-
ing question is whether experimentally observed
reverse splicing targets are clustered in the same
rRNA regions as naturally occurring introns. In
contrast to the sites of natural introns, sites that
react with the Tetrahymena intron are not only
found on the subunit interface but also lie on the
“back” face of the 50 S ribosome, away from the
30 S subunit (Figure 8). The reaction sites are less
tightly clustered with each other than the positions
of natural introns (Figure 4) and not localized
around the tRNA binding sites. They were also
slightly less solvent-accessible than average,
suggesting that accessibility is not a prerequisite
for insertion of introns by reverse splicing. The
difference in the spatial distribution of naturally
occurring introns and reverse splicing targets
suggests that other factors determine the positions
of rRNA introns, such as selective retention after
intron insertion or protein-dependent transposition
mechanisms.
Discussion
The extensive rRNA intron sequence collection
available at the Comparative RNA Web (CRW)
Site5
is the foundation of our investigation of the
distribution of introns within the 16 S and 23 S
rRNAs. Positions of the rRNA that are presently
known to contain introns were mapped to the
secondary structures of the rRNA determined
from comparative sequence analysis3,5,37
and
the three-dimensional structures of the 30 S, 50 S
and 70 S ribosomes determined by X-ray
crystallography.28 – 31
Our results, which are based on the analysis of
more than 1250 rRNA intron sequences, reveal
that introns are not inserted randomly within the
16 S and 23 S rRNA genes. Instead, they are
clustered near tRNA binding sites, which span the
interface between the small and large subunits of
the 70 S ribosome. Surprisingly, intron-containing
residues are not more solvent-accessible in the
30 S and 50 S subunits than typical rRNA residues.
Thus, while the distribution of introns is coupled
to the sequence and structure of the rRNA, it does
not directly correlate with the accessibility of
the splice junction in static three-dimensional
structures of the ribosome.
Horizontal transfer of intron sequences
Comparative analyses of intron sequences
within specific phylogenetic groups have con-
cluded that introns at the same position of the
rRNA are usually more closely related to one
another than introns at different positions.13,14,35,49–51
Figure 7. Surface accessibility of intron positions in
mature ribosomal subunits. The ASA of 30
oxygen atoms
in the 50 S and 30 S ribosomal subunits was calculated
as described in Methods. Histograms show the percen-
tage of rRNA positions with a given ASA value. Shaded
bars, intron-containing positions; white bars, all rRNA
positions. (a) 16 S rRNA in the 30 S subunit. (b) 23 S
rRNA in the 50 S subunit.
46 rRNA Introns in 3D
This has led to the view that some introns are
retained for long periods of evolution after their
initial insertion into a specific position in the rRNA
gene. On the other hand, introns are periodically
lost from certain lineages.13,14
More rarely, they are
regained or transferred to a new lineage.15 – 19,36
Hence, intron mobility may be required to ensure
survival and the spread of intron sequences to
new organisms.10
Besides ensuring intron retention, frequent
deletion and reinsertion of intron sequences may
result in different phylogenetic trees for rRNA
exons and introns, as observed for the group IC1
intron in the rDNA of Tetrahymena.52
This type of
dynamic flux has been invoked to explain
sequence heterogeneity of non-LTR retrotranspo-
sons in insect rDNA.53
As most insertion events
recur at the same site in the rRNA with only
occasional transposition to a new site,54
the
position of the retroelement in the rRNA is main-
tained, although its sequence may vary. If introns
are continually deleted and reinserted, their final
distribution will depend heavily on the selectivity
of retention and the target-specificity of the inser-
tion mechanism. These factors are discussed below.
Target-site selection during intron transposition
Many group I and group II introns and some
archaeal introns have acquired open reading
frames that encode homing DNA endonucleases.10
Although these endonucleases usually recognize
the same position of an intron-less allele of the
same gene, cross-reaction with related sequences
could enable transfer of the intron to homologous
positions in the rRNA genes of a different organ-
ism or occasional transposition to ectopic sites.55 – 57
Because of the large recognition sites of most
homing endonucleases,58
this process would be
more effective in conserved regions of the rRNA,
contributing to the prevalence of introns at those
sites.
The feasibility of intron transfer was demon-
strated by integration of group I introns from
Physarum polycephalum and Tetrahymena thermophila
into the rDNA of yeast upon co-expression of
the I-PpoI endonuclease.56
Because the rRNA
sequences flanking the intron are nearly identical
in these species, the introns are spliced correctly in
yeast.59
Similarly, reverse transcriptases encoded
by group II introns normally promote movement
of the intron to an allelic site, but in some circum-
stances can transpose the intron to an ectopic
site.60 – 63
Although the substrate specificity of proteins
associated with intron mobility might account for
the prevalence of introns in conserved regions of
the rRNA, this mechanism of intron transposition
is inconsistent with the sequence data. First, the
majority of rRNA introns do not contain open
reading frames that encode endonucleases.5
At
some sites that contain many introns, such as 1506
in the 16 S rRNA, only a few of the extant introns
have open reading frames (ORFs) greater than
500 nt.5
While all introns may have contained
ORFs at one time that were subsequently deleted
by non-homologous recombination, only a few
cases of vestigial ORFs have been documented in
rRNA introns (in Naegleria64
and in the order
Bangiales14
). Second, this mechanism requires that
Figure 8. Locations of reverse splicing targets in E. coli 23 S rRNA. Integration sites of the Tetrahymena group I
intron RNA when overexpressed in E. coli47,48
are indicated by white spheres on the crown (left) and back (right)
views of the 50 S subunit.28
Large spheres indicate frequent reaction sites. The 23 S rRNA is shown as a ribbon colored
by domain: blue, domain I; cyan, domain II; yellow, domain III; green, domain IV; red, domain V; pink, domain VI;
magenta, 5 S rRNA. Proteins are not shown.
rRNA Introns in 3D 47
the endonuclease be properly expressed in the
recipient organism, which could be problematic in
cases of intron transfer between organelles.16
Transposition via reverse splicing
An alternative to protein-dependent mecha-
nisms is that introns integrate into rRNAs by
reverse splicing.46
This could lead to transposition
of the intron to novel sites in the genome, follow-
ing reverse transcription and recombination with
the intron-containing cDNA. An appealing aspect
of this mechanism is that integration of the intron
into the rRNA would be immediately sensitive to
the conformation of the rRNA.24
Another advan-
tage is that group I reverse splicing is much less
site-specific than reactions catalyzed by homing
endonucleases.48
Despite the appeal of RNA-based intron trans-
position, the biochemical support for such a
mechanism is weak. Reverse splicing of the
Tetrahymena group I intron into rRNA in E. coli
was observed by RT-PCR,65
but stable integration
of a group I intron by reverse splicing has not yet
been demonstrated. Group II introns also reverse
splice, but the target is allelic DNA rather than
RNA.66
In addition, we find no correlation between
the frequency of introns and solvent accessibility of
the rRNA insertion site. Thus, if reverse splicing
contributes significantly to the spectrum of introns
found in rRNA genes, then the intron must react
with pre-rRNA that is not fully assembled into a
ribosomal complex or the target rRNA must transi-
ently unfold. Finally, reverse splicing products of
the Tetrahymena intron in the E. coli 23 S rRNA
are distributed differently than natural rRNA
introns with respect to the structure of the 50 S sub-
unit. This suggests that the ability of certain
sequences to serve as reverse splicing substrates is
not sufficient to describe the actual distribution of
rRNA introns.
Retention of introns in dynamic regions of
the rRNA
An alternative possibility is that introns are
inserted randomly within the rRNA genes, but
retained only at certain positions. Many random
integration products are expected to be unstable,
either because of recombination with cDNA or
other rDNA repeats or simply because the intron
is not spliced effectively from its new location,
inactivating the rRNA. When a homing endo-
nuclease is continually expressed, aggressive
homing will permit even deleterious rRNA inser-
tions to be maintained.4
However, once the ORF is
not expressed (as in most rRNA introns), introns
that reduce fitness will be lost. Accurate splicing,
degradation of the excised intron RNA (to prevent
cleavage of the spliced exons)67
and proper folding
of the rRNA are all necessary to maintain host cell
functions.
It is interesting to consider whether introns are
more easily retained in positions near the subunit
interface and tRNA binding sites because these
nucleotides undergo structural changes during
translation.68
Flexible sites should accommodate
introns more easily than rigid sites, because in
many cases the secondary structure of the mature
rRNA must unfold to permit splice site
recognition.24,26
Conversely, the presence of highly
structured introns may either interfere less
severely with ribosome assembly or even enhance
folding of the rRNA.
A striking example of introns in a structurally
dynamic region of the rRNA occurs in domain IV
of the 23 S rRNA. Helix 69 (nt 1905–1920) is dis-
ordered in crystals of the 50 S ribosome, suggesting
that it remains mobile under these conditions.28,30
In the 70 S ribosome, this stem–loop and adjacent
sequences form an important bridge to the decod-
ing site in the 30 S subunit.31
Many group I introns
and a few archaeal introns are found between
positions 1915 and 1932 of the 23 S rRNA (Figure
3(a)). Experiments on the group I intron of
Tetrahymena (position 1925) have shown that the
conformation of the rRNA exons is intimately
linked to self-splicing activity.25,69
Similarly, refold-
ing of helix 69 is required for excision of the
archaeal intron at position 1927 in Staphylothermus
marinus.26
In addition to these examples, many other
intron-containing sites are in mobile regions of the
rRNA. For example, the L11-binding region of
the 23 S rRNA is disordered in crystals of 50 S
ribosomes,28,30
and is also highly mobile.31
Within
this region (nt 1050–1108), five sites contain group
I introns, and one site contains an archaeal intron.
Positions 2449 and 2500 in the 23 S rRNA are part
of the peptidyl transferase center and are thought
to undergo conformational changes during ribo-
some translocation.70
In the 16 S rRNA, many
introns are inserted in the 790 loop and in the 30
terminal helix (positions 1506, 1512 and 1516),
which forms an inter-subunit bridge near the
P-site tRNA.31
Position 516 contains a large number
of group I introns; this residue is part of the con-
served 530 loop that undergoes conformational
changes during mRNA decoding.71,72
Regulatory role for introns?
Finally, pre-rRNA splicing may even be an
additional mechanism for regulating ribosome
activity, providing another link between RNA
processing and translation. As introns are not
universally present among all members of a phylo-
genetic group, their function cannot be essential to
translation. Yet, they may provide useful regu-
latory functions under the growth conditions
encountered by specific organisms. Regulation
could occur during or after subunit assembly.
First, the presence of an intron in the pre-ribosomal
RNA could arrest ribosome assembly, either by
sterically blocking tertiary contacts (e.g. in the
48 rRNA Introns in 3D
interior of the 50 S subunit), or by altering local
secondary structures.24
As introns are rarely found
in ribosomal protein binding sites, unspliced
introns would affect protein binding only
indirectly via structural changes in the rRNA.
Second, introns could be required for modification
of the rRNA in some organisms, as in some
tRNAs.73,74
Third, introns could inhibit the trans-
lational activity of nascent subunits. By analogy to
proenzymes, splicing would be required for acti-
vation. Introns inserted in residues at the inter-
subunit interface, for example, might not prevent
assembly of individual subunits, but would
prevent the subunit from forming productive 70 S
ribosomes.75
In summary, our analysis confirms the statistical
correlation between the positions of rRNA introns
and the structure and conservation of the 16 S and
23 S rRNAs. It is now apparent that this bias results
in the preferential location of introns in conserved
rRNA residues that are clustered around tRNA
binding sites in the three-dimensional structure of
the 70 S ribosome. At present, it is difficult to
rationalize the rRNA intron distribution based
only on the selectivity of known intron insertion
mechanisms. This suggests that the extant
positions are selected by either a need to maintain
efficient splicing or the stringent requirements of
pre-rRNA maturation and ribosome assembly.
Both of these effects may favor introns in dynamic
regions of the rRNA.
Methods
rRNA intron database
The rRNA intron sequences were collected, analyzed,
and assembled into the relational database for the CRW
project, as described in detail.5
Only sequences that are
more than 90% complete and publicly available are
included in the database. As of February 2002, the
database contained 8865 rRNA sequences and 1285
annotated rRNA intron sequences. The positions of
rRNA introns are numbered according to the nucleotide
50
of the intron in the sequence of the E. coli 16 S or 23 S
rRNA (GenBank accession number J01695). Our determi-
nation of the intron types utilized the descriptions of
sequence and secondary structure of group I,6
group
IE,33
group II,45
spliceosomal34
and archaeal introns.26
The spliceosomal introns noted here are defined by
homology to pre-mRNA introns;9
the biochemical
mechanism of splicing has not yet been established.
Intron-containing rRNA positions in three-
dimensional ribosome structures
Structural data were obtained from the Protein Data
Bank (PDB).† Introns in 16 S or 16 S-like rRNA genes
were mapped onto the three-dimensional structure of
the T. thermophilus 30 S ribosome (1FJF).29
Introns in 23 S
or 23 S-like rRNA genes were mapped on the structure
of the H. marismortui 50 S ribosome (1JJ2).76
Three intron
positions (1956, 1958 and 1962) lie in regions of the 23 S
rRNA that are not visible in the electron density map.76
These were denoted with spheres at nearby residues:
1956 shown as 1950, 1958 as 1951, 1962 as 1965. Intron
positions were represented by a CPK rendering of the
30
-hydroxyl oxygen atom of the rRNA residue preceding
the intron insertion site. Molecular graphics analyses
were carried out using Insight II (Molecular Simulations
Inc.). Figures were prepared using Ribbons77
and Insight II.
Spatial distribution of rRNA positions
The spatial distribution of rRNA residues in the
T. thermophilus 70 S ribosome was determined from the
atomic coordinates of phosphorus atoms (1GIY and
1GIX).31
For each group of rRNA residues analyzed, we
computed the center of mass and the standard deviation
of the x, y and z coordinates. The “cluster score” is the
sum of the x, y and z standard deviations of the x, y and
z coordinates. A lower cluster score represents a tighter
three-dimensional clustering of nucleotides. Intron inser-
tions were represented by the 50
phosphate group of the
nucleotide 30
of the intron. To compute clustering of
intron sequences, all sequence entries in the CRW rRNA
intron database were used. To compute the clustering of
intron and rRNA positions, each rRNA position was
counted once, regardless of the number of intron
sequences that correspond to an insertion at that
position. Values for the 30 S and 50 S subunits were
computed using only residues from the 16 S or 23 S
RNAs, respectively.
Distance from tRNA binding sites
Subsets of rRNA positions were defined based on
their distance from the combined A, P and E site tRNAs
in the T. thermophilus 70 S ribosome (1GIY and 1GIX).
Distances were computed from the atomic coordinates
of the phosphorus atoms of the rRNAs and the bound
tRNAs,31
or the bound tRNAs and mRNA (see
website).42
† Distance cut-offs were set between 5 and
100 A˚ ; all rRNA phosphate groups are within 120 A˚ of
the tRNAs. The total number of rRNA residues and the
number of intron-containing sequences within the subset
defined by each distance cutoff are plotted in Figure 6.
These rRNA positions are listed on the website associ-
ated with this work (see below).
Solvent-accessibility
The ASA was computed for each atom in the 50 S
(1JJ2) and 30 S (1FJF) ribosomes, using the program
Calc-surf.78
A sphere of radius 1.40 A˚ was chosen to
simulate solvation by a water molecule. The surface
areas shown in Figure 7 were computed for each entire
subunit with protein side-chains, but without water
molecules and heteroatoms. The ASA values for 30
oxygen atoms were binned and plotted in Figure 7.
Further information available on website
Copies of the manuscript Figures and Tables and
additional Figures and Tables related to this work are
available from the CRW Site‡. The intron sites on the
† http://www.rcsb.org/pdb/
‡ http://www.rna.icmb.utexas.edu/ANALYSIS/
INT3D/
rRNA Introns in 3D 49
16 S and 23 S rRNAs (as of February 2002) and related
information are shown on three sets of secondary
structure diagrams (PostScript and PDF formats) and
images of the three-dimensional crystal structures of the
ribosome.
Five dimensions of information are highlighted in this
presentation: (A) intron types (i.e. group I, group II,
archaeal, spliceosomal, and unknown); (B) the number
of introns per site (for 3D images only); (C) rRNA sites
within 10.0 A˚ of tRNA/mRNA; (D) extent of conserva-
tion of the 16 S and 23 S rRNAs; and (E) structural
domains from the 16 S and 23 S rRNA secondary
structure models are distinguished. The three-dimen-
sional images can be viewed interactively using
RasMol.79
Data tables include the source data for Tables
1 and 2, as well as additional data not discussed here.
Acknowledgements
This work was supported by grants from the
NIH (GM48207 to R.R.G. and GM46866 to S.A.W.),
NSF (MCB-0110252 to R.R.G.), the Welch Foun-
dation (F-1427 to R.R.G.) and The Institute for
Cellular and Molecular Biology at The University
of Texas at Austin (to R.R.G.).
References
1. Turmel, M., Mercier, J. P. & Cote, M. J. (1993). Group I
introns interrupt the chloroplast psaB and psbC and
the mitochondrial rrn L gene in Chlamydomonas.
Nucl. Acids Res. 21, 5242–5250.
2. Gerbi, S. A., Gourse, R. L. & Clark, C. G. (1982).
Conserved regions within ribosomal DNA: location
and some possible functions. In The Cell Nucleus
(Busch, H. & Rothblum, L., eds), vol. 10, pp.
351–386, Academic Press, New York.
3. Noller, H. F., Kop, J., Wheaton, V., Brosius, J., Gutell,
R. R., Kopylov, A. M. et al. (1981). Secondary struc-
ture model for 23 S ribosomal RNA. Nucl. Acids Res.
9, 6167–6189.
4. Johansen, S., Muscarella, D. E. & Vogt, V. M. (1996).
Insertion elements in ribosomal DNA. In Ribosomal
RNA: Structure, Evolution, Processing, and Function in
Protein Biosynthesis (Zimmermann, R. A. & Dahlberg,
A. E., eds), pp. 89–108, CRC Press, Boca Raton, FL.
5. Cannone, J. J., Subramanian, S., Schnare, M. N.,
Collett, J. R., D’Souza, L. M., Du, Y. et al. (2002). The
Comparative RNA Web (CRW) site: an online data-
base of comparative sequence and structure infor-
mation for ribosomal, intron, and other RNAs. BMC
Bioinformatics, 3, 2.
6. Michel, F. & Westhof, E. (1990). Modelling of the
three-dimensional architecture of group I catalytic
introns based on comparative sequence analysis.
J. Mol. Biol. 216, 585–610.
7. Michel, F., Umesono, K. & Ozeki, H. (1989). Com-
parative and functional anatomy of group II catalytic
introns–a review. Gene, 82, 5–30.
8. Lykke-Andersen, J., Aagaard, C., Semionenkov, M. &
Garrett, R. A. (1997). Archaeal introns: splicing,
intercellular mobility and evolution. Trends Biochem.
Sci. 22, 326–331.
9. Cubero, O. F., Bridge, P. D. & Crespo, A. (2000).
Terminal-sequence conservation identifies spliceosomal
introns in ascomycete 18 S RNA genes. Mol. Biol. Evol.
17, 751–756.
10. Belfort, M. & Perlman, P. S. (1995). Mechanisms of
intron mobility. J. Biol. Chem. 270, 30237–30240.
11. Paquin, B., Kathe, S. D., Nierzwicki-Bauer, S. A. &
Shub, D. A. (1997). Origin and evolution of group I
introns in cyanobacterial tRNA genes. J. Bacteriol.
179, 6798–6806.
12. Bhattacharya, D., Surek, B., Rusing, M., Damberger,
S. & Melkonian, M. (1994). Group I introns are
inherited through common ancestry in the nuclear-
encoded rRNA of Zygnematales (Charophyceae).
Proc. Natl Acad. Sci. USA, 91, 9916–9920.
13. De Jonckheere, J. F. (1994). Evidence for the ancestral
origin of group I introns in the SSUrDNA of Naegleria
spp. J. Eukaryot. Microbiol. 41, 457–463.
14. Muller, K. M., Cannone, J. J., Gutell, R. R. & Sheath,
R. G. (2001). A structural and phylogenetic analysis
of the group IC1 introns in the order Bangiales
(Rhodophyta). Mol. Biol. Evol. 18, 1654–1667.
15. Biniszkiewicz, D., Cesnaviciene, E. & Shub, D. A.
(1994). Self-splicing group I intron in cyanobacterial
initiator methionine tRNA: evidence for lateral trans-
fer of introns in bacteria. EMBO J. 13, 4629–4635.
16. Lonergan, K. M. & Gray, M. W. (1994). The ribosomal
RNA gene region in Acanthamoeba castellanii mito-
chondrial DNA. A case of evolutionary transfer of
introns between mitochondria and plastids? J. Mol.
Biol. 239, 476–499.
17. Turmel, M., Cote, V., Otis, C., Mercier, J. P., Gray,
M. W., Lonergan, K. M. & Lemieux, C. (1995).
Evolutionary transfer of ORF-containing group I
introns between different subcellular compartments
(chloroplast and mitochondrion). Mol. Biol. Evol. 12,
533–545.
18. Hibbett, D. S. (1996). Phylogenetic evidence for hori-
zontal transmission of group I introns in the nuclear
ribosomal DNA of mushroom-forming fungi. Mol.
Biol. Evol. 13, 903–917.
19. Bhattacharya, D., Cannone, J. J. & Gutell, R. R. (2001).
Group I intron lateral transfer between red and
brown algal ribosomal RNA. Curr. Genet. 40, 82–90.
20. Dujon, B. (1989). Group I introns as mobile genetic
elements: facts and mechanistic speculations–a
review. Gene, 82, 91–114.
21. Woese, C. R. (1987). Bacterial evolution. Microbiol.
Rev. 51, 221–271.
22. Benson, D. A., Karsch-Mizrachi, I., Lipman, D. J.,
Ostell, J., Rapp, B. A. & Wheeler, D. L. (2002).
GenBank. Nucl. Acids Res. 30, 17–20.
23. Turmel, M., Gutell, R. R., Mercier, J. P., Otis, C. &
Lemieux, C. (1993). Analysis of the chloroplast large
subunit ribosomal RNA gene from 17 Chlamydomonas
taxa. Three internal transcribed spacers and 12 group
I intron insertion sites. J. Mol. Biol. 232, 446–467.
24. Woodson, S. A. & Cech, T. R. (1991). Alternative
secondary structures in the 50
exon affect both for-
ward and reverse self-splicing of the Tetrahymena
intervening sequence RNA. Biochemistry, 30,
2042–2050.
25. Rocheleau, G. A. & Woodson, S. A. (1995). Enhanced
self-splicing of Physarum polycephalum intron 3 by a
second group I intron. RNA, 1, 183–193.
26. Kjems, J. & Garrett, R. A. (1991). Ribosomal RNA
introns in archaea and evidence for RNA confor-
mational changes associated with splicing. Proc. Natl
Acad. Sci. USA, 88, 439–443.
50 rRNA Introns in 3D
27. Kuo, T. C. & Herrin, D. L. (2000). A kinetically
efficient form of the Chlamydomonas self-splicing
ribosomal RNA precursor. Biochem. Biophys. Res.
Commun. 273, 967–971.
28. Ban, N., Nissen, P., Hansen, J., Moore, P. B. & Steitz,
T. A. (2000). The complete atomic structure of the
large ribosomal subunit at 2.4 A˚ resolution. Science,
289, 905–920.
29. Wimberly, B. T., Brodersen, D. E., Clemons, W. M., Jr,
Morgan-Warren, R. J., Carter, A. P. et al. (2000).
Structure of the 30 S ribosomal subunit. Nature, 407,
327–339.
30. Schluenzen, F., Tocilj, A., Zarivach, R., Harms, J.,
Gluehmann, M., Janell, D. et al. (2000). Structure of
functionally activated small ribosomal subunit at
3.3 A˚ resolution. Cell, 102, 615–623.
31. Yusupov, M. M., Yusupova, G. Z., Baucom, A.,
Lieberman, K., Earnest, T. N., Cate, J. H. & Noller,
H. F. (2001). Crystal structure of the ribosome at
5.5 A resolution. Science, 292, 883–896.
32. Brosius, J., Dull, T. J., Sleeter, D. D. & Noller, H. F.
(1981). Gene organization and primary structure of
a ribosomal RNA operon from Escherichia coli. J. Mol.
Biol. 148, 107–127.
33. Suh, S. O., Jones, K. G. & Blackwell, M. (1999).
A group I intron in the nuclear small subunit rRNA
gene of Cryptendoxyla hypophloia, an ascomycetous
fungus: evidence for a new major class of group I
introns. J. Mol. Evol. 48, 493–500.
34. Bhattacharya, D., Lutzoni, F., Reeb, V., Simon, D.,
Nason, J. & Fernandez, F. (2000). Widespread occur-
rence of spliceosomal introns in the rDNA genes of
ascomycetes. Mol. Biol. Evol. 17, 1971–1984.
35. Nikoh, N. & Fukatsu, T. (2001). Evolutionary
dynamics of multiple group I introns in nuclear ribo-
somal RNA genes of endoparasitic fungi of the
genus Cordyceps. Mol. Biol. Evol. 18, 1631–1642.
36. Everett, K. D., Kahane, S., Bush, R. M. & Friedman,
M. G. (1999). An unspliced group I intron in 23 S
rRNA links Chlamydiales, chloroplasts, and mito-
chondria. J. Bacteriol. 181, 4734–4740.
37. Woese, C. R., Magrum, L. J., Gupta, R., Siegel, R. B.,
Stahl, D. A., Kop, J. et al. (1980). Secondary structure
model for bacterial 16 S ribosomal RNA: phylo-
genetic, enzymatic and chemical evidence. Nucl.
Acids Res. 8, 2275–2293.
38. Joseph, S., Weiser, B. & Noller, H. F. (1997). Mapping
the inside of the ribosome with an RNA helical
ruler. Science, 278, 1093–1098.
39. Cate, J. H., Yusupov, M. M., Yusupova, G. Z., Earnest,
T. N. & Noller, H. F. (1999). X-ray crystal structures of
70 S ribosome functional complexes. Science, 285,
2095–2104.
40. Skold, S. E. (1983). Chemical crosslinking of
elongation factor G to the 23 S RNA in 70 S ribo-
somes from Escherichia coli. Nucl. Acids Res. 11,
4923–4932.
41. Moazed, D., Robertson, J. M. & Noller, H. F. (1988).
Interaction of elongation factors EF-G and EF-Tu
with a conserved loop in 23 S RNA. Nature, 334,
362–364.
42. Yusupova, G. Z., Yusupov, M. M., Cate, J. H. &
Noller, H. F. (2001). The path of messenger RNA
through the ribosome. Cell, 106, 233–241.
43. Davies, R. W., Waring, R. B., Ray, J. A., Brown, T. A.
& Scazzocchio, C. (1982). Making ends meet: a
model for RNA splicing in fungal mitochondria.
Nature, 300, 719–724.
44. Michel, F., Jacquier, A. & Dujon, B. (1982). Compari-
son of fungal mitochondrial introns reveals extensive
homologies in RNA secondary structure. Biochimie,
64, 867–881.
45. Michel, F. & Ferat, J. L. (1995). Structure and activi-
ties of group II introns. Annu. Rev. Biochem. 64,
435–461.
46. Woodson, S. A. & Cech, T. R. (1989). Reverse self-
splicing of the Tetrahymena group I intron: impli-
cation for the directionality of splicing and for intron
transposition. Cell, 57, 335–345.
47. Roman, J. & Woodson, S. A. (1998). Integration of the
Tetrahymena group I intron into bacterial rRNA by
reverse splicing in vivo. Proc. Natl Acad. Sci. USA,
95, 2134–2139.
48. Roman, J., Rubin, M. N. & Woodson, S. A. (1999).
Sequence specificity of in vivo reverse splicing of
the Tetrahymena group I intron. RNA, 5, 1–13.
49. Bhattacharya, D., Friedl, T. & Damberger, S. (1996).
Nuclear-encoded rDNA group I introns: origin and
phylogenetic relationships of insertion site lineages
in the green algae. Mol. Biol. Evol. 13, 978–989.
50. DePriest, P. T. (1993). Small subunit rDNA variation
in a population of lichen fungi due to optional
group-I introns. Gene, 134, 67–74.
51. Schroeder-Diedrich, J. M., Fuerst, P. A. & Byers, T. J.
(1998). Group-I introns with unusual sequences
occur at three sites in nuclear 18 S rRNA genes of
Acanthamoeba lenticulata. Curr. Genet. 34, 71–78.
52. Sogin, M. L., Ingold, A., Karlok, M., Nielsen, H. &
Engberg, J. (1986). Phylogenetic evidence for the
acquisition of ribosomal RNA introns subsequent to
the divergence of some of the major Tetrahymena
groups. EMBO J. 5, 3625–3630.
53. Perez-Gonzalez, C. E. & Eickbush, T. H. (2001).
Dynamics of R1 and R2 elements in the rDNA locus
of Drosophila simulans. Genetics, 158, 1557–1567.
54. Munoz, E., Villadas, P. J. & Toro, N. (2001). Ectopic
transposition of a group II intron in natural bacterial
populations. Mol. Microbiol. 41, 645–652.
55. Bryk, M., Belisle, M., Mueller, J. E. & Belfort, M.
(1995). Selection of a remote cleavage site by I-tevI,
the td intron-encoded endonuclease. J. Mol. Biol.
247, 197–210.
56. Muscarella, D. E. & Vogt, V. M. (1993). A mobile
group I intron from Physarum polycephalum can insert
itself and induce point mutations in the nuclear ribo-
somal DNA of Saccharomyces cerevisiae. Mol. Cell. Biol.
13, 1023–1033.
57. Edgell, D. R., Belfort, M. & Shub, D. A. (2000).
Barriers to intron promiscuity in bacteria. J. Bacteriol.
182, 5281–5289.
58. Belfort, M. & Roberts, R. J. (1997). Homing endo-
nucleases: keeping the house in order. Nucl. Acids
Res. 25, 3379–3388.
59. Lin, J. & Vogt, V. M. (1998). I-PpoI, the endonuclease
encoded by the group I intron PpLSU3, is expressed
from an RNA polymerase I transcript. Mol. Cell.
Biol. 18, 5809–5817.
60. Zimmerly, S., Guo, H., Perlman, P. S. & Lambowitz,
A. M. (1995). Group II intron mobility occurs by tar-
get DNA-primed reverse transcription. Cell, 82,
545–554.
61. Eskes, R., Yang, J., Lambowitz, A. M. & Perlman, P. S.
(1997). Mobility of yeast mitochondrial group II
introns: engineering a new site specificity and retro-
homing via full reverse splicing. Cell, 88, 865–874.
rRNA Introns in 3D 51
62. Cousineau, B., Lawrence, S., Smith, D. & Belfort, M.
(2000). Retrotransposition of a bacterial group II
intron. Nature, 404, 1018–1021.
63. Guo, H., Karberg, M., Long, M., Jones, J. P., 3rd,
Sullenger, B. & Lambowitz, A. M. (2000). Group II
introns designed to insert into therapeutically rele-
vant DNA target sites in human cells. Science, 289,
452–457.
64. Vader, A., Naess, J., Haugli, K., Haugli, F. &
Johansen, S. (1994). Nucleolar introns from Physarum
flavicomum contain insertion elements that may
explain how mobile group I introns gained their
open reading frames. Nucl. Acids Res. 22, 4553–4559.
65. Roman, J. & Woodson, S. A. (1998). Integration of the
Tetrahymena group I intron into bacterial rRNA by
reverse splicing in vivo. Proc. Natl Acad. Sci. USA,
95, 2134–2139.
66. Yang, J., Zimmerly, S., Perlman, P. S. & Lambowitz,
A. M. (1996). Efficient integration of an intron RNA
into double-stranded DNA by reverse splicing.
Nature, 381, 332–335.
67. Margossian, S. P., Li, H., Zassenhaus, H. P. & Butow,
R. A. (1996). The DExH box protein Suv3p is a
component of a yeast mitochondrial 30
- to 50
-exoribo-
nuclease that suppresses group I intron toxicity. Cell,
84, 199–209.
68. Wilson, K. S. & Noller, H. F. (1998). Molecular
movement inside the translational engine. Cell, 92,
337–349.
69. Woodson, S. A. & Emerick, V. L. (1993). An alterna-
tive helix in the 26 S rRNA promotes excision and
integration of the Tetrahymena intervening sequence.
Mol. Cell. Biol. 13, 1137–1145.
70. Polacek, N., Patzke, S., Nierhaus, K. H. & Barta, A.
(2000). Periodic conformational changes in rRNA:
monitoring the dynamics of translating ribosomes.
Mol. Cell, 6, 159–171.
71. Powers, T. & Noller, H. F. (1994). Selective pertur-
bation of G530 of 16 S rRNA by translational miscod-
ing agents and a streptomycin-dependence mutation
in protein S12. J. Mol. Biol. 235, 156–172.
72. Lodmell, J. S. & Dahlberg, A. E. (1997). A confor-
mational switch in Escherichia coli 16 S ribosomal
RNA during decoding of messenger RNA. Science,
277, 1262–1267.
73. Strobel, M. C. & Abelson, J. (1986). Effect of
intron mutations on processing and function of
Saccharomyces cerevisiae SUP53 tRNA in vitro and in
vivo. Mol. Cell. Biol. 6, 2663–2673.
74. Choffat, Y., Suter, B., Behra, R. & Kubli, E. (1988).
Pseudouridine modification in the tRNA(Tyr) anti-
codon is dependent on the presence, but indepen-
dent of the size and sequence, of the intron in
eucaryotic tRNA(Tyr) genes. Mol. Cell. Biol. 8,
3332–3337.
75. Nikolcheva, T. & Woodson, S. A. (1997). Association
of a group I intron with its splice junction in 50 S
ribosomes: implications for intron toxicity. RNA, 3,
1016–1027.
76. Klein, D. J., Schmeing, T. M., Moore, P. B. & Steitz,
T. A. (2001). The kink-turn: a new RNA secondary
structure motif. EMBO J. 20, 4214–4221.
77. Carson, M. (1997). Ribbons. Methods Enzymol. 277B,
493–505.
78. Gerstein, M. (1992). A resolution-sensitive procedure
for comparing protein surfaces and its application to
the comparison of antigen-combining sites. Acta
Crystallog. sect. A, 48, 271–276.
79. Bernstein, H. J. (2000). Recent changes to RasMol,
recombining the variants. Trends Biochem. Sci. 25,
453–455.
Edited by J. Doudna
(Received 30 April 2002; received in revised form 15 August 2002; accepted 19 August 2002)
52 rRNA Introns in 3D

More Related Content

Viewers also liked

Muchas cosas para ti
Muchas cosas para tiMuchas cosas para ti
Muchas cosas para tiBecomecol
 
Vesna vulovic.pptx. horacio stiusso
Vesna vulovic.pptx. horacio stiussoVesna vulovic.pptx. horacio stiusso
Vesna vulovic.pptx. horacio stiussoAntonioCabrala
 
Convergencia multimedia de la información noticiosa y publicitaria de un medi...
Convergencia multimedia de la información noticiosa y publicitaria de un medi...Convergencia multimedia de la información noticiosa y publicitaria de un medi...
Convergencia multimedia de la información noticiosa y publicitaria de un medi...Alban Herlan
 
Inteligência Coletiva em Ambientes Corporativos
Inteligência Coletiva em Ambientes CorporativosInteligência Coletiva em Ambientes Corporativos
Inteligência Coletiva em Ambientes CorporativosElvis Fusco
 
iPhone Programming [2/17] : Introduction to iOS Programming
iPhone Programming [2/17] : Introduction to iOS ProgrammingiPhone Programming [2/17] : Introduction to iOS Programming
iPhone Programming [2/17] : Introduction to iOS ProgrammingIMC Institute
 
Tipos de aprendizaje
Tipos de aprendizajeTipos de aprendizaje
Tipos de aprendizajeDiegolopezkz
 
The Hunger
The  HungerThe  Hunger
The Hungerrlakis
 
Technical Excellence Award
Technical Excellence AwardTechnical Excellence Award
Technical Excellence AwardBhaumik Patel
 
Inteligência Coletiva Empresarial
Inteligência Coletiva EmpresarialInteligência Coletiva Empresarial
Inteligência Coletiva EmpresarialElvis Fusco
 
Rawnet Lightning Talk - Swift iOS Development
Rawnet Lightning Talk -  Swift iOS DevelopmentRawnet Lightning Talk -  Swift iOS Development
Rawnet Lightning Talk - Swift iOS DevelopmentRawnet
 
UVA Unofficial Transcript
UVA Unofficial TranscriptUVA Unofficial Transcript
UVA Unofficial TranscriptJodie Benjamin
 
Scott Transcript
Scott TranscriptScott Transcript
Scott TranscriptScott Allen
 

Viewers also liked (19)

Muchas cosas para ti
Muchas cosas para tiMuchas cosas para ti
Muchas cosas para ti
 
Vesna vulovic.pptx. horacio stiusso
Vesna vulovic.pptx. horacio stiussoVesna vulovic.pptx. horacio stiusso
Vesna vulovic.pptx. horacio stiusso
 
Convergencia multimedia de la información noticiosa y publicitaria de un medi...
Convergencia multimedia de la información noticiosa y publicitaria de un medi...Convergencia multimedia de la información noticiosa y publicitaria de un medi...
Convergencia multimedia de la información noticiosa y publicitaria de un medi...
 
Journal 9
Journal 9Journal 9
Journal 9
 
iFluids_Trainings 2016
iFluids_Trainings 2016iFluids_Trainings 2016
iFluids_Trainings 2016
 
Inteligência Coletiva em Ambientes Corporativos
Inteligência Coletiva em Ambientes CorporativosInteligência Coletiva em Ambientes Corporativos
Inteligência Coletiva em Ambientes Corporativos
 
iPhone Programming [2/17] : Introduction to iOS Programming
iPhone Programming [2/17] : Introduction to iOS ProgrammingiPhone Programming [2/17] : Introduction to iOS Programming
iPhone Programming [2/17] : Introduction to iOS Programming
 
Tipos de aprendizaje
Tipos de aprendizajeTipos de aprendizaje
Tipos de aprendizaje
 
C004
C004C004
C004
 
Open office pantallazos
Open office pantallazosOpen office pantallazos
Open office pantallazos
 
Trabajo tics
Trabajo ticsTrabajo tics
Trabajo tics
 
The Hunger
The  HungerThe  Hunger
The Hunger
 
On The Spot Award
On The Spot AwardOn The Spot Award
On The Spot Award
 
Technical Excellence Award
Technical Excellence AwardTechnical Excellence Award
Technical Excellence Award
 
Next Play Ground
Next Play GroundNext Play Ground
Next Play Ground
 
Inteligência Coletiva Empresarial
Inteligência Coletiva EmpresarialInteligência Coletiva Empresarial
Inteligência Coletiva Empresarial
 
Rawnet Lightning Talk - Swift iOS Development
Rawnet Lightning Talk -  Swift iOS DevelopmentRawnet Lightning Talk -  Swift iOS Development
Rawnet Lightning Talk - Swift iOS Development
 
UVA Unofficial Transcript
UVA Unofficial TranscriptUVA Unofficial Transcript
UVA Unofficial Transcript
 
Scott Transcript
Scott TranscriptScott Transcript
Scott Transcript
 

Similar to Gutell 084.jmb.2002.323.0035

bioinfo_6th_20070720
bioinfo_6th_20070720bioinfo_6th_20070720
bioinfo_6th_20070720sesejun
 
Lecture 9 10 dna-genes_chromosomes
Lecture 9 10 dna-genes_chromosomesLecture 9 10 dna-genes_chromosomes
Lecture 9 10 dna-genes_chromosomesAbo Ali
 
Msa & rooted/unrooted tree
Msa & rooted/unrooted treeMsa & rooted/unrooted tree
Msa & rooted/unrooted treeSamiul Ehsan
 
The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)theijes
 

Similar to Gutell 084.jmb.2002.323.0035 (9)

Sequence Analysis.ppt
Sequence Analysis.pptSequence Analysis.ppt
Sequence Analysis.ppt
 
bioinfo_6th_20070720
bioinfo_6th_20070720bioinfo_6th_20070720
bioinfo_6th_20070720
 
Day1 2 5
Day1 2 5Day1 2 5
Day1 2 5
 
Lecture 9 10 dna-genes_chromosomes
Lecture 9 10 dna-genes_chromosomesLecture 9 10 dna-genes_chromosomes
Lecture 9 10 dna-genes_chromosomes
 
Msa & rooted/unrooted tree
Msa & rooted/unrooted treeMsa & rooted/unrooted tree
Msa & rooted/unrooted tree
 
The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)
 
07
0707
07
 
Levitan
LevitanLevitan
Levitan
 
Bioinformatica t3-scoringmatrices v2014
Bioinformatica t3-scoringmatrices v2014Bioinformatica t3-scoringmatrices v2014
Bioinformatica t3-scoringmatrices v2014
 

More from Robin Gutell

Gutell 124.rna 2013-woese-19-vii-xi
Gutell 124.rna 2013-woese-19-vii-xiGutell 124.rna 2013-woese-19-vii-xi
Gutell 124.rna 2013-woese-19-vii-xiRobin Gutell
 
Gutell 123.app environ micro_2013_79_1803
Gutell 123.app environ micro_2013_79_1803Gutell 123.app environ micro_2013_79_1803
Gutell 123.app environ micro_2013_79_1803Robin Gutell
 
Gutell 122.chapter comparative analy_russell_2013
Gutell 122.chapter comparative analy_russell_2013Gutell 122.chapter comparative analy_russell_2013
Gutell 122.chapter comparative analy_russell_2013Robin Gutell
 
Gutell 121.bibm12 alignment 06392676
Gutell 121.bibm12 alignment 06392676Gutell 121.bibm12 alignment 06392676
Gutell 121.bibm12 alignment 06392676Robin Gutell
 
Gutell 120.plos_one_2012_7_e38320_supplemental_data
Gutell 120.plos_one_2012_7_e38320_supplemental_dataGutell 120.plos_one_2012_7_e38320_supplemental_data
Gutell 120.plos_one_2012_7_e38320_supplemental_dataRobin Gutell
 
Gutell 119.plos_one_2017_7_e39383
Gutell 119.plos_one_2017_7_e39383Gutell 119.plos_one_2017_7_e39383
Gutell 119.plos_one_2017_7_e39383Robin Gutell
 
Gutell 118.plos_one_2012.7_e38203.supplementalfig
Gutell 118.plos_one_2012.7_e38203.supplementalfigGutell 118.plos_one_2012.7_e38203.supplementalfig
Gutell 118.plos_one_2012.7_e38203.supplementalfigRobin Gutell
 
Gutell 114.jmb.2011.413.0473
Gutell 114.jmb.2011.413.0473Gutell 114.jmb.2011.413.0473
Gutell 114.jmb.2011.413.0473Robin Gutell
 
Gutell 116.rpass.bibm11.pp618-622.2011
Gutell 116.rpass.bibm11.pp618-622.2011Gutell 116.rpass.bibm11.pp618-622.2011
Gutell 116.rpass.bibm11.pp618-622.2011Robin Gutell
 
Gutell 115.rna2dmap.bibm11.pp613-617.2011
Gutell 115.rna2dmap.bibm11.pp613-617.2011Gutell 115.rna2dmap.bibm11.pp613-617.2011
Gutell 115.rna2dmap.bibm11.pp613-617.2011Robin Gutell
 
Gutell 113.ploso.2011.06.e18768
Gutell 113.ploso.2011.06.e18768Gutell 113.ploso.2011.06.e18768
Gutell 113.ploso.2011.06.e18768Robin Gutell
 
Gutell 112.j.phys.chem.b.2010.114.13497
Gutell 112.j.phys.chem.b.2010.114.13497Gutell 112.j.phys.chem.b.2010.114.13497
Gutell 112.j.phys.chem.b.2010.114.13497Robin Gutell
 
Gutell 111.bmc.genomics.2010.11.485
Gutell 111.bmc.genomics.2010.11.485Gutell 111.bmc.genomics.2010.11.485
Gutell 111.bmc.genomics.2010.11.485Robin Gutell
 
Gutell 110.ant.v.leeuwenhoek.2010.98.195
Gutell 110.ant.v.leeuwenhoek.2010.98.195Gutell 110.ant.v.leeuwenhoek.2010.98.195
Gutell 110.ant.v.leeuwenhoek.2010.98.195Robin Gutell
 
Gutell 109.ejp.2009.44.277
Gutell 109.ejp.2009.44.277Gutell 109.ejp.2009.44.277
Gutell 109.ejp.2009.44.277Robin Gutell
 
Gutell 108.jmb.2009.391.769
Gutell 108.jmb.2009.391.769Gutell 108.jmb.2009.391.769
Gutell 108.jmb.2009.391.769Robin Gutell
 
Gutell 107.ssdbm.2009.200
Gutell 107.ssdbm.2009.200Gutell 107.ssdbm.2009.200
Gutell 107.ssdbm.2009.200Robin Gutell
 
Gutell 106.j.euk.microbio.2009.56.0142.2
Gutell 106.j.euk.microbio.2009.56.0142.2Gutell 106.j.euk.microbio.2009.56.0142.2
Gutell 106.j.euk.microbio.2009.56.0142.2Robin Gutell
 
Gutell 105.zoologica.scripta.2009.38.0043
Gutell 105.zoologica.scripta.2009.38.0043Gutell 105.zoologica.scripta.2009.38.0043
Gutell 105.zoologica.scripta.2009.38.0043Robin Gutell
 
Gutell 104.biology.direct.2008.03.016
Gutell 104.biology.direct.2008.03.016Gutell 104.biology.direct.2008.03.016
Gutell 104.biology.direct.2008.03.016Robin Gutell
 

More from Robin Gutell (20)

Gutell 124.rna 2013-woese-19-vii-xi
Gutell 124.rna 2013-woese-19-vii-xiGutell 124.rna 2013-woese-19-vii-xi
Gutell 124.rna 2013-woese-19-vii-xi
 
Gutell 123.app environ micro_2013_79_1803
Gutell 123.app environ micro_2013_79_1803Gutell 123.app environ micro_2013_79_1803
Gutell 123.app environ micro_2013_79_1803
 
Gutell 122.chapter comparative analy_russell_2013
Gutell 122.chapter comparative analy_russell_2013Gutell 122.chapter comparative analy_russell_2013
Gutell 122.chapter comparative analy_russell_2013
 
Gutell 121.bibm12 alignment 06392676
Gutell 121.bibm12 alignment 06392676Gutell 121.bibm12 alignment 06392676
Gutell 121.bibm12 alignment 06392676
 
Gutell 120.plos_one_2012_7_e38320_supplemental_data
Gutell 120.plos_one_2012_7_e38320_supplemental_dataGutell 120.plos_one_2012_7_e38320_supplemental_data
Gutell 120.plos_one_2012_7_e38320_supplemental_data
 
Gutell 119.plos_one_2017_7_e39383
Gutell 119.plos_one_2017_7_e39383Gutell 119.plos_one_2017_7_e39383
Gutell 119.plos_one_2017_7_e39383
 
Gutell 118.plos_one_2012.7_e38203.supplementalfig
Gutell 118.plos_one_2012.7_e38203.supplementalfigGutell 118.plos_one_2012.7_e38203.supplementalfig
Gutell 118.plos_one_2012.7_e38203.supplementalfig
 
Gutell 114.jmb.2011.413.0473
Gutell 114.jmb.2011.413.0473Gutell 114.jmb.2011.413.0473
Gutell 114.jmb.2011.413.0473
 
Gutell 116.rpass.bibm11.pp618-622.2011
Gutell 116.rpass.bibm11.pp618-622.2011Gutell 116.rpass.bibm11.pp618-622.2011
Gutell 116.rpass.bibm11.pp618-622.2011
 
Gutell 115.rna2dmap.bibm11.pp613-617.2011
Gutell 115.rna2dmap.bibm11.pp613-617.2011Gutell 115.rna2dmap.bibm11.pp613-617.2011
Gutell 115.rna2dmap.bibm11.pp613-617.2011
 
Gutell 113.ploso.2011.06.e18768
Gutell 113.ploso.2011.06.e18768Gutell 113.ploso.2011.06.e18768
Gutell 113.ploso.2011.06.e18768
 
Gutell 112.j.phys.chem.b.2010.114.13497
Gutell 112.j.phys.chem.b.2010.114.13497Gutell 112.j.phys.chem.b.2010.114.13497
Gutell 112.j.phys.chem.b.2010.114.13497
 
Gutell 111.bmc.genomics.2010.11.485
Gutell 111.bmc.genomics.2010.11.485Gutell 111.bmc.genomics.2010.11.485
Gutell 111.bmc.genomics.2010.11.485
 
Gutell 110.ant.v.leeuwenhoek.2010.98.195
Gutell 110.ant.v.leeuwenhoek.2010.98.195Gutell 110.ant.v.leeuwenhoek.2010.98.195
Gutell 110.ant.v.leeuwenhoek.2010.98.195
 
Gutell 109.ejp.2009.44.277
Gutell 109.ejp.2009.44.277Gutell 109.ejp.2009.44.277
Gutell 109.ejp.2009.44.277
 
Gutell 108.jmb.2009.391.769
Gutell 108.jmb.2009.391.769Gutell 108.jmb.2009.391.769
Gutell 108.jmb.2009.391.769
 
Gutell 107.ssdbm.2009.200
Gutell 107.ssdbm.2009.200Gutell 107.ssdbm.2009.200
Gutell 107.ssdbm.2009.200
 
Gutell 106.j.euk.microbio.2009.56.0142.2
Gutell 106.j.euk.microbio.2009.56.0142.2Gutell 106.j.euk.microbio.2009.56.0142.2
Gutell 106.j.euk.microbio.2009.56.0142.2
 
Gutell 105.zoologica.scripta.2009.38.0043
Gutell 105.zoologica.scripta.2009.38.0043Gutell 105.zoologica.scripta.2009.38.0043
Gutell 105.zoologica.scripta.2009.38.0043
 
Gutell 104.biology.direct.2008.03.016
Gutell 104.biology.direct.2008.03.016Gutell 104.biology.direct.2008.03.016
Gutell 104.biology.direct.2008.03.016
 

Recently uploaded

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 

Recently uploaded (20)

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 

Gutell 084.jmb.2002.323.0035

  • 1. Distribution of rRNA Introns in the Three-dimensional Structure of the Ribosome Scott A. Jackson1 , Jamie J. Cannone2 , Jung C. Lee3 , Robin R. Gutell2 * and Sarah A. Woodson4 * 1 Department of Chemistry and Biochemistry University of Maryland College Park, MD 20497-2021 USA 2 The Institute for Cellular and Molecular Biology and Section of Integrative Biology, The University of Texas at Austin, TX 78712 USA 3 Division of Medicinal Chemistry, College of Pharmacy, The University of Texas at Austin, TX 78712 USA 4 T. C. Jenkins Department of Biophysics, Johns Hopkins University, 3400 N. Charles Street, Baltimore MD 21218-4118, USA More than 1200 introns have been documented at over 150 unique sites in the small and large subunit ribosomal RNA genes (as of February 2002). Nearly all of these introns are assigned to one of four main types: group I, group II, archaeal and spliceosomal. This sequence information has been organized into a relational database that is accessible through the Comparative RNA Web Site (http://www.rna.icmb.utexas.edu/) While the rRNA introns are distributed across the entire tree of life, the majority of introns occur within a few phylogenetic groups. We analyzed the distributions of rRNA introns within the three-dimensional structures of the 30 S and 50 S ribosomes. Most sites in rRNA genes that contain introns contain only one type of intron. While the intron insertion sites occur at many different coordinates, the majority are clustered near conserved residues that form tRNA binding sites and the subunit interface. Contrary to our expectations, many of these positions are not accessible to solvent in the mature ribosome. The correlation between the frequency of intron insertions and proximity of the insertion site to functionally important residues suggests an association between intron evolution and rRNA function. q 2002 Elsevier Science Ltd. All rights reserved Keywords: group I/II introns; ribosomal RNA; intron transposition; reverse splicing; sequence database*Corresponding authors Introduction Introns in ribosomal RNA genes are found pre- dominantly found within conserved sequences near tRNA and mRNA binding sites, suggesting a possible link between intron evolution and rRNA function.1 – 3 Examples of every major intron class have been identified in rRNA genes.4,5 These include group I and group II introns,6,7 tRNA-like introns in archaeal genomes,8 a newly defined family of “spliceosomal” introns in eukaryotic nuclear rDNA with splice sites that resemble the conserved splice site sequences of nuclear pre- mRNA introns,9 and a small number of introns that cannot be assigned to one of these four groups. The sporadic appearance of group I and group II introns among the rRNA genes of organisms from all three phylogenetic kingdoms points to a complex evolutionary past.10 Although examples of introns that have descended through ancient lineages are known,11 – 14 the appearance of similar introns in different genes or unrelated organisms suggests that they were inserted into host genomes many times during their evolution.15 – 19 Conse- quently, the distribution of known rRNA introns is the product of multiple insertions and selective losses.20 Since ribosomal RNAs are excellent chrono- meters by which to measure phylogenetic relationships,21 many laboratories are determining rRNA sequences from organisms spanning the entire tree of life. Consequently, GenBank contains nearly 10,000 complete 16 S and 23 S (and 16 S-like and 23 S-like) sequences.22 From this diverse collection of rRNA sequences, approximately 1200 introns have been identified, sequenced, and 0022-2836/02/$ - see front matter q 2002 Elsevier Science Ltd. All rights reserved E-mail addresses of the corresponding authors: swoodson@jhu.edu; robin.gutell@mail.utexas.edu Abbreviations used: ASA, accessible surface area; CRW, Comparative RNA Web. doi:10.1016/S0022-2836(02)00895-1 available online at http://www.idealibrary.com on Bw J. Mol. Biol. (2002) 323, 35–52
  • 2. Table 1. Number of known intron sequences at each rRNA position Intron typea Cell locationb Positionc No. intronsd I II A S U C M N A. 16 S rRNA TBD 20 20 – – – – – – 20 40 1 1 – – – – – – 1 114 2 2 – – – – – – 2 156 3 3 – – – – – – 3 170 1 1 – – – – – – 1 263 1 – – 1 – – – – 1 265 3 – – – 3 – – – 3 287 2 2 – – – – – – 2 297 4 – – – 4 – – – 4 298 2 – – – 2 – – – 2 299 11 – – – 11 – – – 11 300 1 – – – 1 – – – 1 322 1 – – 1 – – – – 1 323 3 3 – – – – – – 3 330 21 – – – 21 – – – 21 331 7 – – – 7 – – – 7 332 1 – – – 1 – – – 1 333 1 – – – 1 – – – 1 337 1 – – – 1 – – – 1 374 2 – – 2 – – – – 2 390 1 – – – 1 – – – 1 392 1 1 – – – – – – 1 393 11 – – – 11 – – – 11 400 1 – – – 1 – – – 1 497 2 2 – – – – – – 2 508 1 1 – – – – 1 – – 516 118 116 – – – 2 – – 118 529 1 1 – – – – – – 1 531 2 2 – – – – 1 1 – 532 1 – – 1 – – – – 1 548 1 – – 1 – – – – 1 568 1 1 – – – – – – 1 569 6 6 – – – – – 6 – 570 1 1 – – – – – 1 – 651 1 1 – – – – – – 1 674 3 – – – 3 – – – 3 742 1 – – – 1 – – 1 781 4 – – 4 – – – – 4 788 25 22 2 – – 1 – 2 23 789 1 1 – – – – – – 1 793 6 6 – – – – 1 5 – 879 1 – – – – 1 – – 1 882 1 – – – 1 – – – 1 883 5 – – – 5 – – – 5 891 1 1 – – – – – – 1 896 1 – – – – 1 – – 1 901 2 – – 2 – – – – 2 908 12 – – 12 – – – – 12 911 1 – 1 – – – – 1 – 915 2 – – – – 2 – 2 – 934 1 1 – – – – – – 1 939 8 – – – 8 – – – 8 940 9 9 – – – – – – 9 943 126 125 – – – 1 – – 126 952 5 – 5 – – – – 5 – 956 6 6 – – – – – – 6 966 1 1 – – – – – – 1 989 24 24 – – – – – – 24 1046 10 10 – – – – – – 10 1052 2 2 – – – – – – 2 1062 1 1 – – – – – – 1 1068 8 – – 8 – – – – 8 1071 1 – – – 1 – – – 1 1083 1 – – – 1 – – – 1 1092 1 – – 1 – – – – 1 1139 2 2 – – – – – – 2 1197 1 – – – 1 – – – 1 1199 66 66 – – – – – 66 (continued) 36 rRNA Introns in 3D
  • 3. Table 1 Continued Intron typea Cell locationb Positionc No. intronsd I II A S U C M N 1205 5 – – 5 – – – – 5 1210 9 9 – – – – – 5 4 1213 5 – – 5 – – – – 5 1224 2 2 – – – – – 2 – 1226 2 – – – 2 – – – 2 1229 6 – – – 6 – – – 6 1247 1 1 – – – – – 1 – 1363 1 – – 1 – – – – 1 1389 7 7 – – – – – – 7 1391 1 – – 1 – – – – 1 1502 2 – – – – 2 – – 2 1506 152 152 – – – – – – 152 1508 1 – – – – 1 – – 1 1511 4 4 – – – – – – 4 1512 29 29 – – – – – – 29 1514 1 – – – 1 – – – 1 1516 145 145 – – – – – – 145 Total 950 790 8 45 92 15 3 31 916 B. 23 S rRNA 575 1 – 1 – – – – 1 – 580 1 1 – – – – – 1 – 678 12 – – – 12 – – – 12 681 1 – – – 1 – – – 1 711 1 – – – 1 – – – 1 730 2 2 – – – – 2 – – 775 1 – – – 1 – – – 1 776 4 – – – 4 – – – 4 779 1 1 – – – – – 1 – 780 1 – – – 1 – – – 1 784 1 – – – 1 – – – 1 786 1 – – – 1 – – – 1 787 2 – 1 – 1 – – 1 1 796 1 1 – – – – – 1 – 798 30 30 – – – – – – 30 799 1 1 – – – – – – 1 800 3 3 – – – – – – 3 824 1 – – – 1 – – – 1 858 2 – – – 2 – – – 2 958 3 3 – – – – 3 – – 978 1 – – – 1 – – – 1 1025 3 3 – – – – – – 3 1065 4 4 – – – – 3 1 – 1066 1 1 – – – – – 1 – 1085 9 – – 9 – – – – 9 1090 1 1 – – – – – – 1 1091 1 – – – 1 – – – 1 1094 6 6 – – – – – – 6 1255 3 3 – – – – 2 – – 1685 1 1 – – – – – 1 – 1699 1 1 – – – – – 1 – 1766 2 2 – – – – 2 – – 1787 2 – 2 – – – – 2 – 1809 1 – – 1 – – – – 1 1849 1 – – – 1 – – – 1 1915 1 1 – – – – – 1 – 1917 1 1 – – – – 1 – – 1921 13 13 – – – – – – 13 1923 14 13 – – – 1 8 3 3 1925 9 9 – – – – – – 9 1926 4 4 – – – – – – 4 1927 7 – – 7 – – – – 7 1931 16 16 – – – – 11 4 1 1939 1 1 – – – – – 1 – 1943 1 1 – – – – – 1 – 1949 12 12 – – – – – 7 5 1951 3 3 – – – – 2 1 – 1952 1 – – 1 – – – – 1 1974 2 1 – – – 1 – 2 – (continued) rRNA Introns in 3D 37
  • 4. organized into a relational database.5 Since the rRNA sequences flanking the introns are con- served, the intron/exon boundaries have been mapped unambiguously. The collections of rRNA and intron sequences are sufficiently large to inde- pendently determine the phylogenetic relation- ships of the introns and the host organisms. The prevalence of introns in regions of the 16 S and 23 S rRNAs that bind tRNAs and elongation factors sparked the suggestion in 1993 that this dis- tribution arises from reverse splicing into rRNA sites that are solvent-accessible in ribosomes.23 Other experiments showed that the efficiency of pre-rRNA splicing depends on interactions between the intron and the surrounding rRNA.24–27 Both of these results suggest that introns are more frequent in certain regions of rRNA genes than others, due to differences in the conformations of the mature rRNAs at those sites. Here, we revisit this question in light of the sig- nificant increase in the collection of rRNA introns and the recent high-resolution structures of the ribosome.28 – 31 We have determined that intron- containing sites are strongly clustered around pos- itions of the rRNA that interact with tRNAs and mRNA, but these positions are not more accessible to solvent in the mature ribosome than average. Although the splicing mechanism of each intron class is distinct, most types of introns cluster within the same regions of the rRNA, suggesting that the movement and retention of introns is driven by structural features in the rRNA. The implications of these findings for mechanisms of intron mobility and retention are discussed. Results A total of 1253 rRNA sequences containing introns were retrieved from GenBank as of February 2002, representing organisms from all major phylogenetic groups. The positions of the introns within the rRNA gene were determined from alignments of the mature rRNAs, as pre- viously described,5 and numbered according to the sequences of the Escherichia coli 16 S and 23 S rRNAs.32 Sequence entries were sorted according to the position of the intron in the 16 S or 23 S rRNA, and the number of rRNA sequences con- taining an intron at each position was noted (Table 1). The introns were also classified as group I6,33 or group II7 families, tRNA-like archaeal introns,26 or nuclear spliceosomal introns,9,34 based on their conformity to consensus secondary structures and conserved sequences. Group I and group II introns were further subdivided into structural families.6,7,33 The database of rRNA introns is continually updated as new rRNA sequences are deposited in GenBank.5 Hence, the precise statistics reported here are expected to change as the number of available intron-containing sequences increases. However, the number of intron-containing pos- itions in the rRNA is increasing more slowly than Table 1 Continued Intron typea Cell locationb Positionc No. intronsd I II A S U C M N 2004 1 – – – – 1 – – 1 2059 4 – 4 – – – – 4 – 2066 6 6 – – – – – – 6 2067 1 1 – – – – – 1 – 2256 1 1 – – – – – 1 – 2262 5 5 – – – – 5 – – 2449 48 48 – – – – 7 30 11 2451 1 – 1 – – – – 1 – 2455 1 – 1 – – – – 1 – 2499 1 1 – – – – – 1 – 2500 32 32 – – – – 10 22 – 2504 5 5 – – – – – 5 – 2509 2 – 2 – – – – 2 – 2552 1 – – 1 – – – – 1 2563 7 7 – – – – – – 7 2585 2 2 – – – – – 2 – 2593 16 16 – – – – 13 3 – 2596 3 3 – – – – 3 – – 2601 2 – – 2 – – – – 2 2610 1 – 1 – – – – 1 – Total 335 269 13 21 29 3 76 105 154 a Intron types are classified as I, group I; II, group II; A, archaeal; S, spliceosomal; U, unknown. b Cell compartment of rRNA gene: C, chloroplast; M, mitochondrion; N, nucleus. Introns in archaea and bacteria are defined as belonging to the nucleus. The only known rRNA intron in a bacterial genome occurs at position 1931 in the 23 S rRNA. c Position of nucleotide immediately prior to intron in Escherichia coli reference sequence. TBD, to be determined. These intron sequences were published without flanking exon sequence and their insertion sequences could not be determined. d Number of intron sequences presently known at that position. 38 rRNA Introns in 3D
  • 5. the total number of intron sequences. Conse- quently, the general trends in the data are likely to remain the same. Non-random distribution of introns in rRNA genes Within 16 S (and 16 S-like) rRNA genes, 920 intron sequences are distributed over 84 sites, among 1542 E. coli positions in the mature rRNA. Among 23 S (and 23 S-like) rRNA genes, 325 introns are found at 69 sites, out of 2904 E. coli pos- itions. Although 23 S rRNA is twice the length of the 16 S rRNA, threefold more introns have been documented in 16 S rRNA genes. This apparent bias is largely due to unequal sampling of rRNA sequences. As of February 2002, the CRW database contains 7527 complete 16 S rRNA sequences and 960 complete 23 S rRNA sequences. When normal- ized against the number of reported rRNA sequences, the 16 S and 23 S rRNAs have approxi- mately the same number of introns per rRNA nucleotide, with ,1024 introns per sequence per nucleotide in each gene. Comparison of the data with a Poisson distri- bution based on the mean frequency of rRNA introns shows that many fewer sites contain introns than would be expected if introns were inserted randomly within rRNA genes (Figure 1). A few sites in the rRNA account for a large fraction of the intron sequences in the CRW database (Table 1). These include positions 516, 943, 1506 and 1516 in the 16 S rRNA, with more than 100 known introns each, and positions 798, 2449, and 2500 in the 23 S rRNA, which each have more than 25 intron sequences. This skewed distribution persists even when sequence entries from closely related species or strains are counted only once. Hence, the presence of many intron sequences at a few sites in the rRNA is not due to biased sampling of rRNA sequences. The tendency of introns to occur in particular positions of the rRNA can be explained by two mechanisms; (1) vertical inheritance of introns from ancestral genes, or (2) preferential horizontal transfer of intron sequences to certain target sites. The presence of similar introns at the same position in the rRNAs of related organisms is usually attributed to stable inheritance of the intron within the lineage. Examples of this inheri- tance include 54 group IE introns that occur at position 1199 of Ascomycota nuclear 16 S rRNAs35 and 128 group I introns at position 1516 in the 16 S rRNA genes of Lecanorineae (Table 1). In contrast, different subclasses of introns at the same rRNA position, or closely related introns in unrelated organisms, are best explained by horizontal transfer of the intron sequences. For example, group IC1 introns with a distinctive structural motif are inserted after position 516 in the 16 S rRNA of red and brown algae but are not found in intermediate relatives on the phylogenetic tree.19 This suggests that the introns were acquired independently after the red and brown algae diverged. Introns have probably been inserted (and lost) at position 516 many times during evolution,14 as introns from two structural classes (IC1 and IE) have been identified at this position in four phylogenetic groups.5 In the 23 S rRNA, nine group I introns corresponding to three different subclasses were found at position 1931 in the mitochondrial, chloroplast and nuclear rDNA of diverse organisms.16,17,36 Thus, the phylogenetic data suggest that introns are repeat- edly inserted at the same positions in rRNA genes. Therefore, the prevalence of introns at certain positions in rRNA genes is due not only to inheritance of a few stable introns but also to site-selective transposition and retention of introns. Intron positions at the functional heart of the ribosome To determine whether the location of intron sites depends on the higher-order structure of the Figure 1. Distribution of introns in rRNAs. Histogram of the number of positions in the (a) 16 S rRNA or (b) 23 S rRNA with a given number of intron insertions. Positions are numbered according to the sequence of the E. coli rRNAs. Shaded bars indicate the actual number of intron-containing positions (Table 1); lines indicate the expected number of intron-containing positions expected from a Poisson distribution, based on the mean frequency of introns in rRNA sequences. Insets show an expansion of the y-axis. rRNA Introns in 3D 39
  • 6. rRNA, intron-containing positions were mapped onto secondary structures of the 16 S and 23 S rRNAs determined from comparative sequence analysis (Figures 2(a) and 3(a)).3,5,37 The intron positions were also visualized on three-dimen- sional structures of the 30 S and 50 S ribosomal subunits recently determined by X-ray crystal- lography28,29 (Figures 2(b) and 3(b) and Further information; see Methods. Introns are found in every domain of the 16 S rRNAs. A large number of intron-containing sites lie near highly conserved regions that function in decoding and translocation, such as the 530 loop, the “switch helix” (nucleotides (nt) 885–912), and the 30 -terminal helix (nt 1506–1529). Other introns are found near residues that form the ribosome subunit interface, such as four intron insertions within the 790 loop (Figure 2(a)). When mapped onto the structure of the Thermus thermophilus 30 S ribosome29 (Figure 2(b)), intron-containing positions are concentrated towards the center of the subunit, as suggested by their locations in the secondary structure. A third of intron positions lie on the face of the 30 S particle that contacts the 50 S subunit. Strikingly, positions with large numbers of introns (516, 788, 943, 1199, 1506 and Figure 2. Location of introns in 16 S rRNAs. (a) Secondary structure of 16 S rRNA, showing nucleotides that are present in more than 95% of known sequences.5,37 Red dot, $98% conserved; gray dot, 90– 98% conserved; open circle, ,90% conserved. Positions for which an intron is documented are numbered according to the sequence of the E. coli 16 S rRNA and colored according to intron type: green, group I; red, group II; cyan, archaeal; blue, spliceosomal; orange, other; black, multiple intron types. Nucleotides that are within 10 A˚ of mRNA or tRNA bound at the A, P or E sites42 are shaded yellow. Brackets indicate positions that are not conserved among all rRNA sequences. The asterisk (p) indicates an insertion relative to the E. coli sequence. (b) Three- dimensional structure of the 30 S ribosomal subunit (1FJF),29 with intron-containing positions colored according to intron type as in (a). Large spheres represent positions with more than 100 known intron sequences. The 16 S rRNA is shown as light gray ribbon; proteins as pink ribbons. 40 rRNA Introns in 3D
  • 7. 1516) all lie near functionally important residues at the subunit interface. Among 23 S rRNA genes, introns are also found near functionally important positions in domains II, IV and V of the 23 S rRNA (Figure 3(a)). In domain IV of the 23 S rRNA, many introns are found in sequences that form the interface with the 30 S subunit, with a large number of insertions near helix 69 (nt 1905–1920) at the center of the subunit (Figure 3). This stem–loop interacts with tRNAs bound in the A and P sites, and makes contacts with 16 S rRNA residues in the decoding site of the ribosome.38,39 In three-dimensional views of the Haloarcula marismortui 50 S ribosome,28 the intron-containing positions in domain IV can be seen running across the central ridge and down the face that contacts the 30 S subunit (Figure 3(b)). Many introns are inserted adjacent to highly conserved nucleotides in domain V of the 23 S rRNA that form the peptidyl transferase center of the ribosome (Figure 3(b)). These nucleotides line both sides of the peptidyl transferase cleft (Figure 3(b)), placing the sites of intron insertions in close proximity to the acceptor stems of bound tRNAs. Among nucleotides deep in the peptidyl transfer- ase cleft are several positions that frequently precede intron sites, such as nt 2449 and 2500 (Table 1). A smaller number of introns are also found throughout domain II of 23 S rRNA genes (Figure 3). Several introns are found in the L11-binding region (nt 1050–1100), which interacts with elongation factors in the 70 S ribosome.40,41 Surpris- ingly, no introns were found at rRNA positions that face the solvent side of the 50 S subunit or in the bottom surface created by domains III and VI of the 23 S rRNA. Thus, intron-containing positions in the 23 S rRNA are concentrated in the center of the 50 S particle and on the surface that contacts the 30 S subunit. Three-dimensional clusters of intron sites To quantify the extent to which intron insertion sites are clustered within the three-dimensional structure of the 70 S ribosome, we computed a center of mass from the coordinates of the 30 phos- phate group of each intron insertion site. The standard deviations of the x, y and z coordinates were summed to create a “cluster score,” with a lower number representing a tighter group (see Methods). As illustrated in Figure 4(a), nucleotides that precede introns are closer to each other than the same number of randomly selected phosphate groups in the rRNA. This is particularly true of positions that contain large numbers of introns. Hence, intron-containing positions are more tightly clustered in space when each coordinate is weighted by the number of rRNA genes containing an intron at that position (Figure 4(a)). Proximity to conserved rRNA nucleotides To analyze the extent to which introns are present at conserved rRNA sites, the conservation of nucleotides 50 and 30 of each intron insertion site was determined. The level of conservation was extracted from alignments of rRNA sequences representing organisms from the three primary phylogenetic domains, plus chloroplasts and mitochondria,5 and is available at the CRW Site (see Methods). Nucleotides that are more than 98% identical (category A) or 90–98% identical (category B) are highlighted in red and gray in the secondary structure diagrams in Figures 2(a) and 3(a), and in space-filling representations of the ribosomal subunits in Figure 5. Only 12% and 5% of all 16 S and 23 S rRNA nucleotides, respectively, are conserved in more than 98% of all known rRNA sequences (category A). In contrast, 21% of 16 S and 25% of 23 S rRNA nucleotides that precede an intron-insertion site are conserved at this level (Table 2(A)). We observe a similar bias in the number of category B nucleo- tides (90–98% identical) preceding intron positions (Table 2(A)). Alternatively, we counted the number of intron-containing positions that occur near a category A nucleotide (Table 2(B)). About a third (36%) of 16 S rRNA intron positions were adjacent to a category A nucleotide, compared with 18% of all 16 S rRNA nucleotides. In the 23 S rRNA, 32% of intron-containing positions were adjacent to a category A nucleotide, compared with 8% overall. Positions with many introns (16 S positions 516, 788, 943, 1199, 1506, 1512, 1516; 23 S positions 798, 1921, 2449, 2500) are even more likely to be next to highly conserved nucleotides than positions that contain only one or a few known introns. This bias is quite strong, particularly among 16 S rRNA sequences. Thus, introns are much more likely to lie within conserved regions of the rRNA than expected if they were randomly inserted in rRNA genes. Although introns tend to occur in conserved regions of the rRNA, not all highly conserved regions contain known introns. The most con- served regions of the 16 S and 23 S rRNAs that do not contain any known introns as of February 2002 are the 250 region in domain I and the alpha- sarcin/ricin loop in domain VI (2650–2670) of the 23 S rRNA, and the 1400 region of the 16 S rRNA that forms part of the mRNA decoding center. In fact, the lack of introns in domain VI of the 23 S rRNA is striking, given its conservation and functional importance. Proximity to tRNA binding sites Most introns tend to be inserted near residues that form tRNA binding sites (Figures 2 and 3). This trend was evaluated by computing the frac- tion of intron sites within a certain distance of the A, P or E sites of the 70 S ribosome (see Methods). As shown in Figure 6, intron-containing positions rRNA Introns in 3D 41
  • 8. Figure 3 (legend opposite) 42 rRNA Introns in 3D
  • 9. are closer in space to bound tRNAs than average positions in the 16 S or 23 S rRNAs. Among 153 positions that contain introns, 26 are within 10 A˚ of a tRNA site and include rRNA nucleotides that make direct interactions with bound tRNAs.31 Roughly 60% of introns lie within 20 A˚ of the A, P or E sites, and 90% are within 40 A˚ . In contrast, the mean distance of all rRNA residues from the tRNA sites is 50 A˚ . Nearly all of the rRNA sites that contain a large number of introns are within 20 A˚ of a bound tRNA. These include positions 516, 943 and 1506 in the 16 S rRNA and 2449 and 2500 in the 23 S rRNA. Position 1506 in the 16 S rRNA is even closer (10 A˚ ) to the mRNA channel.42 The proximity of intron-containing positions to tRNA binding sites explains the clustering of introns within the three-dimensional structure of the 70 S ribosome and is congruent with their proximity to highly conserved rRNA residues. A corollary of the association between introns and functional sites in the ribosome is that intron- containing positions occur in RNA-rich areas of the ribosome. Qualitative examination of the 30 S ribosome structure showed that very few intron- containing positions interact directly with riboso- mal proteins. Almost no intron-containing positions in the 23 S rRNA contact proteins, as most proteins bind the solvent face of the large subunit,28 which is devoid of introns. Comparison of intron types Introns belonging to different classes (group I, II, archaeal and spliceosomal) are rarely inserted at precisely the same position within the rRNA.5 Two exceptions are position 788 in the 16 S rRNA, which contains group I and group II introns (Figure 2(a) and Table 1A), and position 787 in the Figure 4. Three-dimensional clustering of rRNA intron sites within the 70 S ribosome. Cluster scores were computed as described in Methods; a lower score indicates a narrower distribution of x, y and z coordinates. (a) Comparison of rRNA positions in the 16 S and 23 S rRNAs (70 S), 23 S rRNA only (50 S), or 16 S rRNA only (30 S). Dark gray, all rRNA positions; light gray, intron-containing positions; white, intron-containing sequences; hatched, P-site tRNA. (b) Compari- son of intron types. Dark gray, 70 S ribosome; light gray, 50 S ribosome; white, 30 S subunit. All positions corresponds to all rRNA residues; reverse splicing refers to positions that were experimentally found to react with the Tetrahymena group I intron.47,48 Figure 3. Location of introns in 23 S rRNAs. (a) Secondary structure of 23 S rRNA, showing nucleotides that are present in more than 95% of known sequences.3,5 Nucleotides are represented as in Figure 2(a). Positions that contain a documented intron are numbered according to the sequence of the E. coli 23 S rRNA and colored according to intron type as in Figure 2(a). (b) Three-dimensional structure of the 50 S ribosome (1JJ2),76 colored as in Figure 2(b). Spheres represent intron-containing positions; large spheres indicate positions with more than 29 known intron sequences. rRNA Introns in 3D 43
  • 10. 23 S rRNA, which contains group II and spliceosomal introns (Figure 3(b) and Table 1B). A few sites contain both group I and unclassified introns (16 S rRNA positions 516 and 943; 23 S positions 1923 and 1974). As each intron type has a different splicing mechanism, distinct insertion sites may reflect different constraints on splice-site recognition. For example, the 50 splice site of group I introns is nearly always preceded by a U,43,44 whereas group II and archaeal exon bound- aries are variable in sequence.26,45 Although their precise insertion sites do not overlap, most introns are inserted near rRNA resi- dues that form the center of the 70 S ribosome (Figures 2(b) and 3(b)). Nucleotides near the center of the ribosome tend to be conserved, and group I, group II and archaeal introns are particularly likely to be inserted at locations within three positions of a category A nucleotide (Table 2B). These data suggest that the evolutionary forces that shape the distribution of group I, group II and archaeal introns within rRNA genes originate from the sequence and structure of the rRNA itself rather than the type of intron. In contrast, spliceosomal introns are less clustered with respect to the three-dimensional structure of the ribosome than group I and group II introns (Figure 4(b)). Moreover, they are less likely to be near a highly conserved nucleotide than other types of introns (Table 2B), and more likely to be found in residues far from tRNA binding sites. For example, only one spliceosomal intron is found in 23 S rRNA domain IV (1849), although this region contains many group I and group II intron insertions (Figure 3(a)). As spliceo- somal introns do not have a conserved secondary structure, their splicing may be less sensitive to the structure of the rRNA exons than group I, group II and archaeal introns, which must fold into a precise structure in order to be spliced. Solvent accessibility of intron positions Introns may preferentially exist near rRNA residues that interact with tRNAs, because these residues are on the surface of the ribosome. If introns are inserted into the rRNA by reverse splicing, we would expect them to integrate more frequently into accessible positions of the rRNA.23,46 Alternatively, introns may be retained more frequently at exposed positions where they might not interfere with ribosome assembly. To address this question, the solvent-accessible surface area (ASA) of each atom was computed from the coordinates for the T. thermophilus 30 S ribosome and the 50 S subunit from H. marismortui, as described in Methods. The 30 oxygen atom was used as a proxy for intron insertion sites, although similar results were obtained with other backbone atoms. As expected, many 30 oxygen atoms are fully buried (ASA ¼ 0 A˚ 2 ), while others are at least partially exposed to solvent (ASA . 4 A˚ 2 ) (Figure 7). The latter correspond to residues that are on the surface Figure 5. Conservation of rRNA. Intron positions (yellow spheres) are indicated on a space-filling model of the ribo- some. Red, rRNA nucleotides that are .98% conserved among all sequences (category A); pink, nucleotides that are 90–98% conserved (category B).5 Atoms are rendered at twice the van der Waal’s radius; intron positions are rendered four times larger. (a) 30 S (front view). (b) 50 S (crown view). The figure was prepared with Insight II. 44 rRNA Introns in 3D
  • 11. of the ribosomal subunit. Contrary to our expec- tations, 30 oxygen atoms preceding intron-contain- ing sites were exposed to solvent in the same proportion at all rRNA positions (Figure 7). From these data, we conclude that intron-containing positions are not systematically biased towards rRNA positions that are solvent-accessible in mature subunits. Visual inspection of the three-dimensional struc- tures of the 30 S and 50 S subunits (Figures 2 and 3) and the data in Figure 7 show that introns are inserted in both accessible and buried positions. Some regions of the 23 S rRNA that contain many introns are highly exposed to solvent in the 50 S ribosome, such as 1080 and 1915. Others, such as position 2449, have an average accessibility, while a few, such as 2500, are buried deeply within the 50 S subunit. The 30 S subunit is quite flat, so that even residues that are buried in the static structure of the ribosome are not far from the surface of the Table 2. Conservation of rRNA near intron sites 16 S 23 S nt %Total nt %Total A. Conservation of rRNA residues prior to intron insertion sitesa All rRNA residues .98% 178 11.5 150 5.2 90–98% 175 11.3 203 7.0 80–90% 116 7.5 168 5.8 Total 1542 100.0 2904 100.0 nt prior to: All introns .98% 18 21.4 17 24.7 90–98% 14 16.7 12 17.4 80–90% 12 14.3 5 7.2 Total 84 100.0 69 100.0 Group I .98% 11 28.2 10 24.4 90–98% 4 10.3 4 9.8 80–90% 4 10.3 4 9.8 Total 39 100.0 41 100.0 Group II .98% 1 33.3 3 37.5 90–98% 1 33.3 2 25.0 80–90% 0 0 0 0 Total 3 100.0 8 100.0 Archaeal .98% 4 28.6 3 50.0 90–98% 7 50.0 2 33.3 80–90% 1 7.1 0 0 Total 14 100.0 6 100.0 Spliceosomal .98% 0 0 1 7.1 90–98% 2 9.1 4 28.6 80–90% 7 31.8 1 7.1 Total 22 100.0 14 100.0 B. Number of intron-containing positions adjacent to highly conserved (.98%) rRNA residues Distance to nearest conserved rRNA position: All rRNA residues Adjacent to at least one 274 17.8 240 8.3 Within 2 nt 351 22.8 308 10.6 Within 3 nt 417 27.1 359 12.4 All introns Adjacent to at least one 30 35.7 22 31.9 Within two nt 41 48.8 33 47.8 Within three nt 50 59.5 40 57.9 Flanked by two 6 7.1 9 13.0 Group I Adjacent to at least one 19 48.7 12 29.3 Within 2 nt 24 61.5 19 46.4 Within 3 nt 29 74.4 22 53.7 Flanked by two 4 10.3 4 9.8 Group II Adjacent to at least one 1 33.3 4 50.0 Within 2 nt 3 100.0 6 75.0 Within 3 nt 3 100.0 7 87.5 Flanked by two 0 0 2 25.0 Archaeal Adjacent to at least one 7 50.0 4 66.7 Within 2 nt 8 57.1 4 66.7 Within 3 nt 8 57.1 4 66.7 Flanked by two 1 7.1 3 50.0 (continued) Table 2 Continued 16 S 23 S nt %Total nt %Total Spliceosomal Adjacent to at least one 1 4.5 2 14.3 Within 2 nt 4 18.2 4 28.6 Within 3 nt 7 31.8 7 50.0 Flanked by two 0 0 0 0 a Conservation levels were calculated based on a large sample of archaeal, bacterial, and eukaryotic nuclear, mito- chondrial and chloroplast rRNA sequences.5 The three conserva- tion ranges (.98%, 90–98% and 80–90%) were determined for positions that are present in more than 95% of all sequences. Figure 6. Proxmity of intron sites to tRNA bound in the 70 S ribosome. The total number of rRNA positions within a given cutoff distance of tRNA bound in the A, P or E site of the 70 S ribosome was calculated as described in Methods. Intron-containing positions were weighted by the number of rRNA sequences with an intron at that position. Continuous line, all 16 S rRNA positions; long dash, 16 S introns; medium dash, 23 S introns; short dash, all 23 S rRNA positions. rRNA Introns in 3D 45
  • 12. particle. All the same, introns are found in 16 S rRNA positions that are among the least accessible in the 30 S crystal structure. The similarity between the ASA values for all rRNA positions and those containing introns remained constant when spheres of radius 1.4 A˚ and 5 A˚ were used to compute the ASA. We also observed no difference between the accessibilities of intron insertion sites and average rRNA nucleo- tides in the absence of ribosomal proteins, although the mean accessibility of all residues increased. Taken together, these data suggest that intron insertion sites are not exclusively found in residues that are highly exposed in the static structures of the 30 S and 50 S ribosomes. It is pos- sible, however, that many intron-containing positions are exposed by conformational changes during translation or during assembly of ribosomal particles. Reverse splicing target sites Previous work showed that the Tetrahymena group I intron can integrate partially or fully into many sites in the 23 S rRNA by reverse splicing when overexpressed in E. coli cells.47,48 An interest- ing question is whether experimentally observed reverse splicing targets are clustered in the same rRNA regions as naturally occurring introns. In contrast to the sites of natural introns, sites that react with the Tetrahymena intron are not only found on the subunit interface but also lie on the “back” face of the 50 S ribosome, away from the 30 S subunit (Figure 8). The reaction sites are less tightly clustered with each other than the positions of natural introns (Figure 4) and not localized around the tRNA binding sites. They were also slightly less solvent-accessible than average, suggesting that accessibility is not a prerequisite for insertion of introns by reverse splicing. The difference in the spatial distribution of naturally occurring introns and reverse splicing targets suggests that other factors determine the positions of rRNA introns, such as selective retention after intron insertion or protein-dependent transposition mechanisms. Discussion The extensive rRNA intron sequence collection available at the Comparative RNA Web (CRW) Site5 is the foundation of our investigation of the distribution of introns within the 16 S and 23 S rRNAs. Positions of the rRNA that are presently known to contain introns were mapped to the secondary structures of the rRNA determined from comparative sequence analysis3,5,37 and the three-dimensional structures of the 30 S, 50 S and 70 S ribosomes determined by X-ray crystallography.28 – 31 Our results, which are based on the analysis of more than 1250 rRNA intron sequences, reveal that introns are not inserted randomly within the 16 S and 23 S rRNA genes. Instead, they are clustered near tRNA binding sites, which span the interface between the small and large subunits of the 70 S ribosome. Surprisingly, intron-containing residues are not more solvent-accessible in the 30 S and 50 S subunits than typical rRNA residues. Thus, while the distribution of introns is coupled to the sequence and structure of the rRNA, it does not directly correlate with the accessibility of the splice junction in static three-dimensional structures of the ribosome. Horizontal transfer of intron sequences Comparative analyses of intron sequences within specific phylogenetic groups have con- cluded that introns at the same position of the rRNA are usually more closely related to one another than introns at different positions.13,14,35,49–51 Figure 7. Surface accessibility of intron positions in mature ribosomal subunits. The ASA of 30 oxygen atoms in the 50 S and 30 S ribosomal subunits was calculated as described in Methods. Histograms show the percen- tage of rRNA positions with a given ASA value. Shaded bars, intron-containing positions; white bars, all rRNA positions. (a) 16 S rRNA in the 30 S subunit. (b) 23 S rRNA in the 50 S subunit. 46 rRNA Introns in 3D
  • 13. This has led to the view that some introns are retained for long periods of evolution after their initial insertion into a specific position in the rRNA gene. On the other hand, introns are periodically lost from certain lineages.13,14 More rarely, they are regained or transferred to a new lineage.15 – 19,36 Hence, intron mobility may be required to ensure survival and the spread of intron sequences to new organisms.10 Besides ensuring intron retention, frequent deletion and reinsertion of intron sequences may result in different phylogenetic trees for rRNA exons and introns, as observed for the group IC1 intron in the rDNA of Tetrahymena.52 This type of dynamic flux has been invoked to explain sequence heterogeneity of non-LTR retrotranspo- sons in insect rDNA.53 As most insertion events recur at the same site in the rRNA with only occasional transposition to a new site,54 the position of the retroelement in the rRNA is main- tained, although its sequence may vary. If introns are continually deleted and reinserted, their final distribution will depend heavily on the selectivity of retention and the target-specificity of the inser- tion mechanism. These factors are discussed below. Target-site selection during intron transposition Many group I and group II introns and some archaeal introns have acquired open reading frames that encode homing DNA endonucleases.10 Although these endonucleases usually recognize the same position of an intron-less allele of the same gene, cross-reaction with related sequences could enable transfer of the intron to homologous positions in the rRNA genes of a different organ- ism or occasional transposition to ectopic sites.55 – 57 Because of the large recognition sites of most homing endonucleases,58 this process would be more effective in conserved regions of the rRNA, contributing to the prevalence of introns at those sites. The feasibility of intron transfer was demon- strated by integration of group I introns from Physarum polycephalum and Tetrahymena thermophila into the rDNA of yeast upon co-expression of the I-PpoI endonuclease.56 Because the rRNA sequences flanking the intron are nearly identical in these species, the introns are spliced correctly in yeast.59 Similarly, reverse transcriptases encoded by group II introns normally promote movement of the intron to an allelic site, but in some circum- stances can transpose the intron to an ectopic site.60 – 63 Although the substrate specificity of proteins associated with intron mobility might account for the prevalence of introns in conserved regions of the rRNA, this mechanism of intron transposition is inconsistent with the sequence data. First, the majority of rRNA introns do not contain open reading frames that encode endonucleases.5 At some sites that contain many introns, such as 1506 in the 16 S rRNA, only a few of the extant introns have open reading frames (ORFs) greater than 500 nt.5 While all introns may have contained ORFs at one time that were subsequently deleted by non-homologous recombination, only a few cases of vestigial ORFs have been documented in rRNA introns (in Naegleria64 and in the order Bangiales14 ). Second, this mechanism requires that Figure 8. Locations of reverse splicing targets in E. coli 23 S rRNA. Integration sites of the Tetrahymena group I intron RNA when overexpressed in E. coli47,48 are indicated by white spheres on the crown (left) and back (right) views of the 50 S subunit.28 Large spheres indicate frequent reaction sites. The 23 S rRNA is shown as a ribbon colored by domain: blue, domain I; cyan, domain II; yellow, domain III; green, domain IV; red, domain V; pink, domain VI; magenta, 5 S rRNA. Proteins are not shown. rRNA Introns in 3D 47
  • 14. the endonuclease be properly expressed in the recipient organism, which could be problematic in cases of intron transfer between organelles.16 Transposition via reverse splicing An alternative to protein-dependent mecha- nisms is that introns integrate into rRNAs by reverse splicing.46 This could lead to transposition of the intron to novel sites in the genome, follow- ing reverse transcription and recombination with the intron-containing cDNA. An appealing aspect of this mechanism is that integration of the intron into the rRNA would be immediately sensitive to the conformation of the rRNA.24 Another advan- tage is that group I reverse splicing is much less site-specific than reactions catalyzed by homing endonucleases.48 Despite the appeal of RNA-based intron trans- position, the biochemical support for such a mechanism is weak. Reverse splicing of the Tetrahymena group I intron into rRNA in E. coli was observed by RT-PCR,65 but stable integration of a group I intron by reverse splicing has not yet been demonstrated. Group II introns also reverse splice, but the target is allelic DNA rather than RNA.66 In addition, we find no correlation between the frequency of introns and solvent accessibility of the rRNA insertion site. Thus, if reverse splicing contributes significantly to the spectrum of introns found in rRNA genes, then the intron must react with pre-rRNA that is not fully assembled into a ribosomal complex or the target rRNA must transi- ently unfold. Finally, reverse splicing products of the Tetrahymena intron in the E. coli 23 S rRNA are distributed differently than natural rRNA introns with respect to the structure of the 50 S sub- unit. This suggests that the ability of certain sequences to serve as reverse splicing substrates is not sufficient to describe the actual distribution of rRNA introns. Retention of introns in dynamic regions of the rRNA An alternative possibility is that introns are inserted randomly within the rRNA genes, but retained only at certain positions. Many random integration products are expected to be unstable, either because of recombination with cDNA or other rDNA repeats or simply because the intron is not spliced effectively from its new location, inactivating the rRNA. When a homing endo- nuclease is continually expressed, aggressive homing will permit even deleterious rRNA inser- tions to be maintained.4 However, once the ORF is not expressed (as in most rRNA introns), introns that reduce fitness will be lost. Accurate splicing, degradation of the excised intron RNA (to prevent cleavage of the spliced exons)67 and proper folding of the rRNA are all necessary to maintain host cell functions. It is interesting to consider whether introns are more easily retained in positions near the subunit interface and tRNA binding sites because these nucleotides undergo structural changes during translation.68 Flexible sites should accommodate introns more easily than rigid sites, because in many cases the secondary structure of the mature rRNA must unfold to permit splice site recognition.24,26 Conversely, the presence of highly structured introns may either interfere less severely with ribosome assembly or even enhance folding of the rRNA. A striking example of introns in a structurally dynamic region of the rRNA occurs in domain IV of the 23 S rRNA. Helix 69 (nt 1905–1920) is dis- ordered in crystals of the 50 S ribosome, suggesting that it remains mobile under these conditions.28,30 In the 70 S ribosome, this stem–loop and adjacent sequences form an important bridge to the decod- ing site in the 30 S subunit.31 Many group I introns and a few archaeal introns are found between positions 1915 and 1932 of the 23 S rRNA (Figure 3(a)). Experiments on the group I intron of Tetrahymena (position 1925) have shown that the conformation of the rRNA exons is intimately linked to self-splicing activity.25,69 Similarly, refold- ing of helix 69 is required for excision of the archaeal intron at position 1927 in Staphylothermus marinus.26 In addition to these examples, many other intron-containing sites are in mobile regions of the rRNA. For example, the L11-binding region of the 23 S rRNA is disordered in crystals of 50 S ribosomes,28,30 and is also highly mobile.31 Within this region (nt 1050–1108), five sites contain group I introns, and one site contains an archaeal intron. Positions 2449 and 2500 in the 23 S rRNA are part of the peptidyl transferase center and are thought to undergo conformational changes during ribo- some translocation.70 In the 16 S rRNA, many introns are inserted in the 790 loop and in the 30 terminal helix (positions 1506, 1512 and 1516), which forms an inter-subunit bridge near the P-site tRNA.31 Position 516 contains a large number of group I introns; this residue is part of the con- served 530 loop that undergoes conformational changes during mRNA decoding.71,72 Regulatory role for introns? Finally, pre-rRNA splicing may even be an additional mechanism for regulating ribosome activity, providing another link between RNA processing and translation. As introns are not universally present among all members of a phylo- genetic group, their function cannot be essential to translation. Yet, they may provide useful regu- latory functions under the growth conditions encountered by specific organisms. Regulation could occur during or after subunit assembly. First, the presence of an intron in the pre-ribosomal RNA could arrest ribosome assembly, either by sterically blocking tertiary contacts (e.g. in the 48 rRNA Introns in 3D
  • 15. interior of the 50 S subunit), or by altering local secondary structures.24 As introns are rarely found in ribosomal protein binding sites, unspliced introns would affect protein binding only indirectly via structural changes in the rRNA. Second, introns could be required for modification of the rRNA in some organisms, as in some tRNAs.73,74 Third, introns could inhibit the trans- lational activity of nascent subunits. By analogy to proenzymes, splicing would be required for acti- vation. Introns inserted in residues at the inter- subunit interface, for example, might not prevent assembly of individual subunits, but would prevent the subunit from forming productive 70 S ribosomes.75 In summary, our analysis confirms the statistical correlation between the positions of rRNA introns and the structure and conservation of the 16 S and 23 S rRNAs. It is now apparent that this bias results in the preferential location of introns in conserved rRNA residues that are clustered around tRNA binding sites in the three-dimensional structure of the 70 S ribosome. At present, it is difficult to rationalize the rRNA intron distribution based only on the selectivity of known intron insertion mechanisms. This suggests that the extant positions are selected by either a need to maintain efficient splicing or the stringent requirements of pre-rRNA maturation and ribosome assembly. Both of these effects may favor introns in dynamic regions of the rRNA. Methods rRNA intron database The rRNA intron sequences were collected, analyzed, and assembled into the relational database for the CRW project, as described in detail.5 Only sequences that are more than 90% complete and publicly available are included in the database. As of February 2002, the database contained 8865 rRNA sequences and 1285 annotated rRNA intron sequences. The positions of rRNA introns are numbered according to the nucleotide 50 of the intron in the sequence of the E. coli 16 S or 23 S rRNA (GenBank accession number J01695). Our determi- nation of the intron types utilized the descriptions of sequence and secondary structure of group I,6 group IE,33 group II,45 spliceosomal34 and archaeal introns.26 The spliceosomal introns noted here are defined by homology to pre-mRNA introns;9 the biochemical mechanism of splicing has not yet been established. Intron-containing rRNA positions in three- dimensional ribosome structures Structural data were obtained from the Protein Data Bank (PDB).† Introns in 16 S or 16 S-like rRNA genes were mapped onto the three-dimensional structure of the T. thermophilus 30 S ribosome (1FJF).29 Introns in 23 S or 23 S-like rRNA genes were mapped on the structure of the H. marismortui 50 S ribosome (1JJ2).76 Three intron positions (1956, 1958 and 1962) lie in regions of the 23 S rRNA that are not visible in the electron density map.76 These were denoted with spheres at nearby residues: 1956 shown as 1950, 1958 as 1951, 1962 as 1965. Intron positions were represented by a CPK rendering of the 30 -hydroxyl oxygen atom of the rRNA residue preceding the intron insertion site. Molecular graphics analyses were carried out using Insight II (Molecular Simulations Inc.). Figures were prepared using Ribbons77 and Insight II. Spatial distribution of rRNA positions The spatial distribution of rRNA residues in the T. thermophilus 70 S ribosome was determined from the atomic coordinates of phosphorus atoms (1GIY and 1GIX).31 For each group of rRNA residues analyzed, we computed the center of mass and the standard deviation of the x, y and z coordinates. The “cluster score” is the sum of the x, y and z standard deviations of the x, y and z coordinates. A lower cluster score represents a tighter three-dimensional clustering of nucleotides. Intron inser- tions were represented by the 50 phosphate group of the nucleotide 30 of the intron. To compute clustering of intron sequences, all sequence entries in the CRW rRNA intron database were used. To compute the clustering of intron and rRNA positions, each rRNA position was counted once, regardless of the number of intron sequences that correspond to an insertion at that position. Values for the 30 S and 50 S subunits were computed using only residues from the 16 S or 23 S RNAs, respectively. Distance from tRNA binding sites Subsets of rRNA positions were defined based on their distance from the combined A, P and E site tRNAs in the T. thermophilus 70 S ribosome (1GIY and 1GIX). Distances were computed from the atomic coordinates of the phosphorus atoms of the rRNAs and the bound tRNAs,31 or the bound tRNAs and mRNA (see website).42 † Distance cut-offs were set between 5 and 100 A˚ ; all rRNA phosphate groups are within 120 A˚ of the tRNAs. The total number of rRNA residues and the number of intron-containing sequences within the subset defined by each distance cutoff are plotted in Figure 6. These rRNA positions are listed on the website associ- ated with this work (see below). Solvent-accessibility The ASA was computed for each atom in the 50 S (1JJ2) and 30 S (1FJF) ribosomes, using the program Calc-surf.78 A sphere of radius 1.40 A˚ was chosen to simulate solvation by a water molecule. The surface areas shown in Figure 7 were computed for each entire subunit with protein side-chains, but without water molecules and heteroatoms. The ASA values for 30 oxygen atoms were binned and plotted in Figure 7. Further information available on website Copies of the manuscript Figures and Tables and additional Figures and Tables related to this work are available from the CRW Site‡. The intron sites on the † http://www.rcsb.org/pdb/ ‡ http://www.rna.icmb.utexas.edu/ANALYSIS/ INT3D/ rRNA Introns in 3D 49
  • 16. 16 S and 23 S rRNAs (as of February 2002) and related information are shown on three sets of secondary structure diagrams (PostScript and PDF formats) and images of the three-dimensional crystal structures of the ribosome. Five dimensions of information are highlighted in this presentation: (A) intron types (i.e. group I, group II, archaeal, spliceosomal, and unknown); (B) the number of introns per site (for 3D images only); (C) rRNA sites within 10.0 A˚ of tRNA/mRNA; (D) extent of conserva- tion of the 16 S and 23 S rRNAs; and (E) structural domains from the 16 S and 23 S rRNA secondary structure models are distinguished. The three-dimen- sional images can be viewed interactively using RasMol.79 Data tables include the source data for Tables 1 and 2, as well as additional data not discussed here. Acknowledgements This work was supported by grants from the NIH (GM48207 to R.R.G. and GM46866 to S.A.W.), NSF (MCB-0110252 to R.R.G.), the Welch Foun- dation (F-1427 to R.R.G.) and The Institute for Cellular and Molecular Biology at The University of Texas at Austin (to R.R.G.). References 1. Turmel, M., Mercier, J. P. & Cote, M. J. (1993). Group I introns interrupt the chloroplast psaB and psbC and the mitochondrial rrn L gene in Chlamydomonas. Nucl. Acids Res. 21, 5242–5250. 2. Gerbi, S. A., Gourse, R. L. & Clark, C. G. (1982). Conserved regions within ribosomal DNA: location and some possible functions. In The Cell Nucleus (Busch, H. & Rothblum, L., eds), vol. 10, pp. 351–386, Academic Press, New York. 3. Noller, H. F., Kop, J., Wheaton, V., Brosius, J., Gutell, R. R., Kopylov, A. M. et al. (1981). Secondary struc- ture model for 23 S ribosomal RNA. Nucl. Acids Res. 9, 6167–6189. 4. Johansen, S., Muscarella, D. E. & Vogt, V. M. (1996). Insertion elements in ribosomal DNA. In Ribosomal RNA: Structure, Evolution, Processing, and Function in Protein Biosynthesis (Zimmermann, R. A. & Dahlberg, A. E., eds), pp. 89–108, CRC Press, Boca Raton, FL. 5. Cannone, J. J., Subramanian, S., Schnare, M. N., Collett, J. R., D’Souza, L. M., Du, Y. et al. (2002). The Comparative RNA Web (CRW) site: an online data- base of comparative sequence and structure infor- mation for ribosomal, intron, and other RNAs. BMC Bioinformatics, 3, 2. 6. Michel, F. & Westhof, E. (1990). Modelling of the three-dimensional architecture of group I catalytic introns based on comparative sequence analysis. J. Mol. Biol. 216, 585–610. 7. Michel, F., Umesono, K. & Ozeki, H. (1989). Com- parative and functional anatomy of group II catalytic introns–a review. Gene, 82, 5–30. 8. Lykke-Andersen, J., Aagaard, C., Semionenkov, M. & Garrett, R. A. (1997). Archaeal introns: splicing, intercellular mobility and evolution. Trends Biochem. Sci. 22, 326–331. 9. Cubero, O. F., Bridge, P. D. & Crespo, A. (2000). Terminal-sequence conservation identifies spliceosomal introns in ascomycete 18 S RNA genes. Mol. Biol. Evol. 17, 751–756. 10. Belfort, M. & Perlman, P. S. (1995). Mechanisms of intron mobility. J. Biol. Chem. 270, 30237–30240. 11. Paquin, B., Kathe, S. D., Nierzwicki-Bauer, S. A. & Shub, D. A. (1997). Origin and evolution of group I introns in cyanobacterial tRNA genes. J. Bacteriol. 179, 6798–6806. 12. Bhattacharya, D., Surek, B., Rusing, M., Damberger, S. & Melkonian, M. (1994). Group I introns are inherited through common ancestry in the nuclear- encoded rRNA of Zygnematales (Charophyceae). Proc. Natl Acad. Sci. USA, 91, 9916–9920. 13. De Jonckheere, J. F. (1994). Evidence for the ancestral origin of group I introns in the SSUrDNA of Naegleria spp. J. Eukaryot. Microbiol. 41, 457–463. 14. Muller, K. M., Cannone, J. J., Gutell, R. R. & Sheath, R. G. (2001). A structural and phylogenetic analysis of the group IC1 introns in the order Bangiales (Rhodophyta). Mol. Biol. Evol. 18, 1654–1667. 15. Biniszkiewicz, D., Cesnaviciene, E. & Shub, D. A. (1994). Self-splicing group I intron in cyanobacterial initiator methionine tRNA: evidence for lateral trans- fer of introns in bacteria. EMBO J. 13, 4629–4635. 16. Lonergan, K. M. & Gray, M. W. (1994). The ribosomal RNA gene region in Acanthamoeba castellanii mito- chondrial DNA. A case of evolutionary transfer of introns between mitochondria and plastids? J. Mol. Biol. 239, 476–499. 17. Turmel, M., Cote, V., Otis, C., Mercier, J. P., Gray, M. W., Lonergan, K. M. & Lemieux, C. (1995). Evolutionary transfer of ORF-containing group I introns between different subcellular compartments (chloroplast and mitochondrion). Mol. Biol. Evol. 12, 533–545. 18. Hibbett, D. S. (1996). Phylogenetic evidence for hori- zontal transmission of group I introns in the nuclear ribosomal DNA of mushroom-forming fungi. Mol. Biol. Evol. 13, 903–917. 19. Bhattacharya, D., Cannone, J. J. & Gutell, R. R. (2001). Group I intron lateral transfer between red and brown algal ribosomal RNA. Curr. Genet. 40, 82–90. 20. Dujon, B. (1989). Group I introns as mobile genetic elements: facts and mechanistic speculations–a review. Gene, 82, 91–114. 21. Woese, C. R. (1987). Bacterial evolution. Microbiol. Rev. 51, 221–271. 22. Benson, D. A., Karsch-Mizrachi, I., Lipman, D. J., Ostell, J., Rapp, B. A. & Wheeler, D. L. (2002). GenBank. Nucl. Acids Res. 30, 17–20. 23. Turmel, M., Gutell, R. R., Mercier, J. P., Otis, C. & Lemieux, C. (1993). Analysis of the chloroplast large subunit ribosomal RNA gene from 17 Chlamydomonas taxa. Three internal transcribed spacers and 12 group I intron insertion sites. J. Mol. Biol. 232, 446–467. 24. Woodson, S. A. & Cech, T. R. (1991). Alternative secondary structures in the 50 exon affect both for- ward and reverse self-splicing of the Tetrahymena intervening sequence RNA. Biochemistry, 30, 2042–2050. 25. Rocheleau, G. A. & Woodson, S. A. (1995). Enhanced self-splicing of Physarum polycephalum intron 3 by a second group I intron. RNA, 1, 183–193. 26. Kjems, J. & Garrett, R. A. (1991). Ribosomal RNA introns in archaea and evidence for RNA confor- mational changes associated with splicing. Proc. Natl Acad. Sci. USA, 88, 439–443. 50 rRNA Introns in 3D
  • 17. 27. Kuo, T. C. & Herrin, D. L. (2000). A kinetically efficient form of the Chlamydomonas self-splicing ribosomal RNA precursor. Biochem. Biophys. Res. Commun. 273, 967–971. 28. Ban, N., Nissen, P., Hansen, J., Moore, P. B. & Steitz, T. A. (2000). The complete atomic structure of the large ribosomal subunit at 2.4 A˚ resolution. Science, 289, 905–920. 29. Wimberly, B. T., Brodersen, D. E., Clemons, W. M., Jr, Morgan-Warren, R. J., Carter, A. P. et al. (2000). Structure of the 30 S ribosomal subunit. Nature, 407, 327–339. 30. Schluenzen, F., Tocilj, A., Zarivach, R., Harms, J., Gluehmann, M., Janell, D. et al. (2000). Structure of functionally activated small ribosomal subunit at 3.3 A˚ resolution. Cell, 102, 615–623. 31. Yusupov, M. M., Yusupova, G. Z., Baucom, A., Lieberman, K., Earnest, T. N., Cate, J. H. & Noller, H. F. (2001). Crystal structure of the ribosome at 5.5 A resolution. Science, 292, 883–896. 32. Brosius, J., Dull, T. J., Sleeter, D. D. & Noller, H. F. (1981). Gene organization and primary structure of a ribosomal RNA operon from Escherichia coli. J. Mol. Biol. 148, 107–127. 33. Suh, S. O., Jones, K. G. & Blackwell, M. (1999). A group I intron in the nuclear small subunit rRNA gene of Cryptendoxyla hypophloia, an ascomycetous fungus: evidence for a new major class of group I introns. J. Mol. Evol. 48, 493–500. 34. Bhattacharya, D., Lutzoni, F., Reeb, V., Simon, D., Nason, J. & Fernandez, F. (2000). Widespread occur- rence of spliceosomal introns in the rDNA genes of ascomycetes. Mol. Biol. Evol. 17, 1971–1984. 35. Nikoh, N. & Fukatsu, T. (2001). Evolutionary dynamics of multiple group I introns in nuclear ribo- somal RNA genes of endoparasitic fungi of the genus Cordyceps. Mol. Biol. Evol. 18, 1631–1642. 36. Everett, K. D., Kahane, S., Bush, R. M. & Friedman, M. G. (1999). An unspliced group I intron in 23 S rRNA links Chlamydiales, chloroplasts, and mito- chondria. J. Bacteriol. 181, 4734–4740. 37. Woese, C. R., Magrum, L. J., Gupta, R., Siegel, R. B., Stahl, D. A., Kop, J. et al. (1980). Secondary structure model for bacterial 16 S ribosomal RNA: phylo- genetic, enzymatic and chemical evidence. Nucl. Acids Res. 8, 2275–2293. 38. Joseph, S., Weiser, B. & Noller, H. F. (1997). Mapping the inside of the ribosome with an RNA helical ruler. Science, 278, 1093–1098. 39. Cate, J. H., Yusupov, M. M., Yusupova, G. Z., Earnest, T. N. & Noller, H. F. (1999). X-ray crystal structures of 70 S ribosome functional complexes. Science, 285, 2095–2104. 40. Skold, S. E. (1983). Chemical crosslinking of elongation factor G to the 23 S RNA in 70 S ribo- somes from Escherichia coli. Nucl. Acids Res. 11, 4923–4932. 41. Moazed, D., Robertson, J. M. & Noller, H. F. (1988). Interaction of elongation factors EF-G and EF-Tu with a conserved loop in 23 S RNA. Nature, 334, 362–364. 42. Yusupova, G. Z., Yusupov, M. M., Cate, J. H. & Noller, H. F. (2001). The path of messenger RNA through the ribosome. Cell, 106, 233–241. 43. Davies, R. W., Waring, R. B., Ray, J. A., Brown, T. A. & Scazzocchio, C. (1982). Making ends meet: a model for RNA splicing in fungal mitochondria. Nature, 300, 719–724. 44. Michel, F., Jacquier, A. & Dujon, B. (1982). Compari- son of fungal mitochondrial introns reveals extensive homologies in RNA secondary structure. Biochimie, 64, 867–881. 45. Michel, F. & Ferat, J. L. (1995). Structure and activi- ties of group II introns. Annu. Rev. Biochem. 64, 435–461. 46. Woodson, S. A. & Cech, T. R. (1989). Reverse self- splicing of the Tetrahymena group I intron: impli- cation for the directionality of splicing and for intron transposition. Cell, 57, 335–345. 47. Roman, J. & Woodson, S. A. (1998). Integration of the Tetrahymena group I intron into bacterial rRNA by reverse splicing in vivo. Proc. Natl Acad. Sci. USA, 95, 2134–2139. 48. Roman, J., Rubin, M. N. & Woodson, S. A. (1999). Sequence specificity of in vivo reverse splicing of the Tetrahymena group I intron. RNA, 5, 1–13. 49. Bhattacharya, D., Friedl, T. & Damberger, S. (1996). Nuclear-encoded rDNA group I introns: origin and phylogenetic relationships of insertion site lineages in the green algae. Mol. Biol. Evol. 13, 978–989. 50. DePriest, P. T. (1993). Small subunit rDNA variation in a population of lichen fungi due to optional group-I introns. Gene, 134, 67–74. 51. Schroeder-Diedrich, J. M., Fuerst, P. A. & Byers, T. J. (1998). Group-I introns with unusual sequences occur at three sites in nuclear 18 S rRNA genes of Acanthamoeba lenticulata. Curr. Genet. 34, 71–78. 52. Sogin, M. L., Ingold, A., Karlok, M., Nielsen, H. & Engberg, J. (1986). Phylogenetic evidence for the acquisition of ribosomal RNA introns subsequent to the divergence of some of the major Tetrahymena groups. EMBO J. 5, 3625–3630. 53. Perez-Gonzalez, C. E. & Eickbush, T. H. (2001). Dynamics of R1 and R2 elements in the rDNA locus of Drosophila simulans. Genetics, 158, 1557–1567. 54. Munoz, E., Villadas, P. J. & Toro, N. (2001). Ectopic transposition of a group II intron in natural bacterial populations. Mol. Microbiol. 41, 645–652. 55. Bryk, M., Belisle, M., Mueller, J. E. & Belfort, M. (1995). Selection of a remote cleavage site by I-tevI, the td intron-encoded endonuclease. J. Mol. Biol. 247, 197–210. 56. Muscarella, D. E. & Vogt, V. M. (1993). A mobile group I intron from Physarum polycephalum can insert itself and induce point mutations in the nuclear ribo- somal DNA of Saccharomyces cerevisiae. Mol. Cell. Biol. 13, 1023–1033. 57. Edgell, D. R., Belfort, M. & Shub, D. A. (2000). Barriers to intron promiscuity in bacteria. J. Bacteriol. 182, 5281–5289. 58. Belfort, M. & Roberts, R. J. (1997). Homing endo- nucleases: keeping the house in order. Nucl. Acids Res. 25, 3379–3388. 59. Lin, J. & Vogt, V. M. (1998). I-PpoI, the endonuclease encoded by the group I intron PpLSU3, is expressed from an RNA polymerase I transcript. Mol. Cell. Biol. 18, 5809–5817. 60. Zimmerly, S., Guo, H., Perlman, P. S. & Lambowitz, A. M. (1995). Group II intron mobility occurs by tar- get DNA-primed reverse transcription. Cell, 82, 545–554. 61. Eskes, R., Yang, J., Lambowitz, A. M. & Perlman, P. S. (1997). Mobility of yeast mitochondrial group II introns: engineering a new site specificity and retro- homing via full reverse splicing. Cell, 88, 865–874. rRNA Introns in 3D 51
  • 18. 62. Cousineau, B., Lawrence, S., Smith, D. & Belfort, M. (2000). Retrotransposition of a bacterial group II intron. Nature, 404, 1018–1021. 63. Guo, H., Karberg, M., Long, M., Jones, J. P., 3rd, Sullenger, B. & Lambowitz, A. M. (2000). Group II introns designed to insert into therapeutically rele- vant DNA target sites in human cells. Science, 289, 452–457. 64. Vader, A., Naess, J., Haugli, K., Haugli, F. & Johansen, S. (1994). Nucleolar introns from Physarum flavicomum contain insertion elements that may explain how mobile group I introns gained their open reading frames. Nucl. Acids Res. 22, 4553–4559. 65. Roman, J. & Woodson, S. A. (1998). Integration of the Tetrahymena group I intron into bacterial rRNA by reverse splicing in vivo. Proc. Natl Acad. Sci. USA, 95, 2134–2139. 66. Yang, J., Zimmerly, S., Perlman, P. S. & Lambowitz, A. M. (1996). Efficient integration of an intron RNA into double-stranded DNA by reverse splicing. Nature, 381, 332–335. 67. Margossian, S. P., Li, H., Zassenhaus, H. P. & Butow, R. A. (1996). The DExH box protein Suv3p is a component of a yeast mitochondrial 30 - to 50 -exoribo- nuclease that suppresses group I intron toxicity. Cell, 84, 199–209. 68. Wilson, K. S. & Noller, H. F. (1998). Molecular movement inside the translational engine. Cell, 92, 337–349. 69. Woodson, S. A. & Emerick, V. L. (1993). An alterna- tive helix in the 26 S rRNA promotes excision and integration of the Tetrahymena intervening sequence. Mol. Cell. Biol. 13, 1137–1145. 70. Polacek, N., Patzke, S., Nierhaus, K. H. & Barta, A. (2000). Periodic conformational changes in rRNA: monitoring the dynamics of translating ribosomes. Mol. Cell, 6, 159–171. 71. Powers, T. & Noller, H. F. (1994). Selective pertur- bation of G530 of 16 S rRNA by translational miscod- ing agents and a streptomycin-dependence mutation in protein S12. J. Mol. Biol. 235, 156–172. 72. Lodmell, J. S. & Dahlberg, A. E. (1997). A confor- mational switch in Escherichia coli 16 S ribosomal RNA during decoding of messenger RNA. Science, 277, 1262–1267. 73. Strobel, M. C. & Abelson, J. (1986). Effect of intron mutations on processing and function of Saccharomyces cerevisiae SUP53 tRNA in vitro and in vivo. Mol. Cell. Biol. 6, 2663–2673. 74. Choffat, Y., Suter, B., Behra, R. & Kubli, E. (1988). Pseudouridine modification in the tRNA(Tyr) anti- codon is dependent on the presence, but indepen- dent of the size and sequence, of the intron in eucaryotic tRNA(Tyr) genes. Mol. Cell. Biol. 8, 3332–3337. 75. Nikolcheva, T. & Woodson, S. A. (1997). Association of a group I intron with its splice junction in 50 S ribosomes: implications for intron toxicity. RNA, 3, 1016–1027. 76. Klein, D. J., Schmeing, T. M., Moore, P. B. & Steitz, T. A. (2001). The kink-turn: a new RNA secondary structure motif. EMBO J. 20, 4214–4221. 77. Carson, M. (1997). Ribbons. Methods Enzymol. 277B, 493–505. 78. Gerstein, M. (1992). A resolution-sensitive procedure for comparing protein surfaces and its application to the comparison of antigen-combining sites. Acta Crystallog. sect. A, 48, 271–276. 79. Bernstein, H. J. (2000). Recent changes to RasMol, recombining the variants. Trends Biochem. Sci. 25, 453–455. Edited by J. Doudna (Received 30 April 2002; received in revised form 15 August 2002; accepted 19 August 2002) 52 rRNA Introns in 3D