Académique Documents
Professionnel Documents
Culture Documents
Francois Dragon, Universite du Quebec a` Montreal, Quebec, Canada . Primary and Secondary Structures and Conserved
Elements of Small Nucleolar RNAs
Vincent Lemay, Universite du Quebec a` Montreal, Quebec, Canada . Role of snoRNAs in Pre-rRNA Processing and
Modification
Christian Trahan, Universite du Quebec a` Montreal, Quebec, Canada . snoRNA-associated Proteins
. Expression of snoRNAs
The small nucleolar (sno) ribonucleic acids (RNAs) are mainly involved in the modification . snoRNA Maturation Pathways
and processing of ribosomal RNA precursors, however, some are used to modify small . Small Cajal Body-specific RNAs
nuclear RNAs (snRNAs), messenger RNAs (mRNAs) and transfer RNAs (tRNAs). They . Possible Implications of snoRNAs-modified Targets
function in the form of ribonucleoproteins (snoRNPs). . Origin and Evolution of snoRNAs
ENCYCLOPEDIA OF LIFE SCIENCES & 2006, John Wiley & Sons, Ltd. www.els.net 1
snoRNAs: Biogenesis, Structure and Function
(a)
U
Box D UG
A
G
5 C
A
U Box C
G
A
m m
rRNA 3 N N N N NN 5
3
3
snoRNA 5 N N N N NN CUGA 3
5-16
A m Box
G
U D or D
Box C A
G
U
C
U 5
U
R G
N Box D
A
5 3
(b)
snoRNA
5
3 rRNA 5
3
Figure 1 Structure of guide snoRNAs and mode of action. The schematic secondary structure of snoRNAs are presented on the left; the blue line
represents the target RNA that gets modified. The conserved sequences are boxed and indicated in colour. Models for the selection of specific position to be
modified are shown on the right. In (a), box C/D snoRNAs are characterized by conserved motifs C and D (green) that form a kink-turn (K-turn) or alternate
boxes C and D (orange) that could also form a K-turn; 2-O-ribose methylation is performed on the rRNA residue that is base-paired to the fifth position
upstream from box D (or D). In (b), H/ACA snoRNAs adopt a hairpin-hinge-hairpin-tail structure where box H (red) is found in the hinge region and box
ACA (red) is found three nucleotides upstream of the 3- end; each hairpin usually contains an internal loop called pseudouridylation pocket where C
formation in rRNA occurs on the first unpaired U residue upstream from box H or ACA (usually at a distance of 14 to 16 nucleotides from the box).
(RPR1) can aect pre-rRNA processing, suggesting that events (Figure 2). The great majority of snoRNAs are
RNase P could have a function in the nucleolus. Because of involved in modication rather than processing of pre-
their striking similarity, it has been proposed that RNase P rRNAs: box C/D snoRNAs direct the 2-O-ribose methyl-
and RNase MRP RNAs evolved from a common ancestor. ation of specic residues, whereas box H/ACA snoRNAs
See also: Cellular RNAs: varied roles; Ribonucleases; RNA target specic uridines that are converted into pseudouri-
structure; Transfer RNA synthesis and regulation dines (C), an isomerization process called pseudouridylat-
ion. Only a fraction of snoRNAs are required for the
cleavage reactions. The box C/D U3, U13, U14 and U22
snoRNAs are involved in 18S rRNA production, and U8
Role of snoRNAs in Pre-rRNA Processing participates in processing of the 5.8S and 28S rRNAs.
and Modification Processing reactions involving U3, U8, U13 and U14 like
those participating in modication require specic base
The small- and large-subunit rRNAs are transcribed as pairing interactions between the pre-rRNA and the
part of a large precursor that is subsequently cleaved and snoRNPs. The box H/ACA snoRNAs snR10, snR30/
trimmed to release mature rRNA products, the 18S rRNA, U17 and E3 participate in 18S rRNA synthesis but their
which is part of the small subunit (SSU), and the 5.8S and role is not well dened. It has been shown in yeast that
2528S rRNAs, which are found in the large subunit RNase MRP is involved in the cleavage reaction upstream
(LSU) of the ribosome. In contrast to pre-mRNAs, which of 5.8S rRNA, and RNase P could be involved in cleaving
begin to be processed (spliced) before completion of syn- downstream of the 5.8S rRNA (see Figure 2). Processing
thesis, pre-rRNAs are generally fully transcribed before complexes form early after transcription initiation; they
cleavages occur. Post-transcriptional modications are can be visualized under the electron microscope and ap-
made on the precursors, not the mature products, and pear as granules (or terminal knobs) at the edge of the
some modications could be prerequisites to processing so-called Christmas trees in chromatin spreads of rRNA
2
snoRNAs: Biogenesis, Structure and Function
Guide snoRNPs
( 2-O-ribose methylation,
pseudouridylation)
Me Me Me
Processing snoRNPs
(processome)
E3 U22 U8
Me Me Me
Me Me Me
18 S 5.8S 2528 S
Figure 2 Pre-rRNA modification and processing. The large pre-rRNAs encode the small ribosomal subunit rRNA (18S) and the large ribosomal subunit
rRNAs (5.8S and 2528S). The coding sequences are separated by spacer sequences named 5ETS (5 external transcribed spacer), ITS1 (the internal
transcribed spacer 1), ITS2 (internal transcribed spacer 2), and 3ETS (3 external transcribed spacer). The guide snoRNPs are involved in modification
reactions; C/D snoRNPs direct 2-O-ribose methylation (Me), and H/ACA snoRNPs direct pseudouridylation (C). The processing snoRNPs (represented by
spheres) are involved in cleavage reactions. Cleavages are likely orchestrated by a large complex containing many snoRNPs, which is referred to as the
processome (by analogy with the spliceosome). The transcribed spacers are removed by a series of endo- and exonucleolytic reactions, and mature rRNAs
are liberated and packaged into ribosomal subunits.
transcription units. Terminal knob formation and subse- The box C/D snoRNAs generally contain long se-
quent processing requires the U3 snoRNA and all of its quences (1021 nucleotides) that are complementary to
associated proteins; this very large U3 complex ( 80S) rRNA. These sequences are located upstream from box D
contains over 30 dierent proteins and it has been coined (or D) and specify the site of 2-O-methylation in rRNA:
the SSU processome. Because of its large size and function the position complementary to the fth nucleotide from
in early processing events, it is very likely that the SSU box D gets methylated (Figure 1a). The box H/ACA sno-
processome corresponds to the terminal knob seen at the 5 RNAs select the target rRNA sequences by forming two
end of nascent pre-rRNA transcripts. See also: mRNA short helical structures (310 base pairs) that are separated
splicing: regulated and dierential; Nucleotides: uncom- by two unpaired rRNA nucleotides. The rst unpaired
mon, modied and synthetic; rRNA structure base in the rRNA sequence (in a 5 to 3 orientation) is a U
3
snoRNAs: Biogenesis, Structure and Function
residue that gets isomerized into C. The C is always po- bind specically to RNA in vitro, but gains specicity
sitioned 1416 nucleotides from box H or ACA (Figure 1b). when associated with the other H/ACA proteins. All four
H/ACA proteins are essential for cell viability and pre-
rRNA maturation. Except for GAR1, these proteins
snoRNA-associated Proteins are also necessary for H/ACA snoRNA accumulation
in vivo.
The snoRNAs function in the form of ribonucleoproteins It has been known for sometime that human RNase
(RNPs). Box C/D snoRNAs share four common proteins: MRP and RNase P share the Th/To antigen, a protein of
Fibrillarin (named Nop1p in yeast), NOP5/NOP58, 40 kDa that is recognized by autoimmune sera of pa-
NOP56 and 15.5 K (Snu13p in yeast). Fibrillarin is an tients suering from scleroderma. However, most of the
abundant nucleolar protein, and an autoimmune antigen known protein constituents of these RNases have been
in humans. It is also well conserved among eukaryotes, and discovered in the yeast S. cerevisiae. The rst characterized
even has a homologue in archaebacteria. Nop1p is essential protein subunit of yeast RNase MRP was Snm1p: this
for growth in yeast, and mutations in Nop1p can prevent protein is MRP-specic, it is essential for growth, and it
pre-rRNA processing and 2-O-methylation. A methylt- contains a zinc-cluster domain, a leucine zipper motif and a
ransferase-like domain has been identied in Nop1 from serine/lysine-rich tail. Yeast RNase MRP shares many
the archaebacterium Methanococcus jannaschii, and it is common proteins with RNase P, namely Pop1p, Pop3p,
assumed that this protein is responsible for the methylase Pop4p, Pop5p, Pop6p, Pop7p (Rpp20 in humans), Pop8p
activity of C/D snoRNPs. The yeast Nop5/Nop58p and and Rpp1p (Rpp30 in humans); each one of them is
Nop56p proteins are essential for growth, and depletion of essential for viability. Rpr2p is RNase P-specic and es-
Nop5/Nop58p leads to decreased levels of C/D snoRNAs sential for its activity. Human proteins Rpp30 and Rpp38
and defects in pre-rRNA processing. Snu13p (15.5 K) has cross-react with autoimmune patient sera, and it has been
been shown to bind the K-turn motif formed by boxes C proposed that Rpp38 ( 38 kDa) could be the Th/To
and D, and this protein plays a nucleating role in RNP antigen. See also: Autoimmune disease; Autoimmune
assembly. disease: genetics
Box H/ACA snoRNAs, including the H/ACA-like do-
main of vertebrate telomerase RNA (see above), also
share four common proteins: dyskerin (named Cbf5p in Expression of snoRNAs
yeast), GAR1, NHP2 and NOP10. These four proteins
can form a complex before binding H/ACA snoRNAs. Eukaryotic cells have developed dierent strategies to ex-
Cbf5p has strong homology to the Escherichia coli pro- press snoRNAs. Indeed, snoRNAs are not only transcribed
tein truB, a pseudouridine synthase that converts U55 from independent genes, but they can also be encoded
into C55 in most tRNAs. Thus, Cbf5p could be the within introns of pre-messenger RNAs (pre-mRNAs), as
enzyme responsible for the U to C isomerization in gene clusters or intronic gene clusters (Figure 3). See also:
pre-rRNAs. Apart from the C synthase signature, Messenger RNA in eukaryotes
dyskerin and Cbf5p contain a nuclear localization signal
(NLS) and an acidic/lysine-rich domain (KKE/D mo- snoRNAs encoded by independent genes
tifs), which is also found in other nucleolar proteins. In-
terestingly, mutations in dyskerin cause a genetic disease Most snoRNAs are encoded by independent genes in yeast
called dyskeratosis congenita, which is characterized by (S. cerevisiae). However, there are seven intronic genes and
a progressive bone marrow failure, abnormal skin pig- ve polycistronic snoRNA gene clusters in this organism.
mentation, nail dystrophy and mucosal leucoplakias; Most of the snoRNAs produced from independent genes
note that mutations in the RNA component of human are transcribed by RNA polymerase II and thus contain a
telomerase have been linked to a dominant, autosomal 7-methylguanosine (m7G) cap at their 5 end. The m7G
form of the disease. GAR1 is composed of a highly con- gets further methylated to form a 2,2,7-methyl guanosine
served core domain, which contains all the elements (m3G) cap, also called TMG (trimethylguanosine) cap.
required for its subnuclear localization and function. This particular cap structure is also typical of spliceosomal
The core region is anked by glycine- and arginine-rich snRNAs and telomerase RNA. There are some exceptions
(GAR) domains; these domains (as well as the GAR however, like the 7-2/MRP RNA and plant U3 snoRNA,
domain of brillarin) interact with survival of motor which are produced by RNA polymerase III and therefore
neuron (SMN), a protein that plays a critical role in have no conventional cap structure. In humans and yeast,
the biogenesis of various nuclear RNPs. NOP10 is a U3 is transcribed by RNA polymerase II. Note that yeast
small protein of about 60 amino acids with no known MRP RNA may be transcribed by RNA polymerase II.
motif. NHP2 is very similar to 15.5 K (Snu13p), and See also: Caps on eukaryotic mRNAs; mRNA splicing:
these proteins could have evolved from a common an- role of snRNAs; RNA polymerase II holoenzyme and
cestor (see below). In contrast to 15.5 K, NHP2 does not transcription factors; Spliceosomal machinery
4
snoRNAs: Biogenesis, Structure and Function
Yeast
Independent P P Metazoa
Plants
Animals
Intronic P Exon Exon Exon Exon Exon Yeast
Plants
Yeast
Gene cluster P Plants
Trypanosomes
Intronic
gene cluster P Exon Exon Plants
Figure 3 Expression of snoRNAs. Various types of snoRNA production are illustrated: synthesis from independent genes, pre-mRNA introns, gene clusters
and intronic gene clusters. The promoter regions (P) are shown as white boxes, exons (E) as grey boxes, snoRNA coding sequences as coloured boxes and
snoRNAs as coloured arrows.
snoRNAs encoded within pre-mRNA introns splicing: regulated and dierential; Protein-coding seg-
ments: evolution of exonintron gene structure
In contrast to yeast snoRNAs, the great majority of verte-
brate snoRNAs are encoded within introns of pre-mRNA. Polycistronic snoRNAs
Of all known human snoRNAs, only four (U3, U8, U13 and
7-2/MRP) are encoded by independent genes. A pre-mRNA In plants, most snoRNAs are produced from polycistrons,
can harbour many snoRNAs (generally one snoRNA per thus more than one snoRNA is found in a transcription
intron but this can vary; see below), which allows the pro- unit. In yeast for example, snR190 and U14 are cotran-
duction of a large number of snoRNAs from a relatively scribed; endonucleolytic cleavage of the polycistronic
small number of genes. The genes containing intronic sno- RNA by Rnt1p (the orthologue of bacterial RNase III)
RNAs usually encode proteins involved in ribosome syn- generates pre-snR190 and pre-U14 RNAs, which get fur-
thesis or function, i.e. nucleolar proteins, ribosomal proteins ther trimmed at their 5 and 3 ends to produce mature
or translation factors. However, some snoRNA host genes snoRNAs. The most striking examples of polycistronic
do not encode proteins; their spliced poly(A)+ RNA has no snoRNAs are found in trypanosomes: up to nine dierent
protein-coding potential, thus the functional product of snoRNAs have been found in a single polycistron (which is
these genes is contained in the introns only. Interestingly, the repeated multiple times in tandem), and since no exons are
primary transcripts of genes hosting snoRNAs usually start present in the inter-snoRNA regions, it is quite certain that
with a C residue followed by a stretch of pyrimidines, which endonucleolytic cleavages, not splicing, are responsible for
is characteristic of the 5-terminal oligopyrimidine (5 TOP) the production of individual snoRNAs. Exceptionally in
mRNAs coding for ribosomal and some house-keeping plants, some snoRNAs are found as intronic gene clusters.
proteins. It is known that the 5 TOP mRNAs are transi- In contrast to vertebrate and yeast, there is more than one
tionally regulated, but since some snoRNA hosts do not snoRNA per intron. See also: Gene expression in plants;
code for proteins, it is possible that the 5 oligopyrimidine Gene expression in yeast
tracts at the transcription start site also function as tran-
scriptional signals in order to regulate the production of
snoRNAs with that of proteins of the translational appa- snoRNA Maturation Pathways
ratus. The observation that dierent organisms can encode a
particular snoRNA from a dierent gene strongly suggests Rapid and ecient production of large quantities of sno-
that intronic snoRNAs are mobile. See also: mRNA RNAs may be crucial for cell survival. The snoRNAs
5
snoRNAs: Biogenesis, Structure and Function
expressed from independent gene units have strong pro- are highly conserved. These motifs have been found in
moters, and the intronic snoRNAs are usually found in humans, Drosophila and Arabidopsis, although the impor-
highly expressed genes. Upon splicing, the excised intron is tance of the CAB boxes has not been conrmed experi-
released in the form of a lariat. After debranching of the mentally in the last two organisms.
intron lariat, the intronic RNA is degraded from both 5 It has been observed that box C/D snoRNAs assemble
and 3 ends by exonucleases acting in the 5!3 and in the or mature in CBs before transiting to the nucleolus. Some
3!5 directions. The exonucleolytic trimming of introns data tend to show that it could be the same for H/ACA
stops at the borders of the snoRNA, most likely due to the snoRNAs. For example, over expressed U64 snoRNA in
snoRNA sequence being packaged into an snoRNP. transiently transfected HeLa cells appears rst in CBs and
Processing of plant intronic snoRNA gene clusters re- later in nucleoli, and human telomerase RNA, which con-
quires endonucleolitic activity, with or without splicing of tains an H/ACA-like domain, transits to CBs before ac-
the intron. Specic signals are required for faithful process- cumulating in the nucleoli of transfected Xenopus oocytes.
ing of intronic snoRNAs, namely boxes C and D, and the For H/ACA scaRNAs, which reside and meet their targets
terminal helix for C/D snoRNAs, and boxes H and ACA, in the CBs, the CAB box could serve as a retention signal.
and the adjacent stems for H/ACA snoRNAs. These sig- No CAB box has been identied in scaRNAs of the C/D
nals may serve as binding sites for snoRNP proteins. The class, however, they most probably contain a specic el-
general mechanism for the production of intronic snoRNA ement that allow their retention in CBs, whereas genuine
has some exceptions. There are examples where snoRNAs C/D snoRNAs are able to accumulate in the nucleolus.
located in poorly spliced pre-mRNAs are excised from the
intron by endonucleases, and the remaining extra nucleo-
tides anking the snoRNA are then removed by exonuc-
leases. Since the nal steps of maturation are the result of
Possible Implications of snoRNAs-
exonucleolytic trimming, intronic and polycistronic sno- modified Targets
RNAs have a monophosphate at their 5 end and a free
hydroxyl group at their 3 end. See also: Genes: denition As seen above, the majority of snoRNAs act as guides for
and structure the site-specic modication of rRNAs. The function of
modied nucleosides in rRNAs is not known; however, the
modied residues are clustered in evolutionarily conserved
Small Cajal Body-specific RNAs and functional regions of rRNAs, and it has been hypoth-
esized that they could be required for folding or some other
Cajal bodies (CBs) are spherical nuclear organelles that aspects of ribosome function. Likewise, many modied
are often seen juxtaposed or near the nucleolus. CBs ap- nucleosides are found in functionally important regions of
pear to be sites of biogenesis or recycling of various spliceosomal snRNAs, supporting the idea that modica-
RNPs. These organelles harbour a specic subset of tions confer important folding or other functional advan-
small RNAs called scaRNAs (small Cajal body-specic tages. Addition of methyl groups generates more
RNAs) that are structurally and functionally indistin- hydrophobic surfaces and usage of C residues increases
guishable from snoRNAs. Some of them contain one the hydrogen bonding potential (three H bonds instead of
C/D or one H/ACA domain, like U90 and U91 or U92, two). These properties might contribute to rRNA folding
respectively. There is also one scaRNA harbouring two or interaction with proteins. In the same order of idea,
H/ACA domains in tandem (U93). However, most processing snoRNAs could serve as chaperons necessary
scaRNAs are hybrids since they are composed of one for proper rRNA folding which might be essential for
H/ACA and C/D core domain, such as U85, U87, U88 cleavage reactions. One can also imagine that modica-
and U89. The scaRNAs are believed to contain all of the tions such as 2-O-methylation are made to protect crucial
RNP proteins known to be associated to C/D and H/ regions of rRNA from endonucleolytic degradation. Al-
ACA snoRNAs, although this assumption has only been ternatively, the use of snoRNAs that base pair throughout
veried for brillarin and GAR1 proteins. the molecule could ensure proofreading and proper folding
Most scaRNAs are implicated in post-transcriptional of critical rRNA sequences, and modications would only
modications of polII-transcribed spliceosomal snRNAs constitute check marks. Eukaryotic rRNAs are bigger and
(U1, U2, U4 and U5). These modication events take place contain many more modications then prokaryotic
in the CBs, with the exception of the polIII-transcribed U6 rRNAs; larger (eukaryotic) rRNAs may require more
snRNA, which is most probably modied in the nucleolus quality control checkpoints. It is worth noting that deletion
by snoRNAs. The H/ACA scaRNAs possess two CB lo- of individual or even multiple modication guide sno-
calization signals (CAB boxes) that are located in the ter- RNAs is not lethal in yeast, suggesting that modications
minal loop of the 5 and 3 hairpins. The consensus are not critical for function; however it is not known what
sequence of the CAB boxes is ugAG, where the rst ug would happen if tens of snoRNAs were missing. See also:
nucleotides are variable but the last AG nucleotides RNA editing; RNA synthesis
6
snoRNAs: Biogenesis, Structure and Function