Académique Documents
Professionnel Documents
Culture Documents
639
Exon
1
2a
2b
3
4
5
6
7
8
M
H
M
H
M
H
M
H
M
H
M
H
M
H
M
H
M
H
3 Splice Junction
Exon Size
(bp)
Nucleotide Nos.
Amino Acids
Amino Acid
No.
5 Splice Junction
Intron Size
(bp)
tttctccctcttcag/AGT
tcttaccctttccag/AGC
ttactgttttcatag/CAT
ttcctattttcgtag/CAT
tcttytcttttgtag/TGG
tgatttcttttgtag/TGG
cttttctttttaaag/AAT
tttttctttttaaag/AAT
aaatattctttatag/CCA
aaatattccttatag/CCA
tttttccatttgcag/ATA
tttttctgtctccag/ATA
atgctctctttacag/GGT
ttattttccttacag/GGT
ttcttwcaaattcag/gtt
atttctcatttgcag/gaa
72
81
72
72
120
120
201
201
148
153
96
96
111
111
53
56
338
570
1,3331,404
121,152121,265
3,0243,095
107,423107,494
3,3073,426
104,826104,945
4,2444,444
103,776103,976
4,6534,799
103,464103,616
7,5207,615
101,584101,679
8,7998,909
100,161100,271
12,12612,177
94,34494,397
13,81414,154
93,32893,899
124
127
2548
2851
4988
5291
89115
92158
156204
1592209
205236
210241
237273
242278
274288
279294
24
27
24
24
40
40
67
67
49
51
32
32
37
37
15*
16*
CAG/gtaaggtcgcggctc
CAG/gtgaggtcgcagcca
AAG/gtatgaaatggttaa
AAG/gtatgaaatgcttgn
CAG/gtgatagattgattt
CAG/gttattttaaaatgt
GAG/gtaggccccgaaaga
GAG/gtaaggatacaaaaa
AAG/gtaaatgttctgttg
AAG/gtaaaccttctatga
CCA/gtaagtacacaagag
CCA/gtaagtaaaaaagag
ATG/gtaaggactccctgg
ATG/gtaagtaatcactca
gaa/gtaagtctgtcattt
gga/gtaagtctgccagca
1,619
13,841
211
2,667
817
1,168
208
511
2,720
2,031
1,183
1,517
3,216
5,926
1,636
1,068
Note: Nucleotide numbering for the human sequence corresponds to Chen et al. (1998). The asterisks indicate a translational stop codon.
640
Fig. 3. Amino acid sequence homology of SMNrelated proteins. SMN protein sequences from human
(U18423), mouse (U77714, U63294), canine
(U50746), rat (U75369), zebrafish (AF083557), C. elegans-related protein (Z81048), S. pombe-related protein (Z54354), and a partial bovine SMN (AF016590, AF026810) were
aligned with CLUSTAL. The exon boundaries are shown above the sequence, the SIP-1 (aa13-44), Sm (aa240-267), and SMN oligomerization
(aa249-278) domains are indicated by bold, double, and triple underlines,
respectively. Dashes represent breaks in the actual amino acid sequence of
the proteins to maximize sequence alignment. The murine Smn protein is
97%, 85%, 83%, and 52% identical to rat, canine, human, and zebrafish
orthologs, respectively.
of unique sequence, owing to the density and distribution of repetitive elements. In the future, as more of the mouse genome is
sequenced, it would be interesting to analyze how the density and
distribution of repetitive elements within gene targeting constructs
affect HR frequency. For the present, our findings are valuable for
those studying the in vivo consequences of SMA missense mutations, since most of these are located in exon 6. Furthermore, given
the repetitive nature of this region, the sequence information provided by these studies will facilitate PCR-based screening strategies to identify HR events.
In conclusion, the work presented illustrates how the nucleotide sequence of a gene can be used in a variety of ways from
determining gene structure to creating and identifying subtle mutations in vivo. Moreover, this sequence can now be used for
interspecies sequence comparisons to identify conserved domains
within SMN that regulate gene expression. If such domains do
exist, they may provide a target for the development of novel
therapies for the treatment of SMA.
Acknowledgments. Special thanks to Ken Morgan and Mary Fujiwara for
helpful discussions, and Sebastien Meilleur for technical assistance. This
work was supported by grants from the Medical Research Council (MRC)
of Canada and Families of SMA. C.J. DiDonato is an MRC postdoctoral
fellow, and L.R. Simard is a Fonds de la Recherche en Sante du Quebec
Scholar.
References
Altschul S, Warren G, Miller W, Myers E, Lipman D (1990) Basic local
alignment search tool. J Mol Biol, 215, 403410
Battaglia G, Princivalle A, Forti F, Lizier C, Zeviani M (1997) Expression
of the SMN gene, the spinal muscular atrophy determining gene, in the
mammalian central nervous system. Hum Mol Genet 6, 19611971
Boshart M, Kluppel M, Schmidt A, Schutz G, Luckow B (1992) Reporter
constructs with low background activity utilizing the cat gene. Gene 110,
129130
Brahe C, Clermont O, Zappata S, Tiziano F, Melki J et al. (1996) Frameshift mutation in the survival motor neuron gene in a severe case of SMA
type I. Hum Mol Genet 5, 19711976
Breathnach R, Benoist C, OHare K, Gannon F, Chambon P (1978) Ovalbumin gene: evidence for a leader sequence in mRNA and DNA sequences at the exon-intron boundaries. Proc Natl Acad Sci USA 75,
49534957
Burge C, Karlin S (1997) Prediction of complete gene structures in human
genomic DNA. J Mol Biol 268, 7894
Burglen L, Lefebvre S, Clermont O, Burlet P, Viollet L et al. (1996)
Structure and organization of the human survival motor neurone (SMN)
gene. Genomics 32, 479482
Bussaglia E, Clermont O, Tizzano E, Lefebvre S, Burglen L et al. (1995)
A frame-shift deletion in the survival motor neuron gene in Spanish
spinal muscular atrophy patients. Nat Genet 5, 335337
Chen Q, Baird SD, Mahadevan M, Besner-Johnston A, Farahani R et al.
(1998) Sequence of a 131-kb region of 5q13.1 containing the spinal
muscular atrophy candidate genes SMN and NAIP. Genomics 48, 121
127
Coovert DD, Le TT, McAndrew PE, Strasswimmer J, Crawford TO et al.
(1997) The survival motor neuron protein in spinal muscular atrophy.
Hum Mol Genet 6, 12051214
DiDonato CJ, Chen XN, Noya D, Korenberg JR, Nadeau JH et al. (1997)
Cloning, characterization, and copy number of the murine survival motor neuron gene: homolog of the spinal muscular atrophy-determining
gene. Genome Res 7, 339352
641
Fischer U, Liu Q, Dreyfuss G (1997) The SMN-SIP1 complex has an
essential role in spliceosomal snRNP biogenesis. Cell 90, 10231029
Francis JW, Sandrock AW, Bhide PG, Vonsattel JP, Brown RH Jr (1998)
Heterogeneity of subcellular localization and electrophoretic mobility of
survival motor neuron (SMN) protein in mammalian neural cells and
tissues. Proc Natl Acad Sci USA 95, 64926497
Gennarelli M, Lucarelli M, Capon F, Pizzuti A, Merlini L et al. (1995)
Survival motor neuron gene transcript analysis in muscles from spinal
muscular atrophy patients. Biochem Biophys Res Commun 213, 342
348
Hahnen ET, Wirth B (1996) Frequent DNA variant in exon 2a of the
survival motor neuron gene (SMN): a further possibility for distinguishing the two copies of the gene. Hum Genet 98, 122123
Lefebvre S, Burglen L, Reboullet S, Clermont O, Burlet P et al. (1995)
Identification and characterization of a spinal muscular atrophydetermining gene. Cell 80, 155165
Lefebvre S, Burlet P, Liu Q, Bertrandy S, Clermont O et al. (1997) Correlation between severity and SMN protein level in spinal muscular
atrophy. Nat Genet 16, 265269
Lester LB, Coghlan VM, Nauert B, Scott JD (1996) Cloning and characterization of a novel A-kinase anchoring protein. AKAP 220, association
with testicular peroxisomes. J Biol Chem 271, 94609465
Liu Q, Dreyfuss G (1996) A novel nuclear structure containing the survival
of motor neurons protein. EMBO J 15, 35553565
Liu Q, Fischer U, Wang F, Dreyfuss G (1997) The spinal muscular atrophy
disease gene product, SMN, and its associated protein SIP1 are in a
complex of spliceosomal snRNP proteins. Cell 90, 10131021
Lorson CL, Strasswimmer J, Yao JM, Baleja JD, Hahnen E et al. (1998)
SMN oligomerization defect correlates with spinal muscular atrophy
severity. Nat Genet 19, 6366
Munsat TM, Davies KE (1992) Meeting report: International SMA consortium Meeting. Neuromusc Disorders 2, 423428
Pugh BF, Tijan R (1991) Transcription from a TATA-less promoter requires a multisubunit TFIID complex. Genes Dev 5, 19351945
Rochette CF, Surh LC, Ray PN, McAndrew PE, Prior TW et al. (1991)
Molecular diagnosis of non-deletion SMA patients using quantitative
PCR of SMN exon 7. Neurogenetics 1, 101107
Schrank B, Gotz R, Gunnersen JM, Ure JM, Toyka KV et al. (1997)
Inactivation of the survival motor neuron gene, a candidate gene for
human spinal muscular atrophy, leads to massive cell death in early
mouse embryos. Proc Natl Acad Sci USA 94, 99209925
Schug J, Overton G (1997) TESS: Transcription Element Search Software
on the WWW Technical Report CBIL-TR-1997-1001-v0.0, of the
Computational Biology and Informatics Laboratory, School of Medicine, University of Pennsylvania
Solovyev VV, Salamov AA, Lawrence CB (1994a) The prediction of human exons by oligonucleotide composition and discriminant analysis of
spliceable open reading frames. In The Second International Conference
on Intelligent Systems for Molecular Biology, Altman R, Brutlag D,
Karp R, Latrop R, Searls D (eds.) (Menlo Park, Calif.: AAAI Press) p
354362
Solovyev VV, Salamov AA, Lawrence CB (1994b) Predicting internal
exons by oligonucleotide composition and discriminant analysis of
spliceable open reading frames, Nucleic Acids Res 22, 51565163
Solovyev VV, Salamov AA, Lawrence CB (1995) Identification of human
gene structure using linear discriminant functions and dynamic programming. In Proceedings of the Third International Conference on Intelligent Systems for Molecular Biology. Rawling C, Clark D, Altman R,
Hunter L, Lengauer T, Wodak S (eds.) (Cambridge, England: AAAI
Press) p 367375
Viollet L, Bertrandy S, Bueno Brunialti AL, Lefebvre S, Burlet P et al.
(1997) cDNA isolation, expression, and chromosomal localization of the
mouse survival motor neuron gene (Smn). Genomics 40, 185188