Académique Documents
Professionnel Documents
Culture Documents
point was that all deletions ended up in one group (be it the plus group or the minus
group), and all additions ended up in the other group.
Having so classified the mutations, crick and coworkers next performed the critical
experiment. They isolated recombinants carrying various combinations of plus and
minus mutations. When two plus mutstions were present in a recombinant, its
phenotype was always mutant. The same was true in the case of recombinants with
two minus mutations. However, recombinans contraining three plus mutations
(b) or three minus mutations frequently had wild mean type phenotypes. Thus,
the wild-type reading frame for the distal portion of the gene was restored by three
single base-pair additions or three single base-pair deletions, but was altered by
either one or two base-pair additions or deleions. These results are most easily
explained if each codon containts three bases.
Mutants. Recombinants carrying three (+)mutations (Fig. 10.30b) or three (-)
mutations, howover, often had wild-type phenotypes. This indicated that the
additions of three base-pairs or the deletion of three base-pairs left the distal
portion of the gene with the correct (wild-type) reading frame, a result that would
be expected only if each codon contained three nucleotides.
Confirmation that the coding ratio (nucleotidas to amino acids) is indeed three has
come from many sources. Considerable evidence favoring a triplet code evolved
from studies using in vitro translation systems. The following observations were of
major importance. (1) trinucleotides were found sufficient to stimulate specific
binding of aminoacly-tRNAs to ribosomes. For example, 5-UUC-3 stimulates
ribosomal binding of phenylalanyl-tRNA phe. (2) chemically synthesized RNA
molecules, containing repeating dinucleotide sequences, directed the synthesis of
copolymers withalternating amino acid sequences. Polu (UG) n, for example, when
used as an artificial mRNA in an in vitro system, directed the synthesis of copolymer
(cys-val)n. (3) molecules with repeating trinucleotide sequences, on the other hand,
directed the synthesis of mixture ojfthree homopolymers (initiation being random on
such an mRNA in an in vitro system ). Poly (UGG)n, for example, directedthe
synthesis of a mixture of polyserine, polyarginine, and polyvaline. Againd, these
results are only consistent white a triplet code. Ultimately, the triplet nature of code
was difinitivelyestablished by the results of correlated nucliec acid and protein
sequence (e.g., see Fig. 10.31 and chapter 12,Fig 12.26).
Deciphering the Code
The decipering of the genetic code-that is, determining (1) which codons specify
which amino acids, (2) how many of the 64 posible codons are used, (3) how the
code is punctuated, and (4) whether different species use the same or bdifferent
codons-took place during the early 1960s and was one of the most exciting periodes
in the history of science. The craking of the genetic code had an effect on the life
sciences like the splitting of the atom did on the phlysical sciences. It opened up a
vast new field of study of gene expression.
The first major breakthrough came in 1961 when M.W. Nirenberg (1968 Nobel Prize
recipient) and J.H. Matthaei and then S. Ochoa (1995 Nobel Prize recipient) and
cowokers demonstrated that synthetic RNA molecules could be used as artificial
mRNAs to direct in vitro protein synthesis. That is, when ribosomes, aminoaclytRNAs, and the soluble proten factors required for translation are purified free of
natural mRNAs, these components can be cambined in vitro and stimulated to
synthesize polypeptides by the addition of chemically synthesized RNA molecules. If
these synthetic mRNA molecules are of known composition, the composition of the
polypeptides synthesized can be used to deduce which codons specify which amino
acids.
The first codon assignment (UUU for phenylalanine) was made when Nirenberg and
Matthaei demonstated that polyuridylic acid [polu U + (U)n ]directed the synthesis
of polyphenylalnine [(phenylalnine)n]. Ochoa and others continued this approach
using synthetic RNAs with random sequences of know nucleotide composition, such
as 50 percent U and 50 percent G. The friquencies of the different triplets in such a
rondom copolymer can be easily calculated. For example, the 50 percent U/50
percent G copolymer will caontain 12.5 percent (1/2 x 1/2x1/2=1/8) of each of the
eightpossible codont: UUU, UUG,UGU,GUU,,UGG,GUG,GGU and GGG. These can
then be compared with the amino acidsancorporated (phenylalnine, leucine,
cysteine, valine, tryptophan, and glycine) when this rondom copolymer is use in an
in vinto protein-synthesizing system. By verying the composition, for example, to
75 percent U and 25 percent G, one can very the relative with the relative
frequences of the eigth codons and correlate them with the relative frequencies of
the amino acids in the polypeptides synthesized. Such experiments provided agreat
deal of information about tha nature of the code.
More definitive data were later obtained by H. G. Khorana using in vitron systems
that were activated by synthetic mRNAs of know nucleotide sequences. Khoranas
experiments permitted direct camparisons between nucleotide sequences and the
amino acids incorporated in response to these sequences. The ultimate crecking
of the code accured when trinusleotides were found to function as mine-mRNAs in
directing the specific binding of aminoacyl-tRNAs to ribosomes. By using all the 64
possible trinucleotide sequences in such amino-tRNA binding experiments, it was
possible to veryfi the codon assignment made from data of earlier experiments.
On the basis of extensive data accumulated aver several years, the codon
assignments shown in Table 10.1 became firmaly established. Two important
question remained to be answered. (1) are the assigments based on in vitro
experiments valid in vivo? (2) is the code universal, that is, do the codons specify
the same amino acids in all organisms? Several lines of evidence now indicate that
these codon assigments are correct for protein synthesis in vivo for most, if not all,
species. When the amino acid substitutions that result from mutations induced with
chemical mutagens with specific mutagenic affects (see Chapter 11) are determined
by amino acid sequencing, the substitutions are almost always consistent with the
codon assigments given in table 10.1 and the known affect of the mutagen. More
convingcingly, when the nucleotide sequences of genesor of mRNAs are determined
and compored with the amino acid sequences of the polypeptides encoded by those
genes or mRNAs, the observed correlations are always found to be those predicted
from the accepted codon assigments (Table 10.1). this can be illustrated by
comparing the nucleotide sequence of the gene coding for the protein coat or
capsid of bacteriophage MS2 with the amino acid sequence of the capsid
polypeptide (Fig. 10.31). phage MS2 stores its genetic information in RNA (like TMV
virus; see Chapter 5, pp. 96-97). Its chromosome is equivalent to an mRNA molecule
in organisms with DNA genomes. (Also see Chapter 12, Fig. 12.26.)
Figure 10.31
Correlated nucleotide sequence of the coat protein gene of the RNA bacteriophage
MS2 and the amino acid sequence of the poly peptide (coat protein) that is
specifies. The initial sequences of the MS2 replicase (RNA polymerase) gene and the
correlated six amino-terminal amino acids are also shown. An untranslated
interhanec squences separates the genes. Translation proceeds from left to rigth as
drawn, and from the coat gene to the replicase gene. Bolh polypeptides are
initiated by f-methionene at the AUG codons andicated. The methionine is cleaved
off following (or during) translation, yielding the alanine terminus on the coat
protein and the serine terminus on the replicase. The coat pretein gene has two
tandem periodes (two termination codons) at the end, as though to make
adsolutely certain that translation terminates at this point. In addition, a third
termination codon is located,in proper reading frame, seven base triplets from the
second tandem termination. Each of the three termination codons is present once
between the translated sequence of the coat gene and the translated sequenceof
replicase gene. Note that the amino acid sequence of this protein, synthesized in
vivo, is precisely that predicted from the nucleotide squence using the codon
assigment presented in Table 10.1. (Data from W. Min Jou, G. Haegeman, Y.
Ysebaert, and W. Fiers, Nature 237: 82-88, 1972)
Degeneracy and Wobble
All the amino acids except metionine and trytophan are specified by more that one
codon (Table 10.1). three amino acids, leucine, serine, and arginine, are each
specified by six different codons. Isolaucine has three codons. The other amino
acids each have either two of four codons. The accorence of more that one codon
per amino acid is called degeneracy (though the usual connotations of the term are
hardly approtiate). The degeneracy in the genetic code is not at rodom; instead, it is
highly ordered. Usually, the