Académique Documents
Professionnel Documents
Culture Documents
Chapters 4, 5 and 6
Amino acids
Amino acids are the building blocks of proteins
Proteins are the most versatile macromolecule in living organism Crucial functions: Catalyst, Transport, Store other molecules like oxygen, Mechanical support Immune protection Generate body movement Transmit nerve impulse Cell growth and differentiation
There are 20 commonly occurring amino acids in nature that form all these diverse classes of proteins
Amino acids
What are the structures and properties of amino acids? What are the acid-base properties of amino acids? What reactions do amino acids undergo? What are the optical and stereochemical properties of amino acids? What are the spectroscopic properties of amino acids? How are amino acid mixtures separated and analyzed? What is the fundamental structural pattern in proteins?
Figure 5.6 Bovine pancreatic ribonuclease A contains 124 amino acid residues,
Structures and Properties of Amino Acids, the Building Blocks of Proteins? Amino acids contain a central tetrahedral carbon atom There are 20 common amino acids Amino acids can join via peptide bonds Several amino acids occur only rarely in proteins Some amino acids are not found in proteins
Figure 4.1 Anatomy of an amino acid. Except for proline and its derivatives, all of the amino acids commonly found in proteins possess this type of structure.
Figure 4.6 The ionic forms of the amino acids, shown without consideration of any ionizations on the side chain. The cationic form is the low pH form, and the titration of the cationic species with base yields the zwitterion and finally the anionic form.
Figure 4.2 Two amino acids can react with loss of a water molecule to form a covalent bond. The bond joining the two amino acids is called a peptide bond.
You should know names, structures, pKa values, 3-letter and 1-letter codes Aliphatic amino acids Aromatic amino acids Polar, uncharged amino acids Acidic amino acids Basic amino acids
Figure 4.3 Some of the nonpolar (hydrophobic) amino acids.
Figure 4.3 Some of the polar, uncharged amino acids. Figure 4.3 Some of the nonpolar (hydrophobic) amino acids.
Figure 4.3 The acidic amino acids. Figure 4.3 Some of the polar, uncharged amino acids.
We'll see some of these in later chapters Selenocysteine in many organisms Pyrrolysine in several archaeal species Hydroxylysine, hydroxyproline - collagen Carboxyglutamate - blood-clotting proteins Pyroglutamate in bacteriorhodopsin GABA, epinephrine, histamine, serotonin act as neurotransmitters and hormones Phosphorylated amino acids a signaling device
Figure 4.4 (b) Some amino acids are less common, but nevertheless found in certain proteins. Hydroxylysine and hydroxyproline are found in connective-tissue proteins; carboxyglutamate is found in blood-clotting proteins; pyroglutamate is found in bacteriorhodopsin (see Chapter 9).
Amino acids
Properties of Amino Acids Acid-Base Properties What Reactions Do Amino Acids Undergo? Peptide bond formation Optical and Stereochemical Properties Spectroscopic Properties
Figure 4.4 (c) Several amino acids that act as neurotransmitters and hormones.
Amino Acids are Weak Polyprotic Acids The degree of dissociation depends on the pH of the medium H2A+ + H2O = HA0 + H3O+
The second dissociation (the amino group in the case of glycine): HA0 + H2O = A + H3O+
K a1 =
[HA ][H 3O ] [H 2 A + ]
Ka2
[A ][H 3O + ] = [HA 0 ]
Figure 4.7 Titration of glycine, a simple amino acid. The isoelectric point, pI, the pH where the molecule has a net charge of 0, is defined as (pK1+ pK2)/2.
A Sample Calculation
What is the pH of a glutamic acid solution if the alpha carboxyl is 1/4 dissociated? pH = 2 + log10 [1] [3] pH = 2 + (-0.477) pH = 1.523
Figure 4.8 Titration of lysine.
What is the pH of a lysine solution if the side chain amino group is 3/4 dissociated?
pH = 10.5 + log10
[3] [1]
Carboxyl groups form amides & esters Amino groups form Schiff bases and amides Side chains show unique reactivities Cys residues can form disulfides and can be easily alkylated Few reactions are specific to a single kind of side chain
Figure 4.9 Typical reactions of the common amino acids (see text for details).
Figure 4.10 The pathway of the ninhydrin reaction, which produces a colored product called Ruhemanns Purple that absorbs light at 570 nm. Note that the reaction involves and consumes two molecules of ninhydrin.
Cysteine
N-Ethylmaleimide
Iodoacetate
Acrylonitrile
Ellmans reagent
p-Hydroxy
Lysine
Figure 4.12 Enantiomeric molecules based on a chiral carbon atom. Enantiomers are nonsuperimposable mirror images of each other.
Figure 4.13 The configuration of the common L-amino acids can be related to the configuration of L(-)glyceraldehyde as shown. These drawings are known as Fischer projections. The horizontal lines of the Fischer projections are meant to indicate bonds coming out of the page from the central carbon, and vertical lines represent bonds extending behind the page from the central carbon atom.
Figure 4.14 The stereoisomers of isoleucine and threonine. The structures at the far left are the naturally occurring isomers.
Spectroscopic Properties
All amino acids absorb at infrared wavelengths Only Phe, Tyr, and Trp absorb UV Absorbance at 280 nm is a good diagnostic device for amino acids NMR spectra are characteristic of each residue in a protein, and high resolution NMR measurements can be used to elucidate three-dimensional structures of proteins
The assignment of (R) and (S) notation for glyceraldehyde and L-alanine .
Figure 4.15 The ultraviolet absorption spectra of the aromatic amino acids at pH 6. (From Wetlaufer, D.B., 1962. Ultraviolet spectra of proteins and amino acids. Advances in Protein Chemistry 17:303 390.)
Figure 4.16 Proton NMR spectra of several amino acids. Zero on the chemical shift scale is defined by the resonance of tetramethylsilane (TMS). (Adapted from Aldrich Library of NMR Spectra.)
Mikhail Tswett, a Russian botanist, first separated colorful plant pigments by chromatography Many chromatographic methods exist for separation of amino acid mixtures Ion exchange chromatography High-performance liquid chromatography
Figure 4.18 Cation (a) and anion (b) exchange resins commonly used for biochemical separations.
Figure 4.19 Operation of a cation exchange column, separating a mixture of Asp, Ser, and Lys. (a) The cation exchange resin in the beginning, Na+ form. (b) A mixture of Asp, Ser, and Lys is added to the column containing the resin. (c) A gradient of the eluting salt (e.g., NaCl) is added to the column. Asp, the least positively charged amino acid, is eluted first. (d) As the salt concentration increases, Ser is eluted. (e) As the salt concentration is increased further, Lys, the most positively charged of the three amino acids, is eluted last.
Figure 4.21 Chromatographic fractionation of a synthetic mixture of amino acids on ion exchange columns using Amberlite IR-120, a sulfonated polystyrene resin similar to Dowex-50. A second column with different buffer conditions is used to resolve the basic amino acids. (Adapted from Moore, S., Spackman, D., and Stein, W., 1958. Chromatography of amino acids on sulfonated polystyrene resins. Analytical Chemistry 30:11851190.)
Figure 5.1 Anatomy of an amino acid. Except for proline and its derivatives,
all of the amino acids commonly found in proteins possess this type of structure.
Figure 4.2
The -COOH and NH3+ groups of two amino acids can react with the resulting loss of a water molecule to form a covalent amide bond.
NH3+ groups of two amino acids can react with the resulting loss of a water molecule to form a covalent amide bond.
Figure 5.2 Anatomy of an amino acid. Except for proline and its derivatives,
all of the amino acids commonly found in proteins possess this type of structure.
The Coplanar Nature of the Peptide Bond Six atoms of the peptide group lie in a plane!
Peptides
Short polymers of amino acids Each unit is called a residue 2 residues - dipeptide 3 residues - tripeptide 12-20 residues - oligopeptide many - polypeptide
Figure 5.4
Anatomy of an amino acid. Except for proline and its derivatives, all of the amino acids commonly found in proteins possess this type of structure.
Peptide bond
By Convention: The amino end is taken to be the beginning of polypeptide chain Ala-Gly-Trp (AGW) is tripeptide Trp-Gly-Ala (WGA) is also a tripeptide AGW and WGA are different tripeptide
Protein
One or more polypeptide chains One polypeptide chain - a monomeric protein More than one - multimeric protein Homomultimer - one kind of chain Heteromultimer - two or more different chains Hemoglobin, for example, is a heterotetramer It has two alpha chains and two beta chains
10
Protein
One or more polypeptide chains One polypeptide chain - a monomeric protein More than one - multimeric protein Homomultimer - one kind of chain Heteromultimer - two or more different chains
Chapter 5
Proteins: Their primary structure and Biological functions
Outline
What is the fundamental structural pattern in proteins? What architectural arrangements characterize protein structure? How are proteins isolated and purified from cells? How is the amino acid analysis of proteins performed? How is the primary structure of a protein determined? Can polypeptides be synthesized in the laboratory? What is the nature of amino acid sequences? Do proteins have chemical groups other than amino acids? What are the many biological functions of proteins?
Bovine pancreatic ribonuclease A contains 124 amino acid residues,. Four intrachain disulfide bridges (S-S) form crosslinks in this polypeptide between
11
Secondary Structures
Through H- bonding interactions between adjacent amino acids residues, polypeptide chain arrange themselves in a characteristic helical or pleated sheet architecture Extend along one dimension, like coils of a spring
Alpha- hlix Beta-Sheet
Local structures Both of these structures owe their stability to the formation of hydrogen bonds between NH and O=C functions along the polypeptide backbone.
Tertiary Structures
Polypeptide chains bend and fold to obtain a more compact three dimensional shape- Tertiary (3o) structure is generated Globular structures are generated when 3o is formed Globular structures: lowest surface to volume ratio, minimizing the interaction of the protein with surrounding environment
Important to note
Primary structure is determined by the covalently linked amino acid residues in the polypeptide backbone 2o, 3o and 4o structures are determined by the non covalent forces such as H-bonds, ionic interactions, van der Waals and hydrophobic interactions
Figure 5.5 Hemoglobin is a tetramer consisting of two and two polypeptide chains.
All the information necessary for the protein molecule to achieve its intricate architecture is contained within its primary structure
12
The solubility is markedly influenced by pH and ionic strength. solubility of a typical protein as a function of pH and salt concentrations.
Increasing ionic strength at first increases the solubility of proteins (salting-in), then decreases it (salting-out)
Frederick Sanger was the first - in 1953, he sequenced the two chains of insulin. Sanger's results established that all of the molecules of a given protein have the same sequence. Proteins can be sequenced in two ways: - real amino acid sequencing - sequencing the corresponding DNA in the gene
13
The hormone insulin consists of two polypeptide chains, A and B, held together by two disulfide cross-bridges (SS). The A chain has 21 amino acid residues and an intrachain disulfide; the B polypeptide contains 30 amino acids. The sequence shown is for bovine insulin.
Step 1:
Separation of chains Subunit interactions depend on weak forces Separation is achieved with: - extreme pH - 8M urea - 6M guanidine HCl - high salt concentration (usually ammonium sulfate)
Step 2:
Cleavage of Disulfide bridges Performic acid oxidation Sulfhydryl reducing agents - mercaptoethanol - dithiothreitol or dithioerythritol - to prevent recombination, follow with an alkylating agent like iodoacetate
14
Step 3A:
Identify N- and C-terminal residues
N-terminal analysis: Edman's reagent Phenylisothiocyanate derivatives are phenylthiohydantoins ( PTH derivatives)
Chromatographic method can be used to identify the PTH derivative Importantly, in this procedure, rest of the polypeptide remains intact And can be re-subjected to further round of Edmans degradation to identify successive amino acids Automated Edman sequenator Up to 50 cycles For larger protein less 10-20 cycles
Step 3B: :
C-terminal analysis Enzymatic analysis (carboxypeptidase)
Carboxypeptidase A cleaves any residue except Pro, Arg, and Lys Carboxypeptidase B only works on Arg and Lys Carboxypeptidase C and Y works on any C-terminal residue
Steps 4 and 5:
Fragmentation of the chains
Enzymatic fragmentation trypsin, chymotrypsin, clostripain, staphylococcal protease Chemical fragmentation cyanogen bromide
Chemical Fragmentation
Cyanogen bromide
CNBr acts only on methionine residues CNBr is useful because proteins usually have only a few Met residues a peptide with a C-terminal homoserine lactone
15
Enzymatic Fragmentation
Trypsin Chymotrypsin Clostripain Staphylococcal protease
Enzymatic Fragmentation
Trypsin - cleavage on the C-side of Lys, Arg Chymotrypsin - C-side of Phe, Tyr, Trp
Trypsin - cleavage on the C-side of Lys, Arg
Clostripain - like trypsin, but attacks Arg more than Lys Staphylococcal protease C-side of Glu, Asp in phosphate buffer specific for Glu in acetate or bicarbonate buffer
Steps 6:
Reconstructing the Sequence Use two or more fragmentation agents in separate fragmentation experiments Sequence all the peptides produced (usually by Edman degradation) Compare and align overlapping peptide sequences to learn the sequence of the original polypeptide chain
Tryptic peptide
Tryptic peptide
16
Figure 5.18 Summary of the sequence analysis of catrocollastatinC, a 23.6-kD protein found in the venom of the western diamondback rattlesnake Crotalus atrox. Sequences shown are given in the one-letter amino acid code.
(Adapted from Shimokawa, K., et al., 1997. Archives of Biochemistry and Biophysics 343:3543.)
Step 7:
Location of Disulfide Cross-Bridges Peptide fragments may be linked by disulfide bridges Diagonal electrophoresis may be used to identify links Off-diagonal peptide fragments can be isolated and sequenced
17
Consider lysozyme and human milk -lactalbumin These proteins are identical at 48 positions (out of 129 in lysozyme and 123 in human milk -lactalbumin Functions of these two are not related
Figure 5.30 The tertiary structures of hen egg white lysozyme and human -lactalbumin are very similar. (Adapted from Acharya, K. R., et al., 1990. Journal of Protein Chemistry 9:549563; and
Acharya, K. R., et al., 1991. Journal of Molecular Biology 221:571581.)
Chapter 6
Proteins: Secondary, Tertiary, and Quaternary Structure
18
Outline
What Are the Noncovalent Interactions That Dictate and Stabilize Protein Structure? What Role Does the Amino Acid Sequence Play in Protein Structure? What Are the Elements of Secondary Structure in Proteins, and How Are They Formed? How Do Polypeptides Fold into Three-Dimensional Protein Structures? How Do Protein Subunits Interact at the Quaternary Level of Protein Structure?
All of the information necessary for folding the peptide chain into its "native structure is contained in the primary amino acid structure of the peptide.
6.1 - What Are the Noncovalent Interactions That Dictate and Stabilize Protein Structures?
What are they? What are the relevant numbers? van der Waals: 0.4 - 4 kJ/mol hydrogen bonds: 12-30 kJ/mol ionic bonds: 20 kJ/mol hydrophobic interactions: <40 kJ/mol
Figure 6.1 An electrostatic interaction between the -amino group of a lysine and the carboxyl group of a glutamate residue.
6.2 - What Role Does the Amino Acid Sequence Play in Protein Structures?
How do proteins recognize and interpret the folding information? Certain loci along the chain may act as nucleation points Protein chain must avoid local energy minima Chaperones may help
All of the information necessary for folding the peptide chain into its "native structure is contained in the primary amino acid structure of the peptide.
19
6.3 - What Are the Elements of Secondary Structure in Proteins, and How Are They Formed?
The atoms of the peptide bond lie in a plane The resonance stabilization energy of the planar structure is 88 kJ/mol A twist about the C-N bond involves a twist energy of 88 kJ/mol times the square of the twist angle. Twists can occur about either of the bonds linking the alpha carbon to the other atoms of the peptide backbone
Unfavorable orbital overlap precludes some combinations of phi and psi phi = 0 psi = 180is unfavorable , phi = 180 psi = 0is unfavorable , phi = 0 psi = 0is unfavorable ,
20
All these are local structures that are stabilized by hydrogen bonds Alpha helix Other helices Beta sheet (composed of "beta strands") Tight turns (aka beta turns or beta bends) Beta bulge
Figure 6.7 The three-dimensional structures of two proteins that contain substantial amounts of -helix in their structures. The helices are represented by the regularly coiled sections of the ribbon drawings. Myohemerythrin is the oxygen-carrying protein in certain invertebrates, including Sipunculids, a phylum of marine worm. (Jane Richardson)
21
Figure 6.8 The arrangement of NH and C=O groups (each with an individual dipole moment) along the helix axis creates a large net dipole for the helix. Numbers indicate fractional charges on respective atoms.
Figure 6.9 Four NH groups at the N-terminal end of an -helix and four C=O groups at the C-terminal end cannot participate in hydrogen bonding. The formation of Hbonds with other nearby donor and acceptor groups is referred to as helix capping. Capping may also involve appropriate hydrophobic interactions that accomodate nonpolar side chains at the ends of helical segments.
Figure 6.10 A pleated sheet of paper with an antiparallel -sheet drawn on it. (Irving Geis)
allows the peptide chain to reverse direction carbonyl C of one residue is H-bonded to the amide proton of a residue three residues away proline and glycine are prevalent in beta turns
22
Figure 6.12 The structures of two kinds of -turns (also called tight turns or -bends) (Irving Geis) Figure 6.13 Three different kinds of -bulge structures involving a pair of adjacent polypeptide chains. (Adapted from Richardson, J. S., 1981. Advances in Protein Chemistry 34:167339.)
6.4 How Do Polypeptides Fold into ThreeDimensional Protein Structures? Several important principles: Secondary structures form wherever possible (due to formation of large numbers of H bonds) Helices and sheets often pack close together
The resonance stabilization energy of the planar structure is 88 kJ/mol A twist about the C-N bond involves a twist energy of 88 kJ/mol times the square of the twist angle. Twists can occur about either of the bonds linking the alpha carbon to the other atoms of the peptide backbone
Unfavorable orbital overlap precludes some combinations of phi and psi phi = 0 psi = 180is unfavorable , phi = 180 psi = 0is unfavorable , phi = 0 psi = 0is unfavorable ,
23
Globular Proteins
Some design principles
Figure 6.18 Poly(Gly-Pro-Pro), a collagen-like righthanded triple helix composed of three left-handed helical chains. (Adapted from Miller, M. H., and Scheraga, H. A., 1976, Calculation of the structures of collagen models. Role of interchain interactions in determining the triple-helical coiled-coil conformation. I. Poly(glycyl-prolyl-prolyl). Journal of Polymer Science Symposium 54:171200.)
Most polar residues face the outside of the protein and interact with solvent Most hydrophobic residues face the interior of the protein and interact with each other Packing of residues is close However, ratio of vdw volume to total volume is only 0.72 to 0.77, so empty space exists The empty space is in the form of small cavities
24
Globular Proteins
More design principles
"Random coil" is not random Structures of globular proteins are not static Various elements and domains of protein move to different degrees Some segments of proteins are very flexible and disordered Know the kinds and rates of protein motion
Globular Proteins
The Forces That Drive Folding Peptide chain must satisfy the constraints inherent in its own structure Peptide chain must fold so as to "bury" the hydrophobic side chains, minimizing their contact with water Peptide chains, composed of L-amino acids, have a tendency to undergo a "right-handed twist"
Figure 6.27 The natural right-handed twist exhibited by polypeptide chains, and the variety of structures that arise from this twist.
25
Figure 6.28 Examples of protein domains with different numbers of layers of backbone structure. (a) Cytochrome c' with two layers of -helix. (b) Domain 2 of phosphoglycerate kinase, composed of a -sheet layer between two layers of helix, three layers overall. (c) An unusual five-layer structure, domain 2 of glycogen phosphorylase, a -sheet layer sandwiched between four layers of -helix. (d) The concentric layers of -sheet (inside) and -helix (outside) in triose phosphate isomerase. Hydrophobic residues are buried between these concentric layers in the same manner as in the planar layers of the other proteins. The hydrophobic layers are shaded yellow. (Jane Richardson)
Figure 6.30 Parallel -array proteinsthe eight-stranded -barrels of triose phosphate isomerase (a, side view, and b, top view) and (c) pyruvate kinase. (Jane Richardson) Figure 6.31 Several typical doubly wound parallel -sheet proteins. (Jane Richardson)
26
Figure 6.33 Examples of the so-called Greek key antiparallel barrel structure in proteins.
Thermodynamics of Folding
Read the box on page 185
Separate the enthalpy and entropy terms for the peptide chain and the solvent Further distinguish polar and nonpolar groups The largest favorable contribution to folding is the entropy term for the interaction of nonpolar residues with the solvent
Molecular Chaperones
Why are chaperones needed if the information for folding is inherent in the sequence? to protect nascent proteins from the concentrated protein matrix in the cell and perhaps to accelerate slow steps Chaperone proteins were first identified as "heat-shock proteins" (hsp60 and hsp70)
Protein Modules
An important insight into protein structure Many proteins are constructed as a composite of two or more "modules" or domains Each of these is a recognizable domain that can also be found in other proteins Sometimes modules are used repeatedly in the same protein There is a genetic basis for the use of modules in nature
Figure 6.36 Ribbon structures of several protein modules utilized in the construction of complex multimode proteins. (a) The complement control protein module. (b) The immunoglobulin module. (c) The fibronectin type I module. (d) The growth factor module. (e) The kringle module. (Adapted from
Baron, M., Norman, D., and Campbell, I., 1991, Protein modules. Trends in Biochemical Sciences 16:13-17.)
27
Predictive Algorithms
Figure 6.37 A sampling of proteins that consist of mosaics of individual protein modules. The modules shown include CG, a module containing carboxyglutamate residues; G, an epidermal growth-factorlike module; K, the kringle domain, named for a Danish pastry; C, which is found in complement proteins; F1, F2, and F3, first found in fibronectin; I, the immunoglobulin superfamily domain; N, found in some growth factor receptors; E, a module homologous to the calcium-binding EF hand domain; and LB, a lectin module found in some cell surface proteins.
(Adapted from Baron, M., Norman, D., and Campbell, , 1991, Protein modules. Trends in Biochemical Sciences 16:1317.)
If the sequence holds the secrets of folding, can we figure it out? Many protein chemists have tried to predict structure based on sequence Chou-Fasman: each amino acid is assigned a "propensity" for forming helices or sheets Chou-Fasman: is only modestly successful and doesn't predict how sheets and helices arrange
Quaternary Structure
What are the forces driving quaternary association? Typical Kd for two subunits: 10-8 to 10-16M! These values correspond to energies of 50-100 kJ/mol at 37 C Entropy loss due to association - unfavorable Entropy gain due to burying of hydrophobic groups - very favorable!
Figure 6.43 The quaternary structure of liver alcohol dehydrogenase. Within each subunit is a six-stranded parallel sheet. Between the two subunits is a two-stranded antiparallel sheet. The point in the center is a C2 symmetry axis. (Jane Richardson)
Figure 6.44 Isologous and heterologous associations between protein subunits. (a) An isologous interaction between two subunits with a twofold axis of symmetry perpendicular to the plane of the page. (b) A heterologous interaction that could lead to the formation of a long polymer. (c) A heterologous interaction leading to a closed structurea tetramer. (d) A tetramer formed by two sets of isologous interactions.
Figure 6.45 The polypeptide backbone of the prealbumin dimer. The monomers associate in a manner that continues the msheets. A tetramer is formed by isologous interactions between the side chains extending outward from sheet DAGHHGAD in both dimers, which pack together nearly at right angles to one another. (Jane Richardson)
28
What are the structural and functional advantages driving quaternary association?
Know these!
Figure 6.47 Schematic drawing of an immunoglobulin molecule showing the intramolecular and intermolecular disulfide bridges. (A space-filling model of the same molecule is shown in Figure 1.11.)
Stability: reduction of surface to volume ratio Genetic economy and efficiency Bringing catalytic sites together Cooperativity
29