Académique Documents
Professionnel Documents
Culture Documents
biochemistry and cell biology. All of this information is central to understanding the
rationale for the development of new therapeutic agents based on the molecular
modification of cellular metabolism. To download the ABO's for the Doctor of Pharmacy
program, CLICK HERE. Specific learning objectives for the course and the ABOs they
address are as follows:
Identify the basic chemical building blocks (amino acids, sugars, fatty acids,
steroids) within each class of biomolecule, and describe how they contribute to
biomolecular structure (ABOs 1.5 and 1.16).
Understand the structure, function and biosynthesis of proteins (ABOs 1.5 and
1.16).
Understand the cellular functions of peptide and steroid hormones (ABOs 1.3,
1.5, 1.7, 1.8, 3.2, 4.1, 4.4).
Understand the structure, function and biosynthesis of nucleic acids (ABOs 1.5
and 1.16).
Understand the cellular utilization and production of energy (ABOs 1.5, 1.16,
4.1, 4.4).
Proteins play crucial roles in almost every biological process. They are responsible in
one form or another for a variety of physiological functions including:
Structural support - skin and bone are strengthened by the protein collagen.
Immune protection - antibodies are protein structures that are responsible for
reacting with specific foreign substances in the body.
Generation and transmission of nerve impulses - some amino acids act as
neurotransmitters, which transmit electrical signals from one nerve cell to
another. In addition, receptors for neurotransmitters, drugs, etc. are protein in
nature. An example of this is the acetylcholine receptor, which is a protein
structure that is embedded in postsynaptic neurons.
Control of growth and differentiation - proteins can be critical to the control of
growth, cell differentiation and expression of DNA. For example, repressor
proteins may bind to specific segments of DNA, preventing expression and thus
the formation of the product of that DNA segment. Also, many hormones and
growth factors that regulate cell function, such as insulin or thyroid stimulating
hormone are proteins.
Like most biological macromolecules, proteins are made up of simple building blocks;
in the case of proteins, these building blocks are called amino acids. As shown below,
the amino and carboxyl moieties in an amino acid are alpha to one another; also located
on the alpha carbon is an "R" group. The nature of this R-group (called the side chain)
determines the identity of a particular amino acid. There are a total of 20 amino acids
which are used to make up proteins (some modified or otherwise unusual amino acids
exist that we will discuss later in the course). In solution at physiological pH (7.4),
amino acids undergo an acid-base reaction to form zwitterions. In a zwitterion, the + and
- charges cancel to give a molecule with a net charge of zero. However, the pKa values
for a typical amino acid (glycine for example) are 9.6 and 2.3 for the amino and carboxyl
groups, respectively. If the pH of an amino acid solution is lowered significantly from
7.4, a species results in which the amine group has a positive charge, while the
carboxyl is neutral. Likewise, If the pH is raised from 7.4, a species results in which the
amine group is neutral, while the carboxyl has a negative charge. Thus, the ionization
state of amino acids is pH dependent.
All amino acids except glycine (R = H) are chiral. Every amino acid in mammalian
systems exists in the L-configuration, where "L" signifies that the amino acid in Fischer
projection is similar to L-glyceraldehyde. This description of stereochemistry is
outdated, and is seldom used except in trivial names. However, all natural amino acids
are also in the S-configuration, which is determined by assigning priorities based on the
Cahn-Ingold-Prelog rules.
As was mentioned above, there are 20 amino acids which are used to make up proteins
in mammalian biological systems. The amino acids are amphipathic molecules, meaning
that they contain both polar and non-polar functional groups, and thus have a tendency to
form interfaces between hydrophilic and hydrophobic molecules. The properties of each
amino acid are dictated by the side chain, which can vary in size, shape, charge,
reactivity and ability to hydrogen bond. The amino acids are grouped according to the
properties of their sidechains, as shown in the figure below. Each amino acid has a
standard three letter abbreviation which is used in lieu of a full structure, as seen in
the figure.
The first six amino acids, glycine (GLY), alanine (ALA), leucine (LEU), isoleucine
(ILE), proline (PRO) and valine (VAL) are aliphatic in nature. Glycine and alanine are
too small to have a hydrophobic effect in proteins, but they are considered aliphatic
amino acids. Proline is also aliphatic, and because of its cyclic structure, it can often be
found in the bend portion of a protein chain. Valine, leucine and isoleucine are
hydrophobic aliphatic, and although they can be found anywhere in the chain, they
prefer to cluster in the inside region of a protein, away from water. This effect causes a
significant stabilization of the protein structure.
There are three aromatic amino acids, phenylalanine (PHE), tyrosine (TYR) and
tryptophan (TRP). These amino acids have sidechains which contain delocalized pi
electrons that can interact with other pi systems in biomolecules. In addition, the
phenolic hydroxyl of TYR can ionize under physiological conditions, and thus increase
water solubility. Two of the amino acids are sulfur-containing, namely cysteine (CYS)
and methionine (MET). These amino acids have special properties that will be covered
at a later time. Finally, there are two hydroxyl-containing amino acids, serine (SER)
and threonine (THR). These two amino acids have sidechains which can hydrogen bond
to water or to other groups on neighboring macromolecules.
Five of the 20 amino acids are considered hydrophilic, in that they are able to ionize at
physiological pH. The amino acids lysine (LYS), arginine (ARG) and histidine (HIS)
are considered basic hydrophilic, since they contain basic sidechain groups that will
have a positive charge at pH 7.4. The amino acids aspartic acid (ASP) and glutamic
acid (GLU) are considered acidic hydrophilic, since they contain acidic sidechain
groups that will have a negative charge at pH 7.4. These two amino acids also have
amide counterparts, asparagine (ASN) and glutamine (GLN).
Note that 8 of the 20 amino acids have ionizable sidechains. Arginine, lysine and
histidine can have a positive charge, while aspartic acid and glutamic acid can possess a
negative charge under physiological conditions. It is also possible for serine, tyrosine
and cysteine to ionize to a negatively charged species during certain biological
processes.
Protein chains are held together by peptide bonds, which are simply amide linkages
between neighboring amino acids. When two amino acids interact, an equilibrium is set
up between unbound amino acids and a species in which two amino acids are linked,
called a dipeptide. Since this equilibrium favors the unlinked forms of the amino acids, it
is clear that formation of a peptide bond requires energy. When a few amino acids
become linked, the protein species is called an oligopeptide, and when many are linked,
the species is called a polypeptide. Polypeptides are generally between 50 and 2000
amino acids. Their molecular weights are expressed in Daltons, where 1 Dalton is equal
to 1 atomic mass unit (the weight of one hydrogen atom). 1000 Daltons is called a
kilodalton (kD). Most proteins weigh in between 5500 and 220,000 Daltons.
Each peptide chain has two free ends, the amino terminus, which is always drawn on the
left by convention, and the carboxyl terminus, which is always drawn on the right. This
convention extends to peptide chains expressed using three letter abbreviations. Thus, the
oligopeptide ALA-GLY-TRP-SER-GLU has an alanine at the amino terminus, and a
glutamic acid at the carboxyl terminus.
Amino acids in a protein are determined by the genetic code, wherein a three base
sequence of nucleotides called a codon calls for a specific amino acid to be added to the
growing chain. The process of converting the sequence of codons into a sequence of
amino acids entails transcription (the conversion of a segment of DNA into
complimentary mRNA) and translation (the conversion of the mRNA code into protein).
You will learn a great deal more about protein synthesis later in the semester.
As shown below, amino acids can participate in reactions that occur after they are
positioned in a peptide chain. These reactions are called post-translational
modifications, and can be of enormous biological significance. One example of a post-
translational modification is the crosslinking of two cysteines to form a new amino acid,
called cystine. This modification most often occurs in extracellular proteins, and can
contribute to their three-dimensional structure.
There are other post-translational modifications of biological significance, three of which
are shown below. In some proteins, acetylation of the amino terminus occurs. This
modification greatly decreases protein degradation, since many proteases require an
amino terminus to act. In structural protein such as collagen, hydroxylation of proline
occurs to afford hydroxyproline (HPRO). Since hydroxyproline has a hydrogen-
bonding sidechain, it is used to lend additional strength to the collagen structure, and
hence to tendons and other like tissues. Finally, the amino acids serine, threonine and
tyrosine can be phosphorylated within a protein chain. This modification is often used
by the cell to turn on or off a critical biological process.
A bit of history:
The peptide bond has unique characteristics which contribute to the overall structure of
proteins. The peptide bond itself is rigid, and thus is not free to rotate. This rigidity arises
because the amide bond is involved in a tautomerization that gives it considerable
double bond character. The other bonds in a peptide ar not rigid, and can freely rotate,
giving the protein chain many degrees of rotational freedom. The amide bond, together
with the bonds on either side of it that connect to the alpha carbons, are called the
backbone of the protein chain.
Primary structure - this term refers to the amino acid sequence of a protein,
including cystines that are formed during crosslinking. Sequence can dictate three
dimensional structure, since amino acid residues need to be in a specific order to
foster proper protein folding, and since disulfides must be formed from properly
positioned cysteines to afford an active protein. An example of primary structure
is the hypertensive octapeptide angiotensin II, which has the sequence ASP-
ARG-VAL-TYR-ILE-HIS-PRO-PHE.
Secondary structure - this term refers to the arrangement of amino acids that
are close together in a chain. Examples of secondary structures are helices and
pleated sheets. An alpha helix is a tightly coiled, rodlike structure which has an
average of 3.6 amino acids per turn. The helix is stabilized by hydrogen bonding
between the backbone carbonyl of one amino acid and the backbone NH of the
amino acid four residues away. All main chain amino and carboxyl groups are
hydrogen bonded, and the R groups stick out from the structure in a spiral
arrangement. As seen in the table below, there are several types of alpha helix that
arise from the degree of hydrogen bonding in the helix.
Another type of secondary structure, the beta pleated sheet is composed of two or more
straight chains that are hydrogen bonded side by side. If the amino termini are on the
same end of each chain, the sheet is termed parallel, and if the chains run in the opposite
direction (amino termini on opposite ends), the sheet is termed antiparallel (see below
left). All of the amides are hydrogen bonded except those on the outer strands. Pleated
sheets may be formed from a single chain if it contains a beta turn, which forms a
hairpin loop structure. Often a proline can be found in a beta turn, since it places a "kink"
in the chain. When the beta sheet curves around itself and the outer edges on either side
hydrogen bond to one another, it forms a structure called a beta barrel, which is a
common structural motif in proteins.
Tertiary structure - tertiary structure refers to the arrangement of amino acids
that are far apart in the chain. Each protein ultimately folds into a three
dimensional shape with a distinct inside and outside. The interior of a protein
molecule contains a preponderance of hydrophobic amino acids, which tend to
cluster and exclude water. The core is stabilized by Van der Waals forces and
hydrophobic bonding. By contrast, the exterior of a protein molecule is largely
composed of hydrophilic amino acids, which are charged or able to H-bond with
water. This allows a protein to have greater water solubility. A protein will
spontaneously fold to preserve the relationships outlined above.
Quaternary structure - protein chains can associate with other chains to form
dimers, trimers, and other higher orders of oligomers. Generally they contain
between 2 and 6 subunits which may be chains with the same sequence
(homodimers) or different chains (heterodimers).
Proteins can be associated with membranes, and in fact carry out almost every membrane
function. Interestingly, membrane proteins have special characteristics that allow them to
exist in this lipid environment. Proteins that sit on the inner or outer surface of the
membrane are called extrinsic or peripheral, and have a large percentage of
hydrophobic amino acids in the portion of the molecule that is close to the hydrophobic
membrane structure. The amino acids on the outer portion of the protein (facing the
aqueous environment of the cytoplasm or extracellular fluid) are mostly hydrophilic,
allowing the protein to be compatable with water. Proteins can also traverse the
membrane, and in this case they are called intrinsic or integral. The portion of the
protein that passes through the membrane is composed of hydrophobic amino acid
residues, while the inner and outer portions exposed to water are largely hydrophilic.
Transmembrane proteins can move laterally in the membrane, but cannot flip-flop.
Proteins are a unique class of biomolecules, in that they can recognize and interact with
diverse substances. The contain complimentary clefts and surfaces which are designed
to bind to specific molecules. Often only a single molecule or even a single stereoisomer
can bind to a complimentary protein surface. Once this binding takes place, a complex is
formed. This induces a conformational change which may act as a signal within the cell,
or may serve to activate an enzyme.
The cycle of removal of the protecting group and addition of amino acids is
continued until the desired peptide has been formed, and then the peptide is
released from the bead using HF.
2. Oxytocin and Vasopressin. Oxytocin and vasopressin are two peptide hormones
with very similar structure, but with very different biological activities. Their primary
sequences are shown below. Interestingly, their structures only differ by one amino acid
residue (the hydrophobic LEU number 8 in oxytocin is replaced by a hydrophilic ARG
residue in vasopressin). Oxytocin is a potent stimulator of uterine smooth muscle, and
also stimulates lactation. However, vasopressin, also know as antidiuretic hormone
(ADH), has no effect on uterine smooth muscle, but causes reabsorbtion of water by
the kidney, thus increasing blood pressure.
3. Insulin and Glucagon. Insulin is an extremely important peptide hormone that is
produced by the beta cells of the Islet of Langerhans in the pancreas. It has 51 amino
acids, three disulfide crosslinks, and is comprised of two seperate chains, termed A
and B. Insulin has a number of important effects on cells in the body including:
Insulin is not synthesized in active form, but is first made as a single inactive peptide
chain called preproinsulin (see the figure below). Preproinsulin has no crosslinks, and in
addition to the A and B chain, has two additional portions called the signal sequence and
the connecting (C) peptide. The signal sequence informs the cell that insulin is being
made, and that the finished preproinsulin should be deposited outside the cell. The C-
peptide is necessary to allow preproinsulin to fold in the correct conformation to
ultimately produce active insulin. Preproinsulin is processed by a two step procedure; in
the first step, the signal sequence is cleaved by a peptidase, and two of the three
crosslinks are formed to give a new but still inactive peptide called proinsulin. A second
peptidase then cleaves the C-peptide, and an internal disulfide forms to produce insulin.
Glucagon is a peptide hormone that is formed in the alpha cells of the Islets of
Langerhans in the pancreas. It is a single chain peptide consisting of 29 amino acid
residues, and has effects which oppose insulin, including:
Hemoglobin is an example of an allosteric protein, i.e. its function can be altered by the
binding of some external substance (called the effector) at a site on the molecule other
than the active site (the allosteric site). When an allosteric effector binds to a protein, it
induces a conformational change which turns the function of the protein either on
(positive allosterism) or off (negative allosterism). In the case of hemoglobin, the
allosteric effector is 2,3-diphosphoglycerate (2,3-DPG), which causes hemoglobin to
have 1/26th of its normal affinity for oxygen. This is an important issue, since 2,3-DPG
in the tissues triggers the release of oxygen at the correct location.
As shown in the figure below, hemoglobin can also exist in a glycosylated form known as
HbA1C. HbA1C is formed when the amino terminus of HbA reacts with glucose, first
reversibly forming an aldimin or Schiff's Base, and then undergoing an irreversible
Amadori rearrangement to afford the ketamine form HbA1C. In normal patients,
HbA1C accounts for about 3-5% of HbA, but in diabetics who have elevated blood
glucose for extended periods, this number can reach 6 to 15%. Physicians can measure
HbA1C, and are using it as a reliable way to monitor how well diabetic patients are
complying with their insulin therapy.
5. Collagen. Collagen is a connective tissue protein that is found in skin, bone,
tendons, cartilage, the cornea, etc.. It is quite insoluble in water, and is composed of
two types of chain termed alpha-1 and alpha-2. In the amino acid sequence of collagen,
about every 3rd amino acid is a GLY residue, and there are many prolines which are
hydroxylated to form hydroxyproline (HPRO). LYS residues are also hydroxylated in
collagen to form HLYS. These additional sidechain OH residues allow for extra strength
due to H-bonding, and the GLY residues allow the protein to coil more tightly, since they
fit on the inside of the helix. In a collagen fiber, three of these helices are coiled together
to form a rope-like structure called a superhelical coil. It is this structure that gives
collagen its great strength. Collagen structure can be disrupted in diseases such as
scurvy, which is a lack of ascorbic acid, a cofactor in the hydroxylation of proline. In
addition, collagen structure is disrupted in rheumatoid arthritis.