Académique Documents
Professionnel Documents
Culture Documents
The Whitten-Rabinovitch (WR) approximation used in the semi-classical calculation of the Rice-
Ramsperger-Kassel-Marcus (RRKM) unimolecular reaction rate constant was improved for reliable
application to protein reactions. The state sum data for the 10-mer of each amino acid calculated by the
accurate Beyer-Swinehart (BS) algorithm were used to obtain the residue-specific correction functions (w).
The correction functions were obtained down to a much lower internal energy range than reported in the
original work, and the cubic, rather than quadratic, polynomial was used for data fitting. For a specified
sequence of amino acid residues in a protein, an average was made over these functions to obtain the sequence-
specific correction function to be used in the rate constant calculation. Reliability of the improved method
was tested for dissociation of various peptides and proteins. Even at low internal energies corresponding to
the RRKM rate constant as small as 0.1 s-1, the rate constant calculated by the present method differed from
the accurate BS result by 60% only. In contrast, the result from the original WR calculation differed from the
accurate result by a factor of 3000. Compared to the BS method, which is difficult to use for proteins, the
main advantage of the present method is that the RRKM rate constant can be calculated instantly regardless
of the protein mass.
I. Introduction are the critical energy, the complete vibrational frequency set
for the reactant, and that at the TS geometry. The major
In the study of a unimolecular reaction, it is often useful to
difficulty arises from the fact that the parameters related to the
have a rate constant evaluated theoretically. For a unimolecular
TS, namely E0 and the frequencies at the TS, cannot be
reaction occurring under the microcanonical condition, the
measured experimentally. The data obtained via quantum
Rice-Ramsperger-Kassel-Marcus (RRKM) theory1-6 is widely
chemical calculations can be used when the TS can be found
used to estimate the statistically expected rate constant. The
through computation. The less rigorous but more widely used
theoretical rate constant is useful not only for direct comparison
approach is to treat these as adjustable parameters. It is well-
with the experimental one but also for various other purposes.
known that the RRKM rate constant remains nearly the same
For example, RRKM calculation is done in the field of mass
regardless of the changes in individual frequencies as long as
spectrometry7-9 as an aid for understanding the structure and
the entropy of activation, ∆Sq, is kept the same.7,10-12 Hence,
dissociation dynamics of molecular ions, estimating their internal
the RRKM calculation of the rate-energy relation is often
energy contents, etc.
treated as a two parameter (E0 and ∆Sq) problem. In this
When the molecular rotation is ignored, the expression for a
approach, the frequency set at the TS is obtained from the
unimolecular reaction rate constant derived by the RRKM theory
reactant set by adjusting some of the frequencies in the latter
is as follows.
set such that the postulated value of ∆Sq results. The fact that
reasonable estimates for E0 and ∆Sq are needed for reliable
Nq(E - E0) estimation of the rate constant is the main drawback of this
k)σ (1)
hF(E) approach.
When one attempts a RRKM calculation for large biological
Here F(E) is the vibrational state density of the reactant at molecules such as peptides and proteins, additional difficulties
the internal energy E, E0 is the critical energy of the reaction, arise due to the very large number of degrees of freedom
Nq(E - E0) is the vibrational state sum from 0 to E - E0 at the involved. One of these difficulties is that it is virtually impossible
transition state (TS), h is the Planck constant, and σ is the at the moment to obtain the complete set of reactant frequencies
reaction path degeneracy, which is usually 1 for reactions either via experiment or via computation. In our previous
involving large molecules. paper,13 we reported a systematic and efficient method to
Even though the expression for the RRKM rate constant looks estimate the frequency set for a peptide or a protein with any
deceptively simple, various difficulties are encountered in actual amino acid sequence and presented its utility in a RRKM
calculation. The parameters needed for a RRKM calculation calculation. The method started with the vibrational frequency
sets for twenty amino acids calculated at the density functional
* To whom correspondence should be addressed. Telephone: +82-2- theory (DFT) level. Then, the frequencies disappearing upon
880-6652. Fax: +82-2-889-1568. E-mail: myungsoo@snu.ac.kr.
† Seoul National University. peptide bond formation were deleted from each set to obtain
‡ Korea Research Institute of Bioscience and Biotechnology. the fictitious sets for an amino acid residue at the N- or
10.1021/jp066453t CCC: $37.00 © 2007 American Chemical Society
Published on Web 02/15/2007
2748 J. Phys. Chem. B, Vol. 111, No. 10, 2007 Sun et al.
C-terminus or inside the chain. For a specified sequence of a w ) (5.00 + 2.730.5 + 3.51)-1 0.1 < < 1.0 (5)
protein, these residue frequencies were collected and the
frequencies appearing upon peptide bond formation were added w ) exp(-2.41910.25) 1.0 e (6)
to obtain the complete set for the reactant. The frequencies
appearing upon protonation of a protein were added as needed Here, is the internal energy scaled with the zero-point
to handle the reaction of protonated proteins that are of great energy.
interest in the field of mass spectrometry.14-16 The method
treated all the vibrations as harmonic and ignored anharmonicity. ) E/Ez (7)
Also ignored was the fact that some modes are better treated as
internal rotations rather than vibrations.1,5 Such simplifications The expression for the state density is obtained from the
and the estimation of the frequencies at the TS using the derivative of eq 2 as follows.
postulated value of ∆Sq adopted in this method are causes for
uncertainty in the RRKM calculations. (E + aEz)s-1
Once the frequency sets for the reactant and TS are provided,
various methods can be used to calculate the vibrational state
F(E) )
(s - 1)! ∏ hνi
[1 - β(dwd )] (8)
Initially, we hoped to achieve the above goal by using a present IWR method for higher mass proteins. The general trend
universal w function that could be used for proteins with any in all these cases was the same as in the M10 case: overestima-
sequence. The IWR′ method described in the previous section tion of k near 0.1 s-1 by 60-80% in the calculations with E0 )
was devised for this purpose. The rate-energy relations 0.5 eV and ∆Sq ) 0 eu and overestimation by a factor of 6
calculated by the IWR′ method were in excellent agreement with E0 ) 0.5 eV and ∆Sq ) 10 eu.
with the BS results for many peptides consisting of a variety of From the results presented so far, it is obvious that a rate-
amino acid residues. In the IWR′ calculations, the largest error energy relation calculated by the IWR method reproduces the
is expected for the peptides consisting of only one type of amino corresponding BS result satisfactorily except when E0 is small
acid residue, especially M10 and T10. The rate-energy relation and ∆Sq is large at the same time. In the present work, the
for M10 calculated by the IWR′ method with an E0 of 0.5 eV vibrational frequency set at the TS is estimated by adjusting
and a ∆Sq of 0 eu is shown in Figure 2 also. The IWR′ rate some of the frequencies of the reactant such that the postulated
constant is in much better agreement with the BS result than is ∆Sq results. As ∆Sq gets larger, these adjusted frequencies get
the WR result. At the internal energy of 1.38 eV chosen in the smaller. Then, the state sum evaluated using the w function
previous comparison, the IWR′ rate constant is 1.3 s-1, derived with the reactant frequency data deviates more from
corresponding to a factor of 13 difference from the correct value. the correct value, resulting in larger deviation in the rate
Namely, the difference factor of 3000 in the original WR method constant. When E0 gets larger also, the agreement between the
has been reduced to 13 in IWR′. We are not satisfied with the IWR and BS results gets better even when ∆Sq is large. The
IWR′ method, however, because our aim is to develop a method explanation for the better agreement at larger E0 is as follows.
that can reproduce a BS rate constant within a factor of 2. As E0 increases, the rate constant at a given internal energy
As has been mentioned already in the previous section, our decreases rapidly. In fact, the internal energy needed to maintain
method of choice (IWR) is to calculate a sequence-specific w the rate constant at the same value, and hence the internal energy
function by averaging the w functions (w-1 actually) of the at the TS also, increases almost in proportion to E0. At high
residues contained in a protein. The rate-energy relation for internal energy, however, the influence of the above low-
M10 calculated by IWR using the same E0 and ∆Sq values as frequency vibrations at the TS becomes less important, resulting
before is also shown in Figure 2. Because all the residues are in better agreement between the IWR and BS rate constants.
the same in M10, the sequence-specific w function evaluated Even though the present IWR method is not adequate to
from the residue functions is exactly the same as the residue reproduce the BS rate-energy relation for a simultaneously
function that was derived using the state sum calculated by the small E0 and large ∆Sq case such as E0 ) 0.5 eV and ∆Sq ) 10
BS algorithm. Hence, one may expect to obtain an IWR rate eu, it is to be mentioned that such a case is unlikely to be
constant that is essentially identical to the BS result. Even though encountered in protein dissociations. In dissociation reactions,
the IWR and BS rate-energy relations in Figure 2 are nearly a large value of ∆Sq, or the occurrence of a loose transition
the same, a tiny difference can be observed. For example, the state,5 is usually associated with simple bond cleavage reactions,
IWR rate constant at 1.38 eV is 0.160, namely 60% larger than which tend to have a large E0 value. It is well-known in the
the BS result. To see if the functional fitting of the w data is field of mass spectrometry that the b and y type fragment ions
the source of such an error, we calculated the state density at are generated preferentially when the internal energy of a
1.38 eV using the w function and compared it with the BS state protonated peptide is not high,26 which indicates that they are
density. The BS density turned out to be larger than the IWR generated via reaction paths with small E0 values. According
density only by 0.2%, which is the fitting error for the state to the DFT calculation for the dissociation of [G3 + H]+ carried
density. We also calculated the state sum at 0.88 eV, which is out by Paizs and Suhai,22 generation of the b2 ion occurred with
the internal energy at the TS. Here, the difference between the the smallest value of E0 (0.42 eV) and dominated in the low
IWR and BS results was only 7% when the reactant frequencies internal energy range. The value of ∆Sq evaluated using the
were used. Hence, it is unlikely that the fitting error is the main frequencies at the reactant and TS geometries supplied by Paizs
source for the 60% error in the rate constant. The main error in and Suhai was -2.65 eu. This is in agreement with our
the present IWR method may arise from the fact that the speculation that there is probably no dissociation channel with
vibrational frequencies at the TS are different from the reactant simultaneously small E0 and large ∆Sq values for protein ions.
frequencies that are used to obtain the w function. To check If such a channel existed, it would have dominated the fragment
such a possibility, we carried out RRKM calculations for M10 ion spectra over the entire internal energy range. However, it is
with the same E0 but increasing ∆Sq to 10 eu, which is a typical known26 that the major fragment ion types in the dissociation
value for a loose transition state5 reaction. Here again, the rate- of peptide and small protein ions change from b and y to a, d,
energy data calculated by IWR were much better than those by v, w, and x as the internal energy increases. Absence of reaction
WR, even though the deviation from the BS results was larger channels with simultaneously small E0 and large ∆Sq values
than in the E0 ) 0.5 eV and ∆Sq ) 0 eu case, as expected. The means that the IWR method developed in this work is probably
rate constant in the 0.1 s-1 range differed by a factor of 6. The reliable for dissociation of protein ions. Without attached
RRKM calculations were done with an E0 of 1.0 eV and a ∆Sq protons, the critical energy for the dissociation of a neutral
of 10 eu also. Here, the IWR and BS results were in excellent protein will be probably larger than the corresponding value
agreement as in the E0 ) 0.5 eV and ∆Sq ) 0 eu case. for the protonated form.
The RRKM calculations were done for peptides consisting The forgoing discussion on the influence of entropy on the
of various amino acids also. These included well-known peptides accuracy of the IWR method has arisen because the w function
such as angiotensin I (DRVYIHPFHL, monoisotopic relative derived with the reactant frequencies in this work is not quite
molecular mass (RMM) of 1295.7) and 100 randomly generated adequate at the TS. A related problem may arise from the fact
peptides consisting of ten amino acid residues. We also carried that the reactant frequencies used in this work are those
out calculations for various fictitious proteins such as the estimated for linear peptides and proteins. The frequencies of a
fictitious 8-mer of angiotensin I, (DRVYIHPFHL)8, with a protein may be affected by hydrogen bonding even in the gas
monoisotopic RMM of 10 239.4 to check the reliability of the phase. Then, the w function derived in this work may be invalid
Improved WR Approximation for Proteins J. Phys. Chem. B, Vol. 111, No. 10, 2007 2751
to deal with the dissociation of gas-phase proteins, rendering biopolymers such as nucleic acids and carbohydrates, even
the present method useless. Out of this concern, we attempted though the method to derive such functions would be the same
some primitive calculations for peptides with intramolecular as in this work. In this regard, it may be worthwhile to
hydrogen bonding. For this purpose, we assumed that the investigate other approximate methods developed previously
frequencies affected by hydrogen bonding are those of the N-H such that an efficient and accurate method with general
and CdO stretching modes. Then, the frequencies of these applicability to any biopolymer can be established.
modes in the set stored for each amino acid residue was reduced
by 5% based on the spectral correlation appearing in the Acknowledgment. Meiling Sun thanks the Ministry of
literature.27 The rate-energy relations thus obtained were hardly Education for the Brain Korea 21 fellowship. The software
distinguishable from the original data, suggesting that the w developed in this work will be made freely available.
function derived by the present method can be used even when
References and Notes
intramolecular hydrogen bonding is present in a protein.
Finally, we compared the rate-energy relations calculated (1) Holbrook, K. A.; Pilling, M. J.; Robertson, S. H. Unimolecular
Reactions, 2nd ed.; Wiley: Chichester, U.K., 1996.
with E0 ) 0.5 eV and ∆Sq ) 0 eu for the monomer, 8-mer, and (2) Marcus, R. A.; Rice, O. K. J. Phys. Colloid Chem. 1951, 55, 894.
80-mer of angiotensin I. The internal energies corresponding (3) Rosenstock, H. M.; Wallenstein, M.B.; Wahrhaftig, A. L.; Eyring,
to, for example, 103 s-1 in these molecules were 1.97, 14.21, H. Proc. Nat. Acad. Sci. U.S.A. 1952, 38, 667.
and 140.01 eV, respectively, increasing nearly in proportion to (4) Steinfeld, J. I.; Francisco, J. S.; Hase, W. L. Chemical Kinetics
and Dynamics; Prentice Hall: Englewood Cliffs, NJ, 1989; pp 342-401.
the molecular mass. This is in agreement with our previous (5) Gilbert, R.G.; Smith, S. C. Theory of Unimolecular and Recom-
suggestion28 that the internal energy needed to observe a bination Reactions; Blackwell Scientific: Oxford, U.K., 1990; pp 136-
particular type of protein reaction with the same rate constant 211.
(6) Baer, T.; Hase, W. L. Unimolecular Reaction Dynamics: Theory
increases almost in proportion to the number of degrees of and Experiments; Oxford University Press: New York, 1996; pp 171-
freedom, which, in turn, is almost proportional to the molecular 211.
mass of proteins. Because the zero-point energies of proteins (7) Lifshitz, C. AdV. Mass Spectrom. 1989, 11A, 713.
increase almost in proportion to the molecular mass also, 42.18, (8) Baer, T.; Mayer, P. M. J. Am. Soc. Mass Spectrom. 1997, 8, 103.
(9) Laskin, J.; Bailey, T. H.; Futrell, J. H. Int. J. Mass Spectrom. 2006,
332.77, and 3321.73 eV for the monomer, 8-mer, and 80-mer, 249-250, 462.
respectively, the three rate-energy curves can be brought into (10) Rosenstock, H. M.; Stockbauer, R.; Parr, A. C. J. Chem. Phys. 1979,
near coincidence by drawing them with the scaled energy in 71, 3708.
(11) Rosenstock, H. M.; Stockbauer, R.; Parr, A. C. J. Chem. Phys. 1980,
the abscissa. 73, 773.
(12) Lifshitz, C. Mass Spectrom. ReV. 1982, 1, 309.
IV. Conclusion (13) Moon, J. H.; Oh. J. Y.; Kim, M. S. J. Am. Soc. Mass Spectrom.
2006, in press.
Even though the BS algorithm is efficient and accurate in (14) Biemann, K.; Martin, S. A. Mass Spectrom. ReV. 1987, 6, 1.
evaluating the vibrational state sum and density needed in the (15) Paizs, B.; Suhai, S. Mass Spectrom. ReV. 2005, 24, 508.
(16) McCloskey, J. A. Mass Spectrometry, Methods in Enzymology;
RRKM calculation of a unimolecular reaction, its application Academic Press: San Diego, 1990; Vol. 193, pp 351-360.
to biopolymers is limited because the computational time (17) Whitten, G. Z.; Rabinovitch, B. S. J. Chem. Phys. 1963, 38, 2466.
increases rapidly with the molecular mass. In this work, the (18) Morse, P. M.; Feshbach, H. Methods of Theoretical Physics. Part
I; McGraw-Hill: New York, 1953; pp 434-443.
WR approximation used in the semi-classical calculation of a (19) Derrick, P. J.; Lloyd, P. M.; Christie, J. R. Physical Chemistry of
rate constant has been improved such that the rate constant for Ion Reactions in AdV. Mass Spectrometry; Wiley: Chichester, U.K. 1995;
a unimolecular reaction of a protein evaluated by this method Vol. 13, pp 25-52.
closely reproduces the BS result. Its main advantage lies in the (20) Beyer, T.; Swinehart, D. F. Commun. ACM 1973, 16, 379.
(21) Rabinovitch, B. S.; Diesen, R. W. J. Chem. Phys. 1959, 30, 735.
fact that a rate constant can be calculated instantly regardless (22) Paizs, B.; Suhai, S. Rapid Commun. Mass Spectrom. 2002, 16, 375.
of the protein mass. It has been found that the rate constant (23) Polce, M. J.; Ren, D.; Wesdemiotis, C. J. Mass Spectrom. 2000,
calculated by this method differs from the BS result significantly 35, 1351.
(24) Klassen, J. S.; Kebarle, P. J. Am. Chem. Soc. 1997, 119, 6552.
for reactions occurring with simultaneously small E0 and large (25) Allen, G. Protein; Jai Press: London, 1997; Vol. 1, p 9.
∆Sq values. Certainly, it is a weakness of the present method (26) Johnson, R. S.; Martin, S. A.; Biemann, K. Int. J. Mass Spectrom.
even though such a situation is unlikely to be encountered in Ion Processes 1988, 86, 137.
actual protein dissociations. Another weakness of the present (27) Dollish, F. R.; Fateley, W. G.; Bentley, F. F. Characteristic Raman
Frequencies of Organic Compounds; Wiley: London, 1974; Chapter 9.
approach is that the method is valid only for peptides and (28) Oh, J. Y.; Moon, J. H.; Kim, M. S. J. Am. Soc. Mass Spectrom.
proteins. Completely new functions must be derived for other 2004, 15, 1248.