Vous êtes sur la page 1sur 2

The Use of Non-Linear Least Squares Analysis

Thomas G. Copeland
Middlebury College, Middlebury, VT 05753
A common problem is how to use experimental data to es- Table 1. Comparison of Non-Linear Fits for Flrst-Order Klnetlcr
timate the unknown oarameters in a non-linear eauation. For
example, y = x l x z l i ~ x 1+ Bxz), where X I and i2are inde- Moore (2) BMDP PAR
pendent variables and A and B are unknown constants, is a pr 0.301 f 0.003 0.299 f 0 . 0 ~ l 3 ~
non-linear equation. However, the constants A and B cannot Po
Rate
*
0.544 0.002
26.8 f 0.7
0.544 f 0.001
26.4 f 0.7
be estimated by the normal polynomial least squares methods.
In the past, these equations have been rearranged to linear Error 0.0000341 0.0000332
form for ease of analvsis. In manvcases, the equations needed Emor estimsteo are 1 standard deviation.
to be simplified by approximati& before the; could be rear-
ranged. Rather than deal with the manv special cases of
rearranged equations, a non-linear least squares analysis can Table 2. Control Language for First-Order Klnetlcs Fit
be used to estimate the parameters directly. For any equation
that can be expressed the general formy = f(x;),nodinear (PROBLEM TITLE IS 'First Order Kinetics'.
least squares allows the unknown parameters in the equation /INPUT VARIABLES ARE 2.
to be estimated from experimental data. The function need FORMAT IS '(2F15.0)'.
not be a linear of polynomial form. Once the computer pro- {VARIABLE NAMES ARE T.P.
/REGRESS DEPENDENT IS P.
gram to do non-linearleast squares is mastered (a fairly simple PARAMETERS ARE 3.
task) almost any equation can be analyzed. This paper dem-
- ..
onstrates how this eeneral method can be aonlied to several
common problems.
/PARAMETER

/FUNCTION
INITIAL ARE .5,.2.25.
NAME = PO.. P1.. R.
F = Pl'(1.-EXP(-R'T))+PO'EXF'-R'T)
Some statistical oackaees (such as BMDP (1))contain so- /PLOT RESIDUAL.
phisticated p r o g r a k for doing non-linear' Last squares VARIABLE IS T.
analysis without the need for explicit functional forms of the /END
derivatives. The only input necessary for these programs data goes here
(other than the data to be fit) is an initial estimate of the value
of the parameters to be estimated and the functional form of
the eauation to be fit. Since the aleorithms used work well mates of the errors in all the narameters. and the eeneral e

even irthe initial estimates are quiteiar off, any crude method technique becomes slow for more complicated equations.
for obtaining initial values is adequate. Moore (2) reoorted a non-linear least souares analvsis for
a
These s t a h i c a l programs use simple, concise language
to determine the s~eciiicsof each oroblem. The control lan-
the first-&der'kinetics of a ~ t o [ , ~ e d - t l o ~ e x p e r i m eThe
same data were analvzed usinr the IWDP non-linear. deriv-
~;t.
guage is typically only 10 to 20 lines long. New problems re- ative-free least squaies program PAR ( 1 ), and eisenti& the
quire only a minor change to a few lines of the control lan- same results were obtained (Table 11. The PAR control lan-
guage. Writing appropriate control language is much simpler guage used t o analyze the data is given in Table 2. These
than writing an entire computer program. commands are very short and quite easy to understand. (For
These package programs are readily available for many a complete explanation see the BMDP manual.) It is certainly
medium-to-large computers. Although they are not presently easier to learn to write these commands than it is to write a
available on microcomputers, there are less sophisticated program to do the analysis.
programs that will run on micros. The programs that run on The analysis of the binding of small molecules to macro-
micros, in general, do not use the most current statistical molecules can become quite complicated even if only one
methods and, in most cases, relatively little information about binding site is present and becomes very complicated for
the methods of calculation is provided in the documentation. multiple binding sites (4). The general equation for small
This is one situation where the minor inconvenienceof dealing molecules binding to n identical binding sites is
with a minicomputer or mainframe is overshadowed by the
power and ease of use of the software available. In the near
future, powerful statistical packages will be available on in- where u is the velocitv of the reaction. I11is the concentration
expensive small computers. of small molecules, n-is the number ifsites of the same type,
The case of first- and second-order chemical kinetics is a and K is the binding constant.~h~~~are three commonly used
typical example of the need for non-linear least squares (2,3). linearizations of this equation: the Bjermm plot, the reciprocal
There are three Parameters that need to be estimated. For plot, and the Scatchard dot. All three of these methods work
first-order kinetics, well for one binding sitk. In the multiple binding case, the
equation is
where P is the measured physical quantity (e.g., absorbance),
PI is the measured quantity a t infinite t & n e , - ~is~ the mea-
sured quantity a t time zero, t is the time, and r is the rate of where the sum is over the total number of different types of
the re&ion. In cases where PI cannot be obtained experi- binding sites. However, for multiple-site bmding, none of these
mentally, Po, P I , and r are all parameters that need to be es- analyses gives direct values for the unknown parameters (5).
timated. This can be done directly using a non-linear least The limiting slopes and intercepts (for very large and very
squares analysis (2). Houser (3) recently suggested an alter- small [I]) give combinations of the various parameters, but the
nate, iterative method, but this method does not give esti- , actual parameters can only be easily obtained in some special

778 Journal of Chemical Education


Table 3. Control Language for Two-Slte Binding Problem Table 5. Comparlson of True and PAR-Calculated Values of Four
Constants
/PROBLEM TITLE IS '2 Site Binding'.
/INPUT VARIABLES ARE 2. True Calculated
FORMAT IS '(2F15.0S.
/VARIABLE NAMES ARE A. V. nr 100 100.0 i0.0009a
/REGRESS DEPENDENT IS V. n2 10 9.997 i0.002
PARAMETERS ARE 4. k, 1.0 0.99989 f0.00006
/PARAMETER INITIAL ARE 10.,10..1.,.5. k2 0.01 0.00998 f0.00002
NAME = Nl.N2.Kl.K2.
. . .
IFUNCTION F = Nl'AI(K1 + A ) + N2'A/(K2 + A). 'E m r 891imtes me 1 standarc deviation.
/PLOT RESIDUAL.
VARIABLE IS A.
/END
The analysis of this problem would take a considerable
data goes here amount of time using Klotz's method. By using the non-linear
equation directly, the problem is simplified greatly.
The analysis for Michalis-Menton (steadystate) kinetics
Table 4. Slmulated Two-SHe Binding Data leads to equations similar to the multi-site binding equations
(6). Extensions of the steads-state analvsis lead to eauations
Id1 " that are inherently non-linear. These equations canbe ana-
lyzed easily using a non-linear least squares program.
The estimates of the standard deviations of all theparam-
eters in the equation are automatically produced by PAR.
These values are necessary in order to establish the validity
of the results.
PAR will also use experimental errors in the dependent
variable to weight the observations. If there are errors in the
independent variables, the analysis is more difficult (7-9).
In addition, PAR will automatically produce plots of the
predicted and observed values of the devendent variahle.,as ~~- ~ ~ ~~~~~

well as a plot of the residuals. The residial plot can be exam-


ined to insure that the residuals are randomlv distributed. Anv
systematic pattern to the residuals indicatesMthatthe equation
does not adeauatelv fit the data.
In conclusi&, noklinear least squares computer programs
are extremely valuable in fittine com~licatedeauations to
experimentaldata. Such programs are now availadle for most
medium to large scale computers. They can be very easy to use
and can free students, faculty, and researchers from the ted-
ium of trsina to derive linearized forms to comvlicated
cases. However, these analyses can often he used to estimate equations.kq;ations that cannot be linearized become as easy
the initial values for the parameters in the non-linear least to handle as linear eauations. Manv eauations that in the vast
squares. have been "too complicated to analyze" can now he hanhed
The four-parameter equation to fit a two-site binding quite easily using derivative-free, non-linear least squares
problem is (4) methods.
u = ndAIl(k~+ [All + nz[All(kz + [A]) Literature Cited
The sample PAR control language to find the four parameters ,I Diron. W .I ."RMnPSlat.rtial Scftware 1981." I l n r . (:elifomla P~CII.I9bI.
.>IM..lrr.P.,I Chrm Sm Ihrodo) l h n . 1 . 1diklt1'1~3~
is given in Table 3. Clearly, the control language to solve a very , I t H.larer .I..I( ' 1 O M FI)I 1' 59. l-61I9RZf.
complicated problem is quite easy to write. 48 %nnall. 4. T: . Illopnyrtml ('l#l.n>#r!nj' J W~lcy.New York. 19id.('hnp 3 . ,nuon
Twenty simulated data points were calculated using the (5) Klotz.1. M.,andHunston,D. L.,J. Biochem., 10,3065(1911).
parameters given in Table 4. Using the above control language, (6) Re( (41, Chap 10.
(71 Chri8tien.S. D.,Lsne.E.H..and Gsrland,F., J.CHEM. E~~~.,51,475(19741
PAR gives values of the four constants that are in excellent (8) Lane, E. H., Christisn.S. D., and Garland, F., J.Phys. Cham., 80,690 (1976).
agreement with the true values Table 5. (9) Onar, J., Amer J. Phys., 50.912 (1982).

Volume 61 Number 9 September 1984 779

Vous aimerez peut-être aussi