Académique Documents
Professionnel Documents
Culture Documents
LAGRANGE MULTIPLIERS
William F. Trench
Andrew G. Cowles Distinguished Professor Emeritus
Department of Mathematics
Trinity University
San Antonio, Texas, USA
wtrench@trinity.edu
William F. Trench
1 Foreword
This is a revised and extended version of Section 6.5 of my Advanced Calculus (Harper
& Row, 1978). It is a supplement to my textbook Introduction to Real Analysis, which
is referenced via hypertext links.
2 Introduction
To avoid repetition, it is to be understood throughout that f and g1 , g2 ,. . . , gm are
continuously differentiable on an open set D in Rn .
Suppose that m < n and
2
However, it may be difficult or impossible to find explicit formulas for h1 , h2 , . . . , hm ,
and, even if it is possible, the composite function (5) is almost always complicated.
Fortunately, there is a better way to to find constrained extrema, which also requires
the solvability assumption, but does not require an explicit formula as indicated in (4).
It is based on the following theorem. Since the proof is complicated, we consider two
special cases first.
Theorem 1 Suppose that n > m: If X0 is a local extreme point of f subject to
g1 .X/ D g2 .X/ D D gm .X/ D 0
and ˇ ˇ
ˇ @g1 .X0 / @g1 .X0 / @g1 .X0 / ˇ
ˇ ˇ
ˇ
ˇ @x r1 @x r2 @x rm
ˇ
ˇ
ˇ ˇ
ˇ ˇ
ˇ @g2 .X0 / @g2 .X0 / @gm .X0 / ˇ
ˇ ˇ¤0
ˇ
(6)
ˇ
ˇ @x r1 @x r2 @x rm ˇ
ˇ :: :: : :: :
:: ˇ
ˇ
ˇ : : ˇ
ˇ
ˇ @gm .X0 / @gm .X0 / @g m .X 0 / ˇ
ˇ ˇ
ˇ @xr1 @xr2 @xrm ˇ
for at least one choice of r1 < r2 < < rm in f1; 2; : : : ; ng; then there are constants
1 ; 2 ; . . . ; m such that X0 is a critical point of
f 1 g1 2 g2 m gm I
that is;
@f .X0 / @g1 .X0 / @g2 .X0 / @gm .X0 /
1 2 m D 0;
@xi @xi @xi @xi
1 i n.
The following implementation of this theorem is the method of Lagrange
multipliers.
(a) Find the critical points of
f 1 g1 2 g2 m gm ;
treating 1 , 2 , . . . m as unspecified constants.
(b) Find 1 , 2 , . . . , m so that the critical points obtained in (a) satisfy the con-
straints.
(c) Determine which of the critical points are constrained extreme points of f . This
can usually be done by physical or intuitive arguments.
If a and b1 , b2 , . . . , bm are nonzero constants and c is an arbitrary constant, then the
local extreme points of f subject to g1 D g2 D D gm D 0 are the same as the local
extreme points of af c subject to b1 g1 D b2 g2 D D bm gm D 0. Therefore, we
can replace f 1 g1 2 g2 m gm by af 1 b1 g1 2 b2 g2 m bm gm c
to simplify computations. (Usually, the “ c” indicates dropping additive constants.)
We will denote the final form by L (for Lagrangian).
3
3 Extrema subject to one constraint
Here is Theorem 1 with m D 1.
Since gx1 .X0 / ¤ 0, the Implicit Function Theorem (Corollary 6.4.2, p. 423) implies
that there is a unique continuously differentiable function h D h.U/; defined on a
neighborhood N Rn 1 of U0 ; such that .h.U/; U/ 2 D for all U 2 N , h.U0 / D x10 ,
and
g.h.U/; U/ D 0; U 2 N: (8)
Now define
fx1 .X0 /
D ; (9)
gx1 .X0 /
which is permissible, since gx1 .X0 / ¤ 0. This implies (7) with i D 1. If i > 1,
differentiating (8) with respect to xi yields
4
has a nontrivial solution if and only if
ˇ ˇ
ˇ a b ˇˇ
ˇ
ˇ c D 0;
d ˇ
since the determinants of a matrix and its transpose are equal. Therefore, the system
2 @f .X / @g.X0 / 3
0
6 @xi @xi 7
7 u 0
D
6
@g.X / 5 v 0
6 7
4 @f .X /
0 0
@x1 @x1
has a nontrivial solution (Theorem 6.1.15, p. 376). Since gx1 .X0 / ¤ 0, u must be
nonzero in a nontrivial solution. Hence, we may assume that u D 1, so
@f .X0 / @g.X0 /
2 3
6 @xi @xi 7
7 1 0
D : (14)
6
@g.X0 / 5 v 0
6 7
4 @f .X /
0
@x1 @x1
In particular,
@f .X0 / @g.X0 / fx1 .X0 /
Cv D 0; so vD :
@x1 @x1 gx1 .X0 /
Now (9) implies that v D , and (14) becomes
2 @f .X / @g.X0 / 3
0
6 @xi @xi 7
1 0
D :
6 7
0
6 7
4 @f .X / @g.X0 / 5
0
@x1 @x1
Computing the topmost entry of the vector on the left yields (7).
ax C by D d
5
p
Solution We must minimize .x x1 /2 C .y y1 /2 subject to the constraint. This
is equivalent to minimizing .x x1 /2 C .y y1 /2 subject to the constraint, which is
simpler. For, this we could let
L D .x x1 /2 C .y y1 /2 .ax C by d /I
however,
.x x1 /2 C .y y1 /2
LD .ax C by/
2
is better. Since
Lx D x x1 a and Ly D y y1 b;
.x0 ; y0/ D .x1 C a; y1 C b/, where we must choose so that ax0 C by0 D d .
Therefore,
ax0 C by0 D ax1 C by1 C .a2 C b 2 / D d;
so
d ax1 by1
D ;
a2 C b 2
.d ax1 by1 /a .d ax1 by1 /b
x0 D x1 C 2 2
; and y0 D y1 C :
a Cb a2 C b 2
The distance from .x1 ; y1 / to the line is
p jd ax1 by1 j
.x0 x1 /2 C .y0 y1 /2 D p :
a2 C b 2
Example 2 Find the extreme values of f .x; y/ D 2x C y subject to
x 2 C y 2 D 4:
Solution Let
2
L D 2x C y .x C y 2 /I
2
then
Lx D 2 x and Ly D 1
y;
p
so .x0 ; y0 / D p
.2=; 1=/. Since C x02
p
2
yp
0 D
4, D ˙ 5=2. Hence, the constrained
p
maximum is 2 p 5, attained
p at .4= 5; 2= 5/, and the constrained minimum is 2 5,
attained at . 4= 5; 2= 5/.
3x C 4y C ´ D 1 (15)
closest to . 1; 1; 1/.
6
subject to (15). Let
so D 1=26 and
29 22 25
.x0 ; y0 ; ´0 / D ; ; :
26 26 26
The distance from .x0 ; y0 ; ´0 / to . 1; 1; 1/ is
p 1
.x0 C 1/2 C .y0 1/2 C .´0 1/2 D p :
26
Example 4 Assume that n 2 and xi 0, 1 i n.
n
X n
X
(a) Find the extreme values of xi subject to xi2 D 1.
i D1 i D1
n
X n
X
(b) Find the minimum value of xi2 subject to xi D 1.
i D1 i D1
then
1
Lxi D 1 xi ; so xi 0 D ; 1 i n:
n
X p
Hence, xi20 D n=2 , so D ˙ n and
i D1
1 1 1
.x10 ; x20; : : : ; xn0 / D ˙ p ; p ; : : : ; p :
n n n
p p
Therefore, the constrained maximum is n and the constrained minimum is n.
Solution (b) Let
n n
1X 2 X
LD xi xi I
2
i D1 i D1
7
then
Lxi D xi ; so xi 0 D ; 1 i n:
n
X
Hence, xi 0 D n D 1, so xi 0 D D 1=n and the constrained minimum is
i D1
n
X 1
xi20 D
n
i D1
subject to
x y
C D ; x 0; y 0; (17)
p q
where is a fixed but arbitrary positive number. Since f is continuous, it must assume
a maximum at some point .x0 ; y0 / on the line segment (17), and .x0 ; y0 / cannot be an
endpoint of the segment, since f .p; 0/ D f .0; q/ D 0. Therefore, .x0 ; y0 / is in the
open first quadrant.
Let
1=p 1=q x y
LDx y C :
p q
Then
1 1
Lx D f .x; y/ and Ly D f .x; y/ D 0;
px p qy q
so x0 D y0 D f .x0 ; y0 /=. Now(16) and (17) imply that x0 D y0 D . Therefore,
x y
f .x; y/ f .; / D 1=p 1=q D D C :
p q
This can be generalized (Exercise 53). It can also be used to generalize Schwarz’s
inequality (Exercise 54).
8
4 Constrained Extrema of Quadratic Forms
In this section it is convenient to write
2 3
x1
6 x2 7
XD6 7:
6 7
::
4 : 5
xn
An eigenvalue of a square matrix A D Œaij ni;j D1 is a number such that the system
AX D X;
or, equivalently,
.A I/X D 0;
has a solution X ¤ 0. Such a solution is called an eigenvector of A. You probably
know from linear algebra that is an eigenvalue of A if and only if
det.A I/ D 0:
Then
n
X
Lxi D 2 aij xj 2xi D 0; 1 i n;
j D1
so
n
X
aij xj 0 D xi 0 ; 1 i n:
j D1
n
X
Therefore, X0 is a constrained critical point of Q subject to xi2 D 1 if and only
i D1
if AX0 D X0 for some ; that is, if and only if is an eigenvalue and X0 is an
9
n
X
associated unit eigenvector of A. If AX0 D X0 and xi20 D 1, then
i
0 1
n
X n
X n
X
Q.X0 / D @ aij xj 0 A xi 0 D .xi 0 /xi 0
i D1 j D1 i D1
n
X
D xi20 D I
i D1
therefore, the largest and smallest eigenvalues of A are the maximum and minimum
Xn
values of Q subject to xi2 D 1.
i D1
and
ˇ ˇ
ˇ 1 1 2 ˇˇ
ˇ
det.A I/ D ˇˇ 1 1 2 ˇˇ
ˇ 2 2 2 ˇ
D . C 2/. 2/. 4/;
so
1 D 4; 2 D 2; 3 D 2
are the eigenvalues of A. Hence, 1 D 4 and 3 D 2 are the maximum and minimum
values of Q subject to (18).
To find the points .x1 ; y1; ´1 / where Q attains its constrained maximum, we first
find an eigenvector of A corresponding to 1 D 4. To do this, we find a nontrivial
solution of the system
2 3 2 32 3 2 3
x1 3 1 2 x1 0
.A 4I/ 4 y1 5 D 4 1 3 2 5 4 y1 5 D 4 0 5 :
´1 2 2 2 ´1 0
10
2 3
1
All such solutions are multiples of 4 1 5 : Normalizing this to satisfy (18) yields
2
2 3 2 3
x1 1
1
X1 D p 4 y1 5 D ˙ 4 1 5 :
6 ´1 1
To find the points .x3 ; y3 ; ´3 / where Q attains its constrained minimum, we first
find an eigenvector of A corresponding to 3 D 2. To do this, we find a nontrivial
solution of the system
2 3 2 32 3 2 3
x3 3 1 2 x3 0
.A C 2I/ 4 y3 5 D 4 1 3 2 5 4 y3 5 D 4 0 5 :
´3 2 2 4 ´3 0
2 3
1
All such solutions are multiples of 4 1 5 : Normalizing this to satisfy (18) yields
1
2 3 2 3
x2 1
1
X3 D 4 y2 5 D ˙ p 4 1 5 :
´2 3 1
As for the eigenvalue 2 D 2, we leave it you to verify that the only unit vectors
that satisfy AX2 D 2X2 are
2 3
1
1
X2 D ˙ p 4 1 5 :
2 1
For more on this subject, see Theorem 4.
for some r and s in f1; 2; : : : ; ng; then there are constants and such that
@f .X0 / @g1 .X0 / @g2.X0 /
D 0; (20)
@xi @xi @xi
1 i n.
11
Proof For notational convenience, let r D 1 and s D 2. Denote
U D .x3 ; x4 ; : : : xn / and U0 D .x30 ; x30 ; : : : xn0 /:
Since
ˇ @g1 .X0 / @g1.X0 / ˇ
ˇ ˇ
ˇ ˇ
ˇ @x1 @x2 ˇ
ˇ ¤ 0; (21)
ˇ ˇ
ˇ
ˇ @g .X / @g .X / ˇ
ˇ 2 0 2 0 ˇ
@x1 @x2
ˇ ˇ
the Implicit Function Theorem (Theorem 6.4.1, p. 420) implies that there are unique
continuously differentiable functions
h1 D h1 .x3 ; x4; : : : ; xn / and h2 D h1 .x3 ; x4 ; : : : ; xn /;
defined on a neighborhood N Rn 2 of U0 ; such that .h1 .U/; h2 .U/; U/ 2 D for all
U 2 N , h1 .U0 / D x10 , h2 .U0 / D x20 , and
g1 .h1 .U/; h2 .U/; U/ D g2 .h1 .U/; h2 .U/; U/ D 0; U 2 N: (22)
From (21), the system
2 @g .X / @g1 .X0 / 3
1 0
6 @x 1 @x2 7
7
fx1 .X0 /
D (23)
6
@g .X / 5 fx2 .X0 /
6 7
4 @g .X /
2 0 2 0
@x1 @x2
has a unique solution (Theorem 6.1.13, p. 373). This implies (20) with i D 1 and
i D 2. If 3 i n, then differentiating (22) with respect to xi and recalling that
.h1 .U0 /; h2 .U0 /; U0 / D X0 yields
@g1 .X0 / @g1 .X0 / @h1.U0 / @g1.X0 / @h2 .U0 /
C C D0
@xi @x1 @xi @x2 @xi
and
@g2.X0 / @g2 .X0 / @h1 .U0 / @g2 .X0 / @h2.U0 /
C C D 0:
@xi @x1 @xi @x2 @xi
If X0 is a local extreme point of f subject to g1 .X/ D g2 .X/ D 0, then U0 is an
unconstrained local extreme point of f .h1 .U/; h2 .U/; U/; therefore,
@f .X0 / @f .X0 / @h1 .U0 / @f .X0 / @h2.U0 /
C C D 0:
@xi @x1 @xi @x2 @xi
The last three equations imply that
ˇ @f .X0 / @f .X0 / @f .X0 /
ˇ ˇ
ˇ
ˇ ˇ
ˇ
ˇ @x i @x1 @x2 ˇ
ˇ
ˇ ˇ
ˇ @g .X / @g .X / @g1 .X0 / ˇˇ
1 0 1 0
ˇ D 0;
ˇ
ˇ
ˇ @xi @x1 @x2 ˇ
ˇ ˇ
ˇ ˇ
ˇ @g .X / @g .X / @g2 .X0 / ˇˇ
ˇ 2 0 2 0
@xi @x1 @x2
ˇ ˇ
12
ˇ @f .X0 / @g1 .X0 / @g2 .X0 /
ˇ ˇ
ˇ
ˇ ˇ
ˇ @xi @xi @xi ˇ
ˇ ˇ
ˇ ˇ
ˇ @f .X / @g1 .X0 / @g2 .X0 / ˇˇ
ˇ 0
ˇ D 0:
ˇ ˇ
ˇ @x1 @x1 @x1
ˇ ˇ
ˇ ˇ
ˇ @f .X / @g1 .X0 / @g2 .X0 / ˇˇ
ˇ 0
ˇ @x2 @x2 @x2
ˇ ˇ
ˇ
ˇ ˇ
Therefore, there are constants c1 , c2, c3 , not all zero, such that
@f .X0 / @g1.X0 / @g2 .X0 /
2 3
6 @xi @xi @xi 7
6 7
6 7
6 @f .X / @g .X / @g .X / 7 2 3 2 3
6 0 1 0 2 0 7 c1 0
7 c2 5 D 4 0 5 : (24)
6 74
6 @x1 @x1 @x1
7 c3 0
6 7
6
6 @f .X / @g .X / @g .X / 7
6 0 1 0 2 0 7
@x2 @x2 @x2
4 5
If c1 D 0, then 2 @g .X /
1 0 @g1.X0 / 3
6 @x1 @x2 7
7 c2
0
D ;
6
@g .X / 5 c3 0
6 7
4 @g .X /
2 0 2 0
@x1 @x2
so (19) implies that c2 D c3 D 0; hence, we may assume that c1 D 1 in a nontrivial
solution of (24). Therefore,
@f .X0 / @g1.X0 / @g2.X0 /
2 3
6 @xi @xi @xi 7
6 7
6 7
6 @f .X / @g .X / @g .X / 7 2 3 2 3
6 0 1 0 2 0 7 1 0
7 c2 5 D 4 0 5; (25)
6 74
6 @x1 @x1 @x1
7 c3 0
6 7
6
6 @f .X / @g .X / @g .X / 7
6 0 1 0 2 0 7
@x2 @x2 @x2
4 5
13
Since (23) has only one solution, this implies that c2 D and c2 D , so (25)
becomes
@f .X0 / @g1.X0 / @g2 .X0 /
2 3
6 @xi @xi @xi 7
6 7
6 7
6 @f .X / @g .X / @g .X / 7 2 3 2 3
6 0 1 0 2 0 7 1 0
5 D 4 0 5:
6 74
6 @x1 @x1 @x1 7
0
6 7
6 7
6 @f .X / @g .X / @g .X / 7
6 0 1 0 2 0 7
@x2 @x2 @x2
4 5
Computing the topmost entry of the vector on the left yields (20).
Example 7 Minimize
f .x; y; ´; w/ D x 2 C y 2 C ´2 C w 2
subject to
x C y C ´ C w D 10 and x y C ´ C 3w D 6: (26)
Solution Let
x 2 C y 2 C ´2 C w 2
LD .x C y C ´ C w/ .x y C ´ C 3w/I
2
then
Lx D x
Ly D y C
L´ D ´
Lw D w 3;
so
x0 D C ; y0 D ; ´0 D C ; w0 D C 3: (27)
This and (26) imply that
. C / C . / C . C / C . C 3/ D 10
. C / . / C . C / C .3 C 9/ D 6:
Therefore,
4 C 4 D 10
4 C 12 D 6;
14
Since f .x; y; ´; w/ is the square of the distance from .x; y; ´; w/ to the origin, it attains
a minimum value (but not a maximum value) subject to the constraints; hence the
constrained minimum value is
5 7 5 3
f ; ; ; D 27:
2 2 2 2
where .x1 ; y1 / is on one curve and .x2 ; y2 / is on the other. Find the distance between
the ellipse
x 2 C 2y 2 D 1
and the line
x C y D 4: (28)
d 2 D .x1 x2 /2 C .y1 y2 /2
subject to
x12 C 2y12 D 1 and x2 C y2 D 4:
Let
.x1 x2 /2 C .y1 y2 /2 .x12 C 2y12 /
LD .x2 C y2 /I
2
then
Lx1 D x1 x2 x1
Ly1 D y1 y2 2y1
Lx2 D x2 x1
Ly2 D y2 y1 ;
so
From (i) and (iii), D x10 ; from (ii) and (iv), D 2y10 . Since the curves do
2 2
not intersect, ¤ 0, so x10 D 2y10 . Since x10 C 2y10 D 1 and .x0 ; y0/ is in the first
quadrant,
2 1
.x10 ; y10 / D p ; p : (29)
6 6
15
Now (iii), (iv), and (28) yield the simultaneous system
1
x20 y20 D x10 y10 D p ; x20 C y20 D 4;
6
so
1 1
.x20 ; y20/ D 2 C p ; 2 p :
2 6 2 6
From this and (29), the distance between the curves is
" #1=2
2 2 1 2 p
1 1 3
2C p p C 2 p p D 2 2 p :
2 6 6 2 6 6 2 6
6 Proof of Theorem 1
Proof For notational convenience, let r` D `, 1 ` m, so (6) becomes
ˇ @g1 .X0 / @g1 .X0 / @g1 .X0 / ˇ
ˇ ˇ
ˇ ˇ
ˇ
ˇ @x1 @x2 @xm ˇ
ˇ
ˇ ˇ
ˇ @g .X / @g .X / @g2 .X0 / ˇˇ
2 0 2 0
ˇ
@x1 @x2 @xm
ˇ ˇ
ˇ ˇ¤0 (30)
ˇ :: :: :: :: ˇ
ˇ
ˇ : : : :
ˇ
ˇ
ˇ @g .X / @g .X / @g .X / ˇ
m 0 m 0 m 0
ˇ ˇ
@x1 @x2 @xm
ˇ ˇ
ˇ ˇ
ˇ ˇ
Denote
U D .xmC1 ; xmC2 ; : : : xn / and U0 D .xmC1;0 ; xmC2;0 ; : : : xn0 /:
From (30), the Implicit Function Theorem implies that there are unique continuously
differentiable functions h` D h` .U/, 1 ` m, defined on a neighborhood N of U0 ,
such that
.h1 .U/; h2 .U/; : : : ; hm .U/; U/ 2 D; for all U 2 N;
.h1 .U0 /; h2 .U0 /; : : : ; hm .U0 /; U0 / D X0 ; (31)
and
g` .h1 .U/; h2 .U/; : : : ; hm.U/; U/ D 0; U 2 N; 1 ` m: (32)
Again from (30), the system
@g1 .X0 / @g1 .X0 / @g1 .X0 /
2 3
6
6 @x1 @x2 @xm 7
72 3 2 3
1 fx1 .X0 /
6 7
6 @g .X / @g .X / @g2 .X0 / 7 6
7
2 0 2 0
7 6 2 7 6 fx2 .X0 /
6 7 6 7
@x1 @x2 @xm
6
6 76 : 7 D 6 ::
7
(33)
:: :: :: :: 74 : 5 4 7
: :
6 5
6
: : : :
7
f m .X0 /
6 7
6 @g .X / @g .X / @gm .X0 / 7
7 m x
m 0 m 0
6
@x1 @x2 @xm
4 5
16
has a unique solution. This implies that
so
@f .X0 / @g1.X0 / @g2.X0 / @gm .X0 /
ˇ ˇ
ˇ ˇ
ˇ ::: ˇ
ˇ
ˇ @xi @xi @xi @xi ˇ
ˇ
ˇ ˇ
@f .X0 / @g1.X0 / @g2.X0 / @gm .X0 / ˇ
ˇ ˇ
:::
ˇ
ˇ ˇ
ˇ
ˇ @x1 @x1 @x1 @x1 ˇ
ˇ
ˇ ˇ D 0:
ˇ @f .X0 / @g1.X0 / @g2.X0 / @gm .X0 / ˇˇ
ˇ :::
@x2 @x2 @x2 @x2
ˇ ˇ
ˇ ˇ
ˇ :: :: :: :: :: ˇ
ˇ
ˇ : : : : : ˇ
ˇ
ˇ @f .X0 / @g1.X0 / @g2.X0 / @gm .X0 / ˇˇ
ˇ :::
@xm @xm @xm @xm
ˇ ˇ
17
Therefore, there are constant c0 , c1 , . . . cm , not all zero, such that
@f .X0 / @g1 .X0 / @g2 .X0 / @gm .X0 /
2 3
6 @xi : : :
6 @xi @xi @xi 7
7
6 72 3 2 3
6 @f .X0 / @g1 .X0 / @g2 .X0 / @gm .X0 / 7 c0 0
6 7
6 ::: 76
c1 0
6 @x1 @x 1 @x 1 @x 1 7 7 6 7
76 7 6 7
6 c3 0
6
76 7D6 7 : (35)
6 7 6 7
6 @f .X / @g .X / @g .X /
0 1 0 2 0 @gm .X0 / 7 6 : ::
7 4 ::
7 6 7
6 ::: 7 5 4 : 5
6 @x2 @x2 @x2 @x2
6
7 cm 0
7
6 :: :: :: :: ::
6
6 : : : : : 7
7
4 @f .X0 / @g1 .X0 / @g2 .X0 / @gm .X0 / 5
:::
@xm @xm @xm @xm
If c0 D 0, then
@g1 .X0 / @g1 .X0 / @g1 .X0 /
2 3
6
6 @x1 @x2 @xm 7
72 3 2 3
c1 0
6 7
6 @g .X / @g2 .X0 / @g2 .X0 / 7
2 0
c2 0
6 76 7 6 7
6 76
6 @x1 @x2 @xm 76 :: 7D6
7 6
::
7
:: :: ::
7
6 :: 74
: :
:
5 4 5
: : :
6 7
cm 0
6 7
6 @g .X / @gm .X0 / @gm .X0 / 7
m 0
6 7
@x1 @x2 @xm
4 5
18
which implies that
@g1 .X0 / @g1 .X0 / @g1 .X0 /
2 3
6
6 @x1 @x2 @xm 7
72 3 2 3
c1 fx1 .X0 /
6 7
6 @g .X / @g .X / @g2 .X0 / 7
2 0 2 0
c2 fx2 .X0 /
6 76 7 6 7
@x1 @x2 @xm
6 76
:: 7D6 ::
6 76 7 6 7
:: :: ::
7
6 :: 74
: 5 4 : 5
6
: : : :
7
cm fxm .X0 /
6 7
6 @g .X / @g .X / @gm .X0 / 7
m 0 m 0
6 7
@x1 @x2 @xm
4 5
Since (33) has only one solution, this implies that cj D j , 1 j n, so (36)
becomes
@f .X0 / @g1 .X0 / @g2 .X0 / @gm.X0 /
2 3
6 @xi :::
6 @xi @xi @xi 7
7
6 72 3 2 3
6 @f .X0 / @g1 .X0 / @g2 .X0 / @gm.X0 / 7 1 0
6 7
:::
1 0
6 76
6 @x1 @x1 @x1 @x1 76 7 6
7 6
7
7
2 0
6 76
7D6 7:
6 76 7 6 7
6 @f .X / @g .X / @g .X /
0 1 0 2 0 @gm.X0 / 7
76 :: 7 6 :: 7
6 ::: 74 : 5 4 : 5
6 @x2 @x2 @x2 @x2
6
m 0
7
6 :: :: :: :: :: 7
6
6 : : : : : 7
7
4 @f .X0 / @g1 .X0 / @g2 .X0 / @gm.X0 / 5
:::
@xm @xm @xm @xm
Computing the topmost entry of the vector on the left yields yields (34), which com-
pletes the proof.
n
X
Example 9 Minimize xi2 subject to
i D1
n
X
ar i xi D cr ; 1 r m; (37)
i D1
where
n
(
X 1 if r D s;
ar i asi D (38)
i D1
0 if r ¤ s:
Solution Let
n m n
1X 2 X X
LD xi s asi xi :
2 sD1
i D1 i D1
Then
m
X
Lxi D xi s asi ; 1 i n;
sD1
19
so
m
X
xi 0 D s asi 1 i n; (39)
sD1
and
m
X
ar i xi 0 D s ar i asi :
sD1
Now (38) implies that
n
X m
X n
X
ar i xi 0 D s ar i asi D r :
i D1 sD1 i D1
Therefore,
m
X
xi20 D cr cs ar i asi ; 1 i n;
r;sD1
The next theorem provides further information on the relationship between the
eigenvalues of a symmetric matrix and constrained extrema of its quadratic form. It
can be proved by successive applications of Theorem 1; however, we omit the proof.
Theorem 4 Suppose that A D Œar s nr;sD1 2 Rnn is symmetric and let
n
X
Q.x/ D ar s xr xs :
r;sD1
20
minimizes Q subject to
n
X n
X
xi2 D 1 and xi s xi D 0; 1sr 1:
i D1 i D1
Denote
n
X
r D aij xi r xj r ; 1 r n:
i;j D1
Then
1 2 n and Axr D r xr ; 1 r n:
21
7 Exercises
1. Find the point on the plane 2x C 3y C ´ D 7 closest to .1; 2; 3/.
4. Find the points on the circle x 2 C y 2 D 320 closest to and farthest from .2; 4/.
9. A closed rectangular box has surface area A. Find it largest possible volume.
10. The sides and bottom of a rectangular box have total area A. Find its largest
possible volume.
11. A rectangular box with no top has volume V . Find its smallest possible surface
area.
13. Two vertices of a triangle are . a; 0/ and .a; 0/, and the third is on the ellipse
x2 y2
C D 1:
a2 b2
Find its largest possible area.
14. Show that the triangle with the greatest possible area for a given perimeter is
equilateral, given that the area of a triangle with sides x, y, ´ and perimeter s is
p
A D s.s x/.s y/.s ´/:
22
15. A box with sides parallel to the coordinate planes has its vertices on the ellipsoid
x2 y2 ´2
2
C 2 C 2 D 1:
a b c
Find its largest possible volume.
16. Derive a formula for the distance from .x1 ; y1; ´1 / to the plane
ax C by C c´ D :
ax C by C c´ D
Pn
for which i D1 jX Xi j2 is a minimum. Assume that none of the Xi are in the
plane.
n
X n
X
18. Find the extreme values of f .X/ D .xi ci /2 subject to xi2 D 1.
i D1 i D1
22. Find the extreme value of f .x; y/ D ˛ C ˇxy subject to .ax C by/2 D 1.
Assume that ab ¤ 0.
4x 2 C 9y 2 36´2 D 36:
x 2 C y 2 C ´2 C w 2 D 1:
x 2 C y 2 D 1 and ´2 C w 2 D 1:
23
26. Find the extreme values of f .x; y; ´; w/ D .x C ´/.y C w/ subject to
x 2 C ´2 D 1 and y 2 C w 2 D 1:
x2 y2 ´2
28. Minimize f .x; y; x/ D C C subject to ax C by C c´ D d and x,
˛2 ˇ2
2
y, ´ > 0.
a1 x1 C a2 x2 C C an xn D d:
n
X n
X
30. Find the maximum value of f .X/ D ai xi2 subject to bi xi4 D 1, where
i D1 i D1
p; q > 0 and ai , bi xi > 0, 1 i n.
n n
p q
X X
31. Find the extreme value of f .X/ D ai xi subject to bi xi D 1, where p,
i D1 i D1
q>0 and ai , bi , xi > 0, 1 i n.
f .x; y; ´; w/ D x 2 C 2y 2 C ´2 C w 2
subject to
x C y C ´ C 3w D 1
x C y C 2´ C w D 2:
x2 y2 ´2
C C D 1;
a2 b2 c2
assuming that at least one of p1 , p2 , p3 is nonzero.
24
36. Find the extreme values of f .x; y; ´/ D 2x C y C 2´ subject to x 2 C y 2 D 4
and x C ´ D 2.
37. Find the distance between the parabola y D 1 C x 2 and the line x C y D 1.
3x 2 C 9y 2 C 6´2 D 10
x2 y2 ´2
2
C 2 C 2 D1
a b c
are the largest and smallest eigenvalues of the matrix
0 a2 a2
2 3
4 b2 0 b2 5 :
c2 c2 0
40. Show that the extreme values of f .x; y; ´/ D xy C 2y´ C 2´x subject to
x2 y2 ´2
2
C 2 C 2 D1
a b c
are the largest and smallest eigenvalues of the matrix
a2 =2 a2
2 3
0
4 b 2=2 0 b2 5 :
2 2
c c 0
x2 y2 ´2
2
C 2 C 2 D 1:
a b c
25
44. Let a, b, c,and d be positive. Find the extreme values of
f .x; y; ´; w/ D xw y´
subject to
ax 2 C by 2 D 1; c´2 C dw 2 D 1;
if (a) ad ¤ bc; (b) ad D bc:
a1 x C a2 y C a3 ´ D c and b1 x C b2 y C b3 ´ D d:
Assume that
n
X n
X
ai xi D c and bi x i D d ;
i D1 i D1
where
n
X n
X n
X
ai2 D bi2 D 1 and ai bi D 0:
i D1 i D1 i D1
subject to
n
X n
X
xi D 1 and ixi D 0:
i D1 i D1
Prove explicitly that if
n
X n
X
yi D 1; iyi D 0
j D1 i D1
26
48. Let p1 , p2 , . . . , pn and s be positive numbers. Maximize
f .X/ D .s x1 /p1 .s x2/p2 .s xn /pn
subject to x1 C x2 C C xn D s.
50. Maximize
n
X xi
f .X/ D
i
i D1
subject to xi > 0, 1 i n, and
p p
x1 1 x2 2 xnpn D V;
subject to
a1 x1 C a2 x2 C C an xn D d:
52. Schwarz’s inequality says that .a1 ; a2 ; : : : ; an / and .x1 ; x2 ; : : : ; xn/ are arbi-
trary n-tuples of real numbers, then
r1 C r2 C C rm D r:
Show that
r1 x 1 C r2 x 2 C rm x m
1=r
x1r1 x2r2 xm
rm
;
r
and give necessary and P sufficient conditions for equality. (Hint: Maximize
x1r1 x2r2 xm
rm
subject to mj D1 rj xj D > 0, x1 > 0, x2 > 0, . . . , xm > 0.)
27
54. Let A D Œaij be an m n matrix. Suppose that p1 , p2 , . . . , pm > 0 and
m
X 1
D 1;
pj
j D1
and define
n
X
i D jaij jpi ; 1 i m:
j D1
subject to
n
X
xr r s D c s ; 0 s m;
r D0
is attained when
m
X
xr D s r s ; 0 r n;
sD0
where
m
X n
X
sC` ` D cs and s D r s; 0 s m:
`D0 r D0
Show that if fxr gnrD0 satisfies the constraints and xr ¤ xr 0 for some r , then
n
X n
X
xr2 > xr20:
r D0 r D0
n
X
56. Suppose that n > 2k. Show that the minimum value of f .W/ D wi2,
iD n
subject to the constraint
n
X
wi P .r i / D P .r /
iD n
28
whenever r is an integer and P is a polynomial of degree 2k, is attained with
2k
X
wi 0 D r i r ; 1 i n;
r D0
where
2k n
(
X 1 if s D 0; X
r r Cs D and s D j s:
r D0
0 if 1 s 2k; jD n
Show that if fwi gniD n satisfies the constraint and wi ¤ wi 0 for some i , then
n
X n
X
wi2 > wi20:
iD n iD n
n
X
57. Suppose that n k. Show that the minimum value of f wi2, subject to the
i D0
constraint
n
X
wi P .r i / D P .r C 1/
i D0
whenever r is an integer and P is a polynomial of degree k, is attained with
k
X
wi 0 D r i r ; 0 i n;
r D0
where
k
X n
X
r Cs r D . 1/s ; 0 s k; and ` D i `; 0 ` 2k:
r D0 i D0
Show that if
n
X
ui P .r i / D P .r C 1/
i D0
whenever r is an integer and P is a polynomial of degree k, and ui ¤ wi 0
for some i , then
Xn Xn
u2i > wi20 :
i D0 i D0
58. Minimize
n
X .xi ci /2
f .X/ D
˛i
i D1
subject to
n
X
ai r xi D dr ; 1r m
i D1
29
Assume that m > 1, ˛1 , ˛2 , . . . ˛m > 0, and
n
(
X 1 if r D s;
˛i ai r ai s D
i D1
0 if r ¤ s:
30
8 Answers to selected exercises
15 2 25
p p
1. 7
;
7 7
2. ˙5 3. 1= ab, 1= ab
p
4. .8; 16/ is closest, . 8; 16/ is farthest. 5. ˙ 53=6 6. 1=4ab 7. p 2 =4
p p p
8. 4 A 9. A3=2 =6 6 10. A3=2=6 3 11. 3.2V /2=3 12. abc=27
p
13. ab 15. 8abc=3 3
0 11=2
n
X
18. .1 /2 and .1 C /2 , where D @ cj2 A 19. 1, 2 20. 2, 4
j D1
p
21. 2=3, 2 22. ˛ ˙ jˇj=4jabj 23. 5, 73=16 24. ˙1 25. ˙2
p 2
d
26. ˙2 27. 2 1 28.
.a˛/2 C .bˇ 2 / C .c
/2
n
!1=2
jd a1 c1 a2 c2 an cn /ai j X a2 i
29. q 30.
a12 C a22 C an2 bi
i D1
n
!1 p=q
aiq=.q p/ p=.p q/
X
31. bi is a constrained maximum if p < q, a constrained
i D1
minimum if p > q
d2
32. 689=845 33. 34. ˙.p12 a2 C p22 b 2 C p32 c 2 /1=2
p12 a2 C p22 b 2 C p32 c 2
p p p p
35. 693=45 36. 2, 6 37. 7=4 2 38. 10 6=3 41. ˙jcj a2 C b 2=2
3 p p p
˛ˇ
˛ ˇ
42. C C 43. ˙3 44. (a) ˙1= bc (b) ˙1= ad D ˙1= bc
pqr p q r
n
!2 n
!2
X X
46. c ai ˛i C d bi ˛ i 47. xi 0 D .4n C 2 6i /=n.n 1/
i D1 i D1
h iP
.n 1/s p p p
48. P p1 1 p2 2 pn n
p1 Cp2 CCpn
S
49. .p1 1 /p1 .p2 2 /p2 .pn n /pn
p1 C p2 C C pn
p Cp 1CCp
V 1 2 n
50. .p1 C p2 C C pn / p p p
.1 p1 / .2 p2 / .n pn /
1 2 n
n
! n
! n
!1=2 n !1=2
X X X X
2 2 2 2
51. d ai ci / = ai ˛i 52. ˙ ai xi 0
i D1 i D1 i D1 i D1
m n
!2
X X
58. dr ai r ci
r D1 i D1
31