Académique Documents
Professionnel Documents
Culture Documents
Managing Editor:
M . HAZEWINKEL
Editorial Board:
F. CALOGERO, Universita deg/i Studi di Roma. Italy
Yu . I. MANIN. St~klov Institute of Mathematics. Moscow. U.S.S.R.
A. H. G. RINNOOY KAN. Erasmus University. Rotterdam. The Netherlands
G.-c. ROTA . M.I. T ..Cambridge. Mass . U.S.A .
edited by
M. Fliess
Laboratoire des Signaux et Systemes,
CNRS - Ecole Sup~rieure d' Electricit~,
Gif-sur-Yvette, France
and
M. Hazewinkel
Centre for Mathematics and Computer Science,
Amsterdam, The Netherlands
...
vi
Bruce Bairnsfather
Joseph Louis Lagrange.
What is now proved was once only imagined.
William Blake
Michiel Hazewinkel
TABLE OF CONTENTS
Series Editor's Preface
xi
Preface
33
45
55
77
89
101
121
147
159
181
viii
TABLE OF CONTENTS
227
w.
243
w.
257
285
311
OPTIMAL CONTROL
H. J. SUSSMANN / Envelopes, Conjugate Points, and
Optimal Bang-Bang Extremals
325
347
371
389
DISCRETE-TIME SYSTEMS
S. MONACO and D. NORMAND-CYROT / Nonlinear Systems in
Discrete Time
411
431
441
487
499
511
TABLE OF CONTENTS
ix
APPLICATIONS
R. MARINO / Feedback Linearization Techniques in
Robotics and Power Systems
523
545
573
597
623
INDEX
633
PREFACE
A dynamical system, in the sense that the word is used in mathematical system and
control theory, is something like
xf(x,u),
= h(x),
(1)
x = Ax + Bu. y = Cx.
(2)
A lot is known about systems like (2) and they are immensely useful in many
applications, i.e. modeling situations. Morever, to a first approximation around a
reference trajectory, every system is linearizable if explicit time dependence of the
matrices A, Band C is allowed.
It is however, also totally clear that the study of linear systems alone is by no
means enough. First there is, of course, the fact that in many situations there is not
(and cannot be) a well-determined reference trajectory. More important in the long
run, even from the applications standpoint alone, is the observation that the purely
linear world is a sad one and an expensive one. The (feedback) controls needed to do
all kinds of things tend to be massive; there is a depressing, yes, linearity about it
which precludes using something like pressure-point control, where use is made of,
say, the bifurcation points of the parametrized family x = f(x, u), to steer things with
very small (changes in the) u. (Not that this is, at the moment, a workable,
implemented technique.) To appreciate what we are hinting at it might be good to
reflect that if electronics was purely linear, i.e. if it consisted basically of Ohm's law,
we would have no fun with transistors, television, memory cells, etc.; in fact you
would not be reading these lines.
xi
xii
PREFACE
Nor is there mathematically a reason to stay away from nQnlinear systems. Many,
and very powerful, techniques have been developed and much can be said and done.
That one also needs varied and powerful parts of mathematics to deal competently
with nonlinear systems is, to our minds, an added inducement. Analysis, geometry
and algebra in their various aspects are all sorely needed, testifying to the vigour of
the subject.
This book can probably not be used as an introduction to the world of nonlinear
control systems, even though there are many survey-type contributions. For that the
reader is advised to consult, for example, W. J. Rugh, Nonlinear System Theory,
Johns Hopkins Univ. Pr., 1981; H. J. Sussmann, Lie Brackets, Real Analyticity and
Geometric Control, In: R. W. Brockett et al. (eds.), Differential Geometric Control
Theory, Birkhauser, 1983, pp. 1-116; M. Fliess, M. Lamnabhi, F. LamnabhiLagarrique, An Algebraic Approach to Nonlinear Functional Expansions, IEEE
Trans. CS 30 (1983), 554-570, A. Isidori, Nonlinear Control Systems: An
Introduction, LN Control Inf. Sci. Vol. 72, Springer, 1985. The present book is
instead a fairly comprehensive survey of the present state-of-the-art of substantial
parts of nonlinear system theory. It cettainly does not cover all aspects of nonlinear
system theory; that would be totally impossible in a volume this size. The core topics
are present in the form of both surveys and newer contributions, but many other
aspects are barely treated: large systems are not really covered, there is but little
about stochastics, identification problems are ignored by and large, etc. Much
remains and much is present, as a glance at the table of contents will show.
The book also constitutes the proceedings of a conference held in Paris in June
1985, organized with support from the Centre National de la Recherche Scientifique
and from the Laboratoire des Signaux et Systemes, SUPELEC, in Gif-sur-Yvette.
The world is nonlinear and mathematics now seems prepared to start taking
advantage of the fact. We hope and expect that this book will contribute to these
developments.
Michel Fliess
Michiel Hazewinkel
Controllability, Observability,
Realization and
other Structural Properties
INTRODUCTION
The aim of this paper is to give a short presentation of
three approaches that exist in the nonlinear realization
theory of continuous-time systems. They differ from each
other in the way of describing the input-output behaviour
of the system. The first describes this behaviour by a
general causal operator mapping input signals to output
signals. We will call this approach general and start the
exposition with it. The second approach uses a formal power
series in noncommuting variables as the input-output description of the system. For any specific regular input the
series turns to a converging series giving the value of the
output. This approach assumes analyticity of the system.
The third approach describes the input-output behaviour
by a Volterra series and, in general, also assumes analyticity. The theory is mainly developed for finite Volterra
series.
The three approaches will be presented in Chapters 1,2,
3, respectively. They were originated in the papers Jakubczyk [25]
(cf. also [24]) the first, Fliess [14] the second (see also
[12],[13],[27]), and Isidori, Ruberti [23], Brockett [3],
Crouch [6] the third. The background for them was prepared
in the earlier papers of Sussmann [41]-[44] who solved the
uniqueness problem and identified minimality.
In the paper we put the main emphasis on the existence
problem and constructions of realizations from the input-output map (or i . o . series). The existence problem seems~
already, fairly well understood. In ~ll the three approaches
the existence conditions are simple and can be split into a
regularity condition (analyticity or smoothness) and a rank
condition which is responsible for the finite dimensionality
of the realization. In the formal power series and Volterra
series approaches the regularity condition takes the form
of a growth condition on the coefficients (Volterra kernels)
of the series. To have the existence of global realizations
one has, in general, to impose an additional condition of
extendability. This is directly related to the property of
ordinary differential equations which can be solved equally
well for positive and negative times.
Our second aim is to show the interconnections of the
3
M. Fliess and M. Hauwinkel (eds.), Algebraic and GeOlrU!tric Methods in Nonlinear Control Theory, 3-31.
1986 by D. Reidel Publishing Company.
B. JAKUBCZYK
We recall that
if
ul [O,t] = VI [O,t]
(respectively ul [O,t) = VI [O,t
implies
that
(Fu) (t) = (Fv) (t)
for any u,v E U and t E [O,T) ,
where ~II denotes the restriction of the function ~ to
the interval I.
If F is causal (strictly causal), then it has a well
defined value
(Fu)(t) E Y depending only on a =ul [O,t]
(on
a = u[O,t)'
t = 0,
is denoted
Consider a system
f(x,u),
h(x,u),
x(O) = x o '
y(t) EY.
x(t) EX,
u(t) En,
( 1. 1 )
(for
(Fu) (t) ,
t E [0, T)
( 1 .2)
B. JAKUBCZYK
a ES
will be
a(-r) = u.
i = 1, ,k and
are denoted by
for
lal = Tk
= 0, T. = t 1 + . +t.,
~
~
Similarly, the elements a E S
-r E [T. 1,T.),
~-
a = ( t 1 ' u 1) (t k , uk) uk + 1
where, additionally, a(T k ) = u k + 1
Let F: U -+ V
be a causal operator. We shall say that
the operator F is of class Cr
if the maps
(t 1 ,,tk ) -+<F,(t"u 1 )(tk ,uk ) u k + 1 >
are of class Cr
on E~ for any!:! = (u 1 , ,uk + 1 ),
(1.3)
k~O,
k = {~ER+
k\ t + ... +t <T},
u i Erl, where ET
!: = (t 1 ,.,t k )
k
1
Let rl be a subset of Rm. The F will be called jointly
of class Cr if the maps
(1 4)
Cr
are of class
on
sup
~,!,~
rank
where a = (t 1 ,u 1 ) .. (tk,u k )
all k > 1 and all sequences
-
"'k
d <
{~
{c ij }i,j=1
for a
b = (b 1 ' ... , b k ) E S
such that \ a \ + \ b j \ < T.
Theorem 1.1. [31]. Let rl be a compact subset of Rm. The
causal operator F has a jointly analytic realization iff
F is jointly analytic and rank F is finite.
In the special case of finite rl this result was
shown in CelIe, Gauthier, Milani [5], cf. also the
contribution in this volume of these authors and Bornard.
For noncompact rl a jointly analytic F with finite
rank may not have a realization, as it is in the case of
F defined by the following system [20]
jt
= (x+u)
-1
x(O) = 0,
y = x,
u(t) E (0,"').
( 1 5)
B. JAKUBCZYK
x (0) = 1,
Y = x,
u ( t) E (- 00 , 0) .
(1. 6)
I t 1 I p (u 1 ) + . + I tk I P (uk)
<
1.
( 1 .7)
In the case of system (1.6) the F is locally j.a. extendable with the function p(u)=-4u
More generally, if F has a jointly analytic realization on Rn , then F is locally j.a.extendable with the
function
p(u) = c- 1 supllf(x,u)ll,
where II II is the Euclidean norm in Rn , the supremum is
taken over a compact neighborhood K of Xo and C is the
distance between x
and Rn " K. The same is true if F
o
= F(au),
{Fa}=F
(then the
{hal
a.
. -1 '
l.,J-
10
B. JAKUBCZYK
<F,a'b>
tiE R,
u i En}.
generated by the
11
(T
1 ; v 1) . . ( T P ; V p) ]
T=oo)
( 1 8)
sup
~-,!,E.
a -
B. JAKUBCZYK
12
sup
~,!,,'
where
all
rank
a = (t 1 ,u 1 ) ... (tk,u k )
k > 1,
~ E ~k,
t E R~
~rank
1,J=
,e.'
E Sk
such that
gives,
countable realization of
F.
Let
operator
= rank F.
F.
L
l3
L
L
L
is locally minimal
is dimension minimal
is accessible and observable at
xo.
14
B. JAKUBCZYK
respectively, minimal and accessible (observable and minimal), analytic, complete realizations of F, then there
exist a surjective submersion (injective immersion)
~: X'+X
which carries E'
to Eo Other refinements of
Theorem 1.14 can be found in [18], [19].
In the original Sussmann's definition minimality is defined as orbit minimality and observability. The orbit from
Xo of a system E is the set of pOints reachable from Xo
piecewise by trajectories of the vector fields f( ,u),
u En, admitting going backwards in time. Orbit minimality
of E means that the orbit from x
is the whole X.
It was proved in [41] that theOorbit is a submanifold
of X. This suggests the first step in, proving'the existence
part of Theorem 1.14: restrict the system to the orbit from
xO. The second step is to make the system observable by factorizing the new X through the equivalence relation of indistinguishability. This was carried out in [43] and [44].
1.4. Constructions of realizations
1.4.1. Local construction [29]. We shall outline an elementary construction of the local realization in Theorem 1.5
which, with a modification, also works in the case of
Theorem 1. 3 .
Let a = (t 1 ,u 1 ) . (tk,u k ) and ~, t = t* and b be
such that the supremum in the definition of rank F is at1
k
tained. Consider the function ~ = (~ , ... ,~ ), where
~j(.!:)
Then rank
n
d~
= <F,(t1,u1)(tk,uk)bj>.
(.!: *) = rank F = n
n: R + R
and a projection
rank d~ (x ) = n, where
such that
~=7TO~On
and
= t*.
o ERn is such that n(x)
0
Let X be a neighborhood of Xo in
is a local diffeomorphism on X. Define
h(x,u)
<F,
~(n(xu>,
where we denote
For abE S
define
Rn
x EX,
such that
15
<Pb (x) =
where
--1
1jJ
'IT
ljJb
ljJb
n (x) ,
and
ljJa<.:!:.l = <F,(t1,u1)(tk,uk)bbj>.
Take
f(x,u)
It can be proved ([29]) that
l: = (X,f,h,x )
is a local
o
Fa' where
k,
a = ( ti ' u 1 ) .. (t uk)
In the analytic case of Theorem 1.3 it is possible,
using also negative tt,
to choose a so that its effect
on
e,
[29].
1.4.2. Global construction via "gluing together". This construction can be used for proving Theorem 1.4 and (after a
preliminary step) for proving Theorem 1.1. The construction
goes in two steps.
Step 1. For any a E S construct a local realization l:a of
the shifted operator
Fa'
l:a'
a E S,
16
B. JAKUBCZYK
Vc EG.
h(x)
and
= P(bc)
[e) ,
P(a),
where
is the identity in
bEG,
<Pb(x) = [ab],
Define
G.
by
x = [aJ.
G,
and
For
<P
is smooth,
= ~<P
dt (t,u) ( x ) t=O
1jJu: R -+X
are
by
such that all the maps 1jJu are continuous. Analogously, the
Cr
differential structure on X is the strongest Cr
differential structure such that the maps 1jJu are of class
Cr.
It was proved in [25] that X with this topology and
differential structure is a Hausdorff, differentiable ~aniis
fold of class
dim X = rank P,
and E = (X,f,h,x )
o
(minimal, in the
a global realization of F of class Cr
analytic case) .
Another possible interpretation of the above construction is to view G as a sort of infinite dimensional Lie
group. The differentiable structure on G can be given by
functions
class Cr
q):
G -+ R
such that
with respect to
17
u 1 ' .. , Uk
X
En.
G/G,
o
Then
where
Go = {a E G I P ( ab)
Vb E G}.
P (b)
fu'
1-
the function
Differentiating consecutively
ficients
f
The collection
{L
u1
. f
u 1 u k + 1
uk
uk
, . ,f
(h
uk +1
u1
) (x ).
0
(2.1)
<Q,w>w.
(2.2)
wE~*
~+.
18
B. JAKUBCZYK
l:
R~>~
wE~*
QQ'
l:
l:
w'w"=w
wE~*
<Q,w'> <Q',w"w.
(2.3)
l:
wE~*
= [...
[Ci.
1'
Ci.
is
2 ] , , Ci. k ]
Lie
rank of a series
k
k >
w E S1 + , and
<Q,u 1 . u k + 1 > = f
u1
uk
(h
uk + 1
) (x ).
0
(2.4)
Z=R[[s1, . ,sn]]
A=A(s1, ,sn)
19
(2.5)
for any w=u 1 ... u k + 1 En+. Two local analytic realizations of Q of dimension n=rank Q are related by an
analytic, local diffeomorphism. L
Part (b) of this theorem can be applied for obtaining
a local analytic realization of the form
*=
g (x) +
o
m
E
i= 1
u.g.(x),
1.
1.
(t)
x(O)
h (x) ,
(2.6)
t
+<Q,i 1 ... i k > f
1.1 l.kE~
0
E.
20
B. JAKUBCZYK
local analytic realization of form (2.6) (for e sufficiently small) iff rankLQ is finite and Q satisfies condition
(2.5) in Theorem 2.1.
Remark. In the strictly causal version of Theorem 2.1 the
function h is independent of u. Therefore, the coefficient <Q,u 1 u k + 1 > in (2.4) is independent of u k +,.
It
is then better to denote (2.4) by
we define another series
In this way
.1
n + 1
commutative variables
\jiu = (\jiu1,.,\jiun)
and
X =
(x 1 '
,X n ),
X = \ji
-1
Define
21
for
.a/as.,
UJ
where
Q.
and let
alP
\P n + 1 = a \Pi/as,
for any
and
k = (k 1 , .. , k m),
ki
where
O.
s=(s1, .. ,sm)
1:
p.+q.=1
~
where
~,J=1'
\P.(1/J1,
.. ,1/J n ,0, ,0) = \P.(s1'
. ' sm) ,
~
~
have a unique solution
}z:+ . '
P1
Pk
q1
qk
<Q,u 1 ... u k ><Q' ,u 1 ... u k >,
0,1
and
u oi
denotes the
= <Q,wW>.
QiJ. = R[
Then
gebra
{Qij}
be such that
wi
]R
Vj
Q,
i,j = 1, . ,n.
Rn. Denote
{H ij } = {Qij}
-1
22
B. JAKUBCZYK
operators
K.
in
R:<rI
given by
2: H . .W(R[
]Q).
j=1 ~J
Wj
Define
h u (s 1 ' .. , s n ) = <exp (s 1 K 1) . exp (s n Kn ) Ru Q, A> ,
f
~-
were exp(tu)
denotes the formal power series
2:t i (i!)-1 u i
and we use the usual product (2.3) in RrI>~
With this
product S(rI)
forms a semigroup with identity (monoid) and
G(rI)
forms a group, where
23
G(~)
as well as
-+
This lemma shows that all the results of Chapter 1 concerning piecewise constant inputs can be translated into the
formal power series form. Namely, one should represent the
piecewise constant input jt 1 ,u 1 ) ... (tk,u k ) by the series
exp(t 1 u 1 ) .. exp(t 1 u 1 )
<F , (t 1 ' u 1) ... (t k , uk) > = <Q , exp ( t 1 u 1) . exp ( tk uk) > ,
for t
in a neighborhood of 0 E R. Conversely, if for a
given Q E R~ each such series converges locally and so
defined analytic function extends analytically to the set
t 1 + ... +t k <T,
ti~O,
then the Q defines via (2.9) a
unique analytic, strictly causal operator
There is also equivalence of ranks.
F: 11.
is analytic and
-+
V .
is defined by
(2.8)
24
B. JAKUBCZYK
m
}{ = g (x) + l: u.g.(x),
o
i= 1 ~ ~
where
x(t) ERn,
ui(t) ER,
x(O)
y(t) ER
= (Fu)
h (x) ,
and
T >0
(3 1)
g(x o ) =0.
and
e;
>0
such
25
6*
= 6 * U { A} ,
Denote
where
a . = (i., a .) E 6
where
aE[O,T).
and
Define, inductively,
<w a ,a 1 [a p ] . a k > = <W a ,a 1 a k >,
<W a , a 1 .. La p a q ] . ak> = <W a , a 1 . a p- 1 [a p .. a q- 1] a q . a k >
- <W a , a 1 ... a p- 1 a q.[a p .. a q- 1la q+ 1 a.K> ,
where
1~p<q~k.
W gi-
(3.5)
1,
<Wa ,w>,
+
wE A,
6,
with the
26
B. JAKUBCZYK
= rank W.
go = 0
in (3.1), then
where
vl
= {Ww}WE/',*
wE/',*
WW w.
27
One can easily see that the definition of rank Wand the
definition of the Lie rank in Section 2.1 coincide in this
case, i.e.
rankLQ = rank W,
Theorem 3.5 was proved for constant Volterra kernels in [34],
in the course of proving an equivalent of Theorem 2.2.
3.2. Crouch's theory of finite series. In [6] the following
structural results were obtained for minimal realizations of
finite Volterra series. Define rank .W as in (3.5) but
]
and
o =
are polynomial
such that
and
A 1X 1 + d 1
A 2 X 2 + d 2 (x 1 )
and
i=1, ... ,m
g.1
=
b.lp (x 1 ',x p- 1)
n.
nj = rankjW - rank j + 1W and Xj E R J,
the Lie algebra L generated by go, ... ,gm is finite
where
(d)
28
B. JAKUBCZYK
COO
Institute of Mathematics
Polish Academy of Sciences
00-950 Warsaw, Sniadeckich 8, Poland
REFERENCES
[1]
Z. Bartosiewicz, 'A new setting for polynomial continuous-time systems, and a realization theorem', IMA
J_. Math. Control & Information ~(1985), 71-80. - -
[2]
[3]
[4]
[5]
F. Celle, J.P. Gauthier, E. Milani, 'Existence of realizations of nonlinear analytic input-output maps',
IEEE Trans. Automat. Control. (to appear) .
[6]
[7]
[8]
[9]
29
[10]
[11]
[12]
[13]
M. Fliess, 'Realizations of nonlinear systems and abstract, transitive Lie algebras', Bull. Amer. Math.
Soc. (N.S.) ~(1980), 444-446.
[14]
[15]
[16]
[17]
M. Fliess, 1. Kupka, 'A finiteness criterion for nonlinear input-output differential systems', SIAM. J.
Control. Optimiz. 21 (1983), 721-728.
[18]
[19]
COO
systems', SIAM
----
COO
[20]
[21]
[22]
[23]
30
[24]
B. JAKUBCZYK
[25]
B. Jakubczyk, 'Existence and uniqueness of realizations of nonlinear systems', SIAM J. Control Optimiz.
lJH1980), 455-471.
[26]
[27]
[28]
[29]
[30]
[31]
[32]
B. Jakubczyk, in preparation.
[33]
[34]
[35]
R.E. Kalman, P.L. Falb, M.A. Arbib, Topics in Mathematical System Theory, New York 1969.
[36]
[37]
A.J. Krener, C.M. Lesiak, 'The existence and uniqueness of Volterra series for nonlinear systems',
IEEE Trans. Aut. Cont. AC-23(1978), 1090-1095.
[38]
31
[39]
[40]
[41]
[42]
-- =
H.J. Sussmann, 'Minimal realizations of nonlinear systerns', in Geometric Methods in System Theory, D. Mayne,
R. Brockett eds. D. Reidel 1973.
[43]
H.J. Sussmann, 'A generalization of the closed subgroup theorem to quotients of arbitrary manifold' ,
J. Diff. Geom. 10(1975), 151-166.
[44]
[45]
Christophe Reutenauer
Universite du Quebec a Montreal
and
CNRS (Paris)
1.
INTRODUCTION
33
M. Fliess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory, 33-43.
1986 by D. Reidel Publishing Company.
c. REUTENAUER
34
4(t)=Ao(q) + E u.(t)
i=l l.
{
(1)
y(t)=h(q)
y(t)=hl (0) + E
E
(A . A. hi (0
q
":2:0
. J\)
.
Jo
J"v
q
v
Jo
f~ de . de
J"v
Jo
(2)
where Iq(Q) means evaluation in q(O) and where the iterated integrals
fd~j d~.
are defined by the formulas (u O is the constant function
V
JO
equal to 1):
and i f \) :2: 1
't'
[u j (,) dT f 0 dC
. d~. J
\)
J\)_l
JO
Actually. the input-output behaviour of system (1) is completely
defined by its generating series ([4J p.12). which is a noncommutative
formal power series in the variables xO ,xn :
g=hlq(O) +
E
E
(A . A. hi (0) x . x.
\):2:0 jO'" ,j\):2:0
JO
J\)
q
J\)
JO
(3)
35
are commutative variables) is convergent if it converges in a neighborhood of 0; similarly, we say that a formal vector field, that is, an
operator of R[[qJJ of the form (*)
A=
l::;lc;d
Sk(q1,,qd) a!k
* (g,w)=h(pw)I O
VWEX,
(that is, (g,w) is the constant term of the series which is the image
of h under the operator pw).
We call the couple (p,h) a differential representation of g, of
dimension d.
It is now obvious that if g is the generating series of system
(1), then it is produced differentially (take local coordinates around
q(O) in Eq. (3)). Conversely, if a series g is produced differentially, then one may associate to it a system of type (1), whose generating series is g. Thus, the study of differential repres'entations
of series is equivalent to the study of local realizations of systems
like (1).
Before stating the main result, ~e need some notations. Denote
by R<X> the set of noncommutative po1ynomias, that is, formal series
having only a finite number of nonzero coefficients Then RX
i~
isomorphic to the dual of R<X> (because R<X> ~ R(X*) and RX ~RX ,
as vector spaces), with duality
RX
(S,P)
x R<X>
~
The set R<X> possesses an associative product, which extends linearly the concatenation of words in X*
Thus, the algebra R<X> acts
(*)
36
C. REUTENAUER
P= L (S,Pw)w
WEX*
S= L (S,wP)w
WEX*
(,4)
{p glpEL<x>}
We say that a formal series gERX satisfies to the convergence
hypothesis (C) if
(C) For any Lie polynomials P 1 ' . 'P , there exist constants a and C
such that
q
i1
i
i 1 ++i
V i 1 , ,iq EN,I(g,P 1 Pqq)l~ac
q i 1 ! i q !
Note that one may replace il! .. i ! by (i 1 + ... +i q )!, because these
numbers satisfy to
q
i1 + +i
i 1 !iq ! ~ (i 1 ++iq )! ~ q
q i1~iq
Now, we can state the following fundamental result.
Theorem (F1iess [5J)
A series g E RX is differentially produced if and only if its Lie
rank is finite and if it satisfies to the convergence hypothesis. In
this case, its Lie rank d is equal to the smallest dimension of' all its
differential representations. If (U,h) and (u',h') are two differential
representations of dimension d of g (with the same R[[qJJ), then there
exists a continuous and convergent(*) automorphism of R[[qJJ such
that h'=~(h) and ~(kuw) = ~(k)U'w for any word w and any series k in
R[[qJJ.
Note that the fact that ~ is bijection is equivalent to the we11known jacobian condition
J Cl~(qj)
Clqi
(*)
~(qi)
(o)l;eo
is convergent and without constant
37
The third part of this paper is devoted to the proof of the theoWe need before some definition and results.
On RX is defined another product, the shuffle, which is associative and commutative and which will be important in the sequel;
this is not a surprise, as the shuffle intervenes already for systems
of the form (1): indeed, by F1iess ([4J prop. 11.4), if two systems
admit the generating series gl and g2' then the system whose output
is the product of the outputs of these two systems admits as generating series the shuffle of gl and g2.
If w is a word of length Iwl = p (w = Y1.~ , Yi E ~) and ~
c {l, ,p}, denote by wlI the word Yi Yi ' wl~h I = {11 <.<l r }
Then the shuffle u*v of two words u anat v isr defined as
rem.
u*v = L w(I,J)
where the sum is extended to all partitions (I,J) of (l, ,lul + Ivl)
with Card (I) = lui, Card (J) = Ivl and where the word w(I,J), of
length lui + lvi, is defined by
w(I,J) II
= u,
w(I,J)IJ
=v
. P
(16)
38
C. REUTENAUER
Lemma 2
If P is a Lie polynomial, then
c (P) = pel. 1 + 1.P .. 1 + 1.1. .. eP
p
PROOF OF THEOREM
3(h~w) (q,~p)JI
3q,1
l <'<d
_1_
= ~ (T"w)(qi~P)1
,1
r r
C q
=a(l-Cq)
-1
il
id
in the sense that the coefficient of ql qd is bounded by
il+" .+id
a C
It is then easily shown that h~w (w of length p) is
bounded by the series f~P, where ~ is the differential operator
~=d_a_.L
l-Cq 3q
(*)
39
f~P
= a(daC)P
1.3 (Zp-l)
(l_Cq)Zp+l
Hence, we obtain
a(daC)P(Zp)!
Zp p!
As (Zp) is bounded by ZP, we obtain
p
I(g,w) I s a(ZdaC)P p!
SD
i l + . +i
q(il++i q )!
which proves (C), in view of the remark following the definition of (C).
b.
We come now to the converse, which is the essential part of the
theorem. Let g be a series of Lie rank d and which satisfies to the
convergence hypothesis.
Let Lo = {p E L<X> I pog = a}. By assumption, Lo is of finite codimension d in L<X>. Moreover, it is a sub -Lie- algebra of L<X>.
Let Pl,""Pn "" be a basis of L~X> such that Pdtl""':n"" is a basis of L. Let Sl"",Sd the ser1es defined by ~s.,P.) - O. . and
j
=0
rem).
Then
if r
=0
or if jl+ ... +j
1,J
(1)
L:
il' . , id~O
Note that the Si's have zero constant term, which ensures that
the sum is well-defined.
In fact, we shall prove a more general result.
Proposition 1
Let Lo be a sub-Lie-algebra of L<X> of codimension d. Let P1 P d
be a basis of L<X>modu10 Lo and Sl 'Sd series.in RX without
constant term such that (Si'P,) = 0i j and wich vanish on the left
o
_
_
ideal J = R<X>L.
Then
J
~ = {sl(s.p)
Moreover, any S E R[[Sl' Sd]] has a unique expression as series in the Si's.
40
C. REUTENAUER
Proof
1.
We show that Ji contains R[[Sl"",Sd JJ As Ji is closed for
the usual topology of RX and closed for the operation T + ToP
(p E R<X, because J is a left ideal, it is enough to show that it is
also closed for the shuffle. Let S,T in JL: it suffices to show that
(ST)ow
vanishes on Lo for any word w (it will imply that ST vanishes
on X*L , hence on J).
Lemma 3
T + Tox is a derivation for the shuffle.
By the lemma (which is well-known), (ST)ow is a linear combination of series of the form (Sou) (Tov). As S,T vanish on J, we obtain
that Sou, Tov vanish on L
Lemma 4
Let i > j, T1 , . ,T. series without constant term and Q1, ,Qj be
Lie polynomials. TBen
0
(T 1 T i , Q1 . Qj) = 0
The lemma follows from lemma 1 and 2: write that (T .. T.,
Q1 Q.) = (T1~ T., c.(Q1 Q.)) and note that c.(Q ~.), ~ich
is theJproducE from 1 t6 j of J
1 1
J
Q~tl1
..1S1 +
. ~1 + 1$1 eQ~
1.Q~
1 1 ""
For I
1
= (i
1 , ,id )
.
,1 d
11 ,,1d
, let a(I)
S(I)
= s111"'Sd--d,
. + ... + i d an d I' -- 1. 1 1
I
I
P(I) = P1 1 . p d1d ' III -- 11
d BY t h e P-B-W
theorem, we have to show that both sides of Eq (2) have the same value
on any polynomial of the form
;1,1
ir
P1 "'P r ' r ~ 0, 1_~ i 1 <<i r , 1 ~ j1, jr
But, if r > d, then this polynomial is in J, hence both sides
map it to zero. Hence, we have to find coefficients a(I) such that
VJ
(S,P(J))
=E
a(I)(S(I),P(J))
(3)
Lemma 5
(i)
If III > IJI or if III = IJI and I ;II; J, then (S(I),P(J)) = 0
(ii) (S(I),P(I)) =-rr-If III > IJI, use lemma 4. Otherwise, use lemma 1 and 2 to prove that
41
which is true for any series Ti without constant term and any Lie polynomials Q..
Lemma J 5 shows that Eq. (3) is a triangular system of linear equations in the a's, with I! on the diagonal. Hence, it admits one and
only one solution, which proves the proposition.
Eq. (1) is proved by using the fact that in this case one has:
III < IJI implies (S(I), P(J = 0 (use lemma 1 and 2).
Eq. (1) gives almost the differential representation of g. Indeed, g is given in (1) as a commutative series in Sl, ,Sd' and by
proposition 1, R[[Sl, .. ,SdJJ is isomorphic to an algebra of commutative formal power ser1es in a variables. We have to define ~ and h. We
let h = g and define ~w (w E X*) as T ~ Tow.
By lemma 3, ~x is a continuous derivation, which maps R[[Sl' '
SdJJ into itself (by the proposition). Moreover
(g,w)
= (gow,
1)
= (h
~w,
1)
i ,J.
(4)
0..
J,1
,,~
~x
= T. ox
J
42
C. REUTENAUER
satisfy to (C) and are thus convergent series in the S.'s; hence, they
are also convergent series when expressed as series in 1the Ti's. All
this shows that (~,h) is a differential representation of g.
c.
Now, let g be a series of Lie rank d and (~,h) be a differential
representation of dimension d of g. We use the notations of paragraph
b.
Lemma 6
The mapping n: R[[qJJ-+ RX which maps k onto ~ (k ~wl)w is a
w
continuous homomorphism (for the shuffle), such that for any word w
one has n(k ~w) = n(k)ow.
This lemma is a simple consequence of [4J prop. III. 1.
Lemma 7
The mapping 0: RX -+ RX which maps S onto ~ (S,P(I)) S(I)
I!
n (kdt)
References
[lJ
P. d'Alessandro, A. Isidori, A. Ruberti: Realization and structre theory of bilinear systems, Siam J. Control 12 (1974) 517-535.
[2J
R.W. Brockett: On the algebraic structure of bilinear systems,
In: Theory and Application of Variable Structure Systems (Mohler, Ruberti, ed.), Acad. Press (1972) 153-168.
[3J
M. F1iess: Sur 1a realisation des systemes dynamiques bi1ineaires, C.R. Acad. Sci. Paris A 277 (1973) 923-926.
[4J
M. F1iess: fonctionne11es causa1es non 1ineaires et indeterminees non commutatives, Bull. Soc. Math. France 109 (1981) 3-40.
[5J
M. F1iess: Realisation locale des systemes non 1ineaires, a1gebres de Lie fi1trees transitives et series generatrices non commutatives, Invent. Math. 71 (1983) 521-537.
[6J
W. Gr8bner: Die Lie Reihen und ihre Anwendungen, Berlin, VEB
Deutscher Verlag der Wissenschaften (1967).
[7J
R. Hermann, A.J. Krener: Nonlinear controllability and observabi1ity, IEEE Trans. Automat. Control 22 (1977) 728-740.
[8J
J.E. Humphreys: introduction to Lie algebras a~d representation theory, Springer Verlag (1980).
[9J
B. Jakubczyk: Existence and uniquences of realizations of nonlinear systems, SIAM J. Control Optimiz 18 (1980) 455-471.
[10J
R.E. Kalman: Mathematical description of linear dynamical systems, SIAM J. Control 1 (1963) 152-162.
[llJ
[12J
H.J. Sussmann: Minimal realizations and canonical forms for
bilinear systems, J. Franklin Inst. 301 (1976) 593-604.
[13J
H.J. Sussmann: Existence and uniquences of minimal realizations
of nonlinear systems, Math. Systems Theory 10 (1977) 263-284.
43
Zbigniew Bartosiewicz
Instytut Matematyki
Politechnika Bialostocka
Wiejska 45, Bi~ystok
Poland
1. INTRODUCTION
This paper is a continuation of the previous author s paper
[1J, where a new definition of a polynomial input-output
system has been proposed. We have been interested in immersions of nonlinear systems into polynomial ones (immersions
into simpler systems as affine or linear ones have been
studied by Fliess and Kupka [4J, Claude, Fliess and Isidori
[2] and Claude (3J). The observation algebra of a nonlinear
system has appeared to be a useful tool for examination of
this problem. Finite generatedness and regularity of the
observation algebra are necessary and sufficient in order
that the nonlinear system may be regularly immersed into an
algebraically observable polynomial system. Here we study a
more general case - without assumptions about regularity and
observability. Now the observation algebra does not have to
be finitely generated but has to be a subalgebra of a finitely generated algebra. However , we complete the result of
[1] proving that if the observation algebra is finitely generated then there is an immersion into an algebraically
observable polynomial system.
The results on immersions into polynomial systems give
an idea how to study conditions under which a response map
may be realized as a polynomial system. We use here the theory of Jakubczyk [6]. It appears that the construction of a
realization given in (6) allows to define an algebra of
functions corresponding to the response map which plays a
role of the observation algebra. We have to emphasize that
polynomial realizations considered here are not canonical or
minimal in the usual sense. Henoe, the dimension of a polynomial system may be much bigger than the dimension of the
minimal realization.
We also have to mention many connections with Sontag's
work [8J, which deals with discrete time polynomial systems.
45
. M. Fliess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory, 45-54.
1986 by D. Reidel Publishing Company.
46
Z. BARTOSIEWICZ
2. POLYNOMIAL SYSTEMS
Let us recall some notions given in [1J.
By the control space U we mean an open set in Rn and
the output space is Rr. We also have the set U of admissible
controls, which are measurable functions u: [0, Tu ] ~ U
with Tu ~ 0 depending on u. We assume thatU. contains the
piece-wise constant controls.
Definition 1. An input-outPUt system 2: is a triple (M,S,h)
where M is a 0 00 manifold, S is a family {So( I oC eU} of 0 00
vector fields satisfYing the following condition
for all u E U. and mE M the system
~t
and
If(t} =
3 u (t)<'f(t)
,~(O)
Definition 2. The observation algebra of a system ~ denoted by 1C or d{(2:;) is the smallest subalgebra of Coo(M,R)
containing hi' i=1, ,r , and stable under the action of
the vector fields of 2 . If ~ has no zero-divisors then Q
or Q(~) means the field of quotients of ~ and is called
the observation field of the system z:
an alfebra over R we mean a pair (A,~), where A is a
commutat ve ring with identity and If: R--+ A is a ring
homomorphism with ~(1)=1 A homomorphism of algebras
fJ.: A~ B means a ring homomorphIsm whose restriction to R
(identified with ~R is the identity. An algebra A is
called finitely generated if there are a1' '~ A such
that R[a1' '~) = A. Then a1' '~ are said to generate
the algebra.
If A is an algebra, X(A) denotes the set of all homomorphisms It: A--+ R. Let X be a set and A be an algebra of
real functions on X. For x E X by ex we denote a homomorphism
ex : A--+R, defined by ex(a) = a{x), called the evaluation
at x.
By
47
z. BARTOSIEWICZ
48
v~(t)(a1""'~)(im,u(t)
= v~(t)('Y1(t),,'Yk(t).
Henoe, 't" satisfies the equation
~t 'Y(t)
which means that
h~('t'(t
2~(t)('V(t
for t(O,Tm,u)
~= f~(m),u . Moreover,
= wi (a 1('m, u (t), '~(~m,u (t) )
= hi(\fm,u (t)) ,
= (M, S
49
equations on M=R
x2
x1 = e
U E
R.
=e
x1
x 2 ':'\
-:.\
~ +"'*2
x1
x 1 x2
X1( X2)2
X1( X2)3
is generated bye, e e , e
e
, e
e
, .
x ( x )k
~ itself is not finitely generated since e 1 e 2
cannot
x
x1 x 2
X1( X2)k 1
be obtained as apolynomial of e 1 , e e , , e
e
-,
but 1{ is contained in the algebra generated by
x1
x2
x1
x2
e
and e Let a 1 (x 1 ,x2 ) = e
, a 2 (x 1 ,x2 ) = e Then
h = a 1 and SoC. a 1 = a 1a 2 , SoC. a 2 = '" a 2 Henoe Z may be immersed into I:: , ... (M',:=;',hl) where M' = 1R2, (z1,z2) = T(x 1 ,x2 }
x 1 x2
..... I
. '0
()
I
= (e ,e ), '::'ot(z1,z2) = z1 z 2'<fZ + oC. z 2 'Oz:"' h(z1,z2)=z1
and
-&1 ...
Zo BARTOSIEWICZ
50
..-.
.::. 0<:
.-.
1
't~O{i)
.::..
o(k
= 0d{ 0
hi'"
"(..... I
;::, 0<.
.-, I i )
.:::.'"
hi
which gives
Bel then
o1
algebraically dependent).
Cs
But '2
i:=: i h j
0(.1
o(s
...
= wi (x 1 , ,xk ) = xi
in
4{ (~I).
CI
'1
o(.s
51
(6). Let the control spaoe U and the output spaoe be fixed.
We limit ourselves here to piece-wise constant controls although Jakubczyk's paper provides a setting for more general
classes of controls. Thus, let U mean the class of piecewise constant functions u : to, TuJ ~ U. If u Ct) = "'i for
t [6"i_1' 6"i)' i=1, ,k , 6 k=Tu and 6 i = ~=1 tj , 6 0=0,
tiC: IR+ = [0,+00), we use the notation u = Ctkt""k) (t 1 ,cx.. 1)
Let us introduce the multiplication in the set ~ by
vu = ('tm'~m) ('t1'~1)(tk,ok) (t1'o(1)
where
v = ('lm'\'>m) (T1,fo1) The neutral element (identity) e is
the empty sequence identified with all u=(O,cC.), 01.. e U. We denote by S the semigroup given by the set ~ and the multiplication defined above. There is a natural action of IR+ on
S defined by tu = (ttk'~k) (tt1'o(1).
Definition 7. A map p: S~ IRr is called a response map.
A response map is a formal description of "response experiments" carried out with some "system" black box We apply
a control u and after time Tu measure the output consisting
of r real parameters.
Defini tion
p
from a
p~( u,m) ..
be defined
Defini tion 9.
A (C CO)
realization I:: x
= (z: , x o ) of a
!! = (u 1 , ,up ), p~1,
S. Define 'V ~: R~ ~ IRkr
k~1,
Z. BARTOSIEWICZ
52
d{
(p) c A, where
(~)
Proof. (A sketch) The idea of the construction of the reais similar as in the proof of Theorem 1. Let A be
generated by f 1 , ,fk Then Pi=wi (f 1 , ,fk ), i=1, ,r,
f~= viCf1, ,fk), i=1, ,k, where wi and vi are polynomials in k variables. Define the realization (M,:=:: ,h,xo ) as
follows. Let M=Rk, x o=(f 1 (e), ,fk (e, h=(h 1 , ,hr ),
hi(x)=wi(x), XlRk, S,,(=~=1 vi&: . 0
IIZati~n
Corollary 3. If P is a COO response map and ~ (p) is fini tely generated then p has a polynomial realization. 0
Remark. The realization constructed in Theorem 3 is not
mInimal in general. It may be proved that if (M, '2 ,h,xO) is
a polynomial realization of a response map p then
dim M ~ tr deg 'J( (1::.) ). tr deg 'l{ (p) ~ rank p = n , where
n is the dimension of the minimal realization. This means
that in order that p has a minimal polynomial realization
it is necessary that tr deg lK(p) = rank p.
Now we are going to prove a necessary condition for the
existence of a polynomial realization. We limit ourselves
here to the analytic case. A realization (M,S,h,xO) is said
to be analytic if M is an analytic manifold, '2 is a family
of complete analytic vector fields and the components ~f h
are analytic functio~s. A response map is oalled analytio
53
Y"i
if the mappings
have real analytic extensions onto the
P
whole n Hence, for an analytic response map we may extend
this map to the group G obtained from S by admitting negative times in a sequence u=(tk'~k) . (t1'~1). We identify
(t 1 , oc.)(t 2 ,.,() = (t 1+t 2 ,..c.) and define the action of R on G
by tu= (ttk,"'k) (ttl' "'1 ) for t ~ 0 and tu= (ttl''''1) (ttk'..l~
for t < O. So, if p means the extension of p to the group G,
by~(p) we denote the algebra of functions on G defined in
exactly the same way as lK(p). A realization is called
weakly controllable if for any Xl'X 2 e M there is U G such
that ~x1,u(Tu)=x2' where Tu may be negative.
Theorem 4. If an analytic response map has a polynomial
analytic realization then '1{ (p) is isomorphic to a subalgebra of a finitely generated algebra.
Proof. (A sketch) Let (M,S ,h,xO) be a polynomial analytic
realization of the response map p. Then 1K(~)C A - the finitely generated algebra of Eolynomial functions on M. Now
shall relate "J(~) with 'l{(p) to obtain the required condi tion. Let 2:';1
be a weakly controllable and observable
Xo
realization obtained from ~x using the procedure described
in [9],[5]. It is constructed in two steps. Firstly, we consider O(xo' - the orbit of the system ~ from Xo which
appears to be a submanifold of M. Let i : O(xO)~ M be the
natural imbedding. It induces the epimorphism of algebras
i*: O'-'l(M)~OW(O(xo. Let. AI = i~(A) and -l{i = r*(1{(E. It
is easy to see that "l{'C AI and AI is finitely generated. Secondly, we consider indistinguishability relation in o(xo ) :
x'" y if for every u G h'fx,u=h"lfy,u and put MI = o(xo ) IN
It is easy to prove that the observation algebra of the system L! I is isomorphic to the algebra l{1. On the other hand
letus consider the equivalence relation in G : u'" v if
p<.wu )=p(wv) for all WE G. Define X=GI'" Now we consider
the algebra'1(Cp) generated by qi: X-+R, qi(lul) = Pi(uh
and invariant under the operations : f ---+ fo(.,
foC ( [u 1 ,) =
= ~t f(l(t,~)ul)lt=o Similarly as before it appears that
"1{ (p) is isomorphic to l{(p). The algebra 'i(p) is the observation algebra of the OW-minimal realization (i.e. weakly controllable and observable) constructed in [6}. Now,
54
Z. BARTOSIEWICZ
C3.J
t4J
t5J
C6J
r: 7J
r8J
[91
ABSTRACT
In this paper we review some ,results concerning the local
controllability of nonlinear control systems. We stress those results
which are most closely related to the existence of certain symmetries,
including results by the authors and H. J. Sussmann. We also comment
on the relation between this work and generalizations of Lie group
theory to semigroups and Lie wedges.
1.
Introduction
,xe:M
(1)
F
U Rt(x)
O<t<T
55
M. F1iess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory, 55-75.
1986 by D. Reidel Publishing Company.
56
interior RF (T,x o )
u
T>O
If
(T,x)
ac~essible
at
m
+
i;1
uiXi(x)
X E
M
(2)
where in this case we suppose that Q contains 0, and that the affine
span of Q is Rm. Moreover, if we are interested in local
controllability of the system about x o ' we assume Xo(x o ) ; O. As in
[21J, in the analytic case where the local accessibility rank condition
holds everywhere on M then int R~(Xo) * 0 for each T > 0, and hence
also lnt RF(T,X o ) * 0 for each T > O. It is shown in Sussmann [20J,
that system (2) is locally controllable at x o ' if and only if the
system is locally controllable at x o ' with the admissible control set
enlarged to include bounded measurable controls taking values in the
closure of the convex hull of Q. This mitigates our definition of
57
de g<5(Z)
of an element Z
Let 1T
58
CUT
1
(R.)
1
(X O )
Ct.
m
+
i-1
u.
z. ,
1
n}
defined by G.
where adX(Y)
[X,YJ is the Lie bracket and define inductively the
subspaces Rk and Sk of L(F) by setting
Rr
Theorem 2.1
Sussmann [19J.
<N
(4)
59
(5)
+ R2 (x 0) + R 1 (x 0 )
Rn, Xo
= 0,
Xo (-x)
and each Xi' 1
X. (x) = X. (-x)
1
e: n such
-Xu (x)
(6)
60
x = f(x,u)
Xo(O)
Xo (x) +
i=1
u.X.(x) +
1
i=r+1
uix i
(x)
[-1, 1J
U.
-f(x,u)
where g
gog
(8 )
xo
GJ
61
<
<
(9)
where (t,x)
n
(11
.(y)
i=l
As in
neighborhoodofOET
(x o ).
g((ll (
i=l
y i Xi) (x o ) ) =
(11
i=l
Yi
xgi )
(x o )
(A(g)y)
(10)
M.
Xo
From equation (10) we can d.educe some important facts. Let GO be
the fixed point set of G, GO = {x e: M; g(x) = x for each g E G}, and GO
a
the fixed point set of Ga , then it is clear that Xo is an isolated
~
62
P. E. CROUCH AND C.
r.
BYRNES
> n+1 is a
En has an expansion
i= 1
where (1i
Lemma 3.2
Let B
. v ) denote a basis of T
n
Xo
Xo
M.
Each set
M i f and only i f G
has no invariant
The main result in Crouch and Byrnes [4J may be stated as follows.
Theorem 3.3
63
( 11)
<m
~ j
F let
to F is a mapping
A stream relative
R+ xM -+ M
(t,x) -+ n(t)(x)
defined by an expression of the form
n(t)(x)
aU1('I(t))O
u.
a r('r(t))(x)
>
U.
F,
1 ~ i ~ r,
1.
G where
and
u. (g)
Xl,
Ui
' u i (g)
Il
<
<
n such that
+
and span {a 1 a }
n
eo
Rn.
(t
q.+1
1
( 12)
G, and
64
.
Sl(t)(O)
g
A(g) a.t
q.
+ 0 (t
q.+l
1
),
< i <n
Select gi
G so that vi
A (gi) a.,
Ji
<
<
<
(t)(y)
<
Let
( 14)
- e.
It follows that -i
S
0, and set
(+
tij(s) ) (y)
F wi th
(15 )
+
( I f a:-.
= 0 then we set SiJ'+-(s)(y)
y).
lJ
relative to F by setting
..
k'
S J(s)(y)
0
0
j-
(s) (0)
(s) (0)
e.s + O(S)
J
-e.s + 0(5)
J
(16)
65
-+
oj(s)(y) by
s > 0
={oj-(lsl)(Y) , s
<
It follows from
(16) that
nei ghborhood of 0 e: Rn wi th
~
~
(0, 0 0)
o s.
1
lji
is continuously differentiable in a
ei
' 1
<
<
R~ (x o ) , T ~ 0
e: [-1, 1 J
a3
66
Xo
(- WI XI - w2 X2 - W3 XS
(WI Xo + W3 X2 - 1Il2 XS
( 18)
x2
"2
1
2
Since x~ + xf + x~ + x~
in a neighborhood of X
= -1 ( W
X
11 -
=
=
XIX
(19)
fix
whereR o is the 3x3 identity matrix and RI = diag (1, -1, -1), R2 =
diag (-1,1, -1), R3 = dia~ (-1, -1,1). The action of G is simply
defined by (w,x) ~ (R.w , R.xJ , 0 < i < 3. It is clear fram equations
1
--
lI b (R
1f
2)
diag (-1,1)
b (R o ) = diag (1,1)
Thus the family of vector fields defining the systan of equations (17)
and (19) are indeed G invariant, and since G clearly has no non zero
fixed points in R 3 x R 3, by lemma (3.1), the hypotheses of theorem
(3.3) are sati sf ied. To deduce local controllability about x = w = 0
we need only check that the local accessibility rank condition is
satisfied but this is a standard argument as performed in Crouch [4J.
4.
4.1
In this section we describe some of the results of Sussmann [20J,
in relation to systan (2). The paper by Sussmann [19J should also be
consulted in this context.
We introduce m+l noncommuting
67
indeterminates
power
the
Z 0'
series
and
. Z
in Zo' Zl
L 0:1
form
Z1
over R.
homogeneous
degree
III
multiplication
elements
. Zi
,0
<
i, < m (of
J-
r).
Addition
is
performed componentwise and
performed as usual with ZI ZJ = ZI*J where I*J is the
Ri' where Ri
L(Z)
i= 1
That
i.
is,
Ri
is
an
associative
~i
S=
m
S (Zo +
i= 1
z)
ii'
( 20)
~k(Z)
A
linear
map
~k(Z)
inductively by setting
Ll (Z) = L(Z)
L( Z) ,in cas e
68
co
i=1
co
R.1 =
co
i=1
where
for k < i.
It
A/~ AI!.
As
in
I')
<
< co}
Al
of A (Z),
I')
degl')R = i.
(R)
Clearly
autanorphisms.
S(Z,O) i f
I')
I')p
homogeneous with
in Sussmann [20J,
As
I')
I')
is said to be compatible wi th
<
p ~
each u e: 0 and p, 0 < P < 1 there exists T > 0 and v e: 0 such that
m
I')
(Zo+
i= 1
u.Z.)=T(Zo+
1
i=1
v 1 Z.)
(21)
e:
L(Z)
is
homogeneous
said
to
'IT(R) (x o )
be
I')
neutralized
at
Xo
e: M if whenever R is
I')
i= 1
'IT
69
4.2
Theorem 4.1
Sussmann [20J.
<5
<5
neutralized at xo.
It
dt g
(exp tR)
It
g (exp tR)g(R)
for R E L(Z), which implies that along a solution Set) of equation (20)
we have
d
dt g
It
(S(t
m
(1TR) (x o )
If
the group G
: T
Xo
M ~ T
That is (1TR) (x o ) e: T
Xo
M.
g*
M is a fixed point of
Xo
But under the assumptions of theorem
L(Z) of
70
, and let S
(r) be the
u
v
u
element of S(Z,O) obtained by solving (20) with the control v, and
k+1
A (Z).
I t follows that r
m
2: u i
i=l
Zi'UO}.
If G is a finite group of pseudo autanorphisms of L(Z) satisfying
in addition
( 23)
A
<
1,1
of {Zo
Zo' 0k(Z.)
1
+
=
2:
i= 1
u. Z.
+ Z.,
1
i .. k
71
o (R)
B(R)
a e: G
for sane Lie monomial R e: L(Z).
may wri te G
am' m
k.
j.!
e: Sm' and
=
-R
o(R) +
B(R) =
element Zk'
<
<
a e: S
(R k )
( 24)
P (Zo) 0
(Z.)
1
e
p
Zi ' 1
oO(R) +
S.
e degree
S.
e L
m, for sane
e,
S. e < co.
If R
of R, by setting
oi(R)
i=1
{u ;
lUi
s.
1 , 1
S.
S.
Since in
72
deg a (R )
i= 1
Sussman
[20J shows that wi th thi s def ini tion of deg a (R) we may also take a
to obtain
Theorem 4.2
co
Sussmann [20J.
luil ~ 1, 1 ~ i ~ m} is locally
{u;
controllable at Xo e: M in case:
(i) it satisfies the local
accessi bil i ty rank condi tion at x o , (ii) there exists-a e: [l,coJ such
i
that whenever R e: L(Z) is a Lie monomial with cO(R) odd, 15 (R) even for
1
< i
(a)
'IT(R) (x o ) =
i=l
, a,1 e:R
a, 'IT (R,) (x o )
1
deg a (R i )
A
The two techni ques reviewed in sections (3) and (4), rely on seemingly
different methods to obtain sufficient conditions for local
controllability at xo' In section (3) symmetries are used to ensure
that the attainable directions at x o , VI v , (see (12), and lemma
n
M = {L al,v l,
Xo
i=l
convex cone generated by VI
(3.2
satisfy T
a,
> OJ.
1 -
vn '
That is, T
M is the closed
Xo
Sussmann proves theorem (4.1) in
In this
73
= {X
74
6.
References
1.
2.
3.
C. C hevall ey, The Theory of Lie Groups, Pri nceton Uni versi ty
Press, Princeton Mathematical Series No.8, (1946).
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
75
15.
A. J. Krener, "A Generalization of Chow's Theorem and the BangBang Theorem to Nonlinear Control Systems," S.I.A.M. J. Control,
Vol. 12, (1974), pp. 43-52.
16.
17.
18.
19.
20.
21.
22.
23.
Extrinsic Geometry
f(~)dt +
g(Odw
(lola)
d~
h(~)dt + k(~)dv
(1.1b)
~(tO)
~o
(1.1c)
The state process ~(t) and output process ~(t) evolve on nand p
dimensional manifolds Nand P. The driving processes wet) and vet) are
m and p dimensional independent standard Wiener processes. The
initial condition ~O is an N valued random variable independent of
wet) and vet). This (1.1) is a local coordinate description using Ito
differentials. We regularly abuse notation by confusing local coordinate
descriptions with the intrinsic objects they describe.
The nonlinear filtering problem is to compute in real time the
conditional distribution of the current state ~(t) (or some useful
statistics such as the conditional mean) given the past observations
~(s), to ~ s ~ t.
are known. This
problem. Kalman
which an efficient
problems.
dx
A(t)x dt
B(t)dw
(1 2a)
dy
C(t)x dt
D(t)dv
(1 2b)
(1 2c)
M. Fliess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory, 77-87.
1986 by D. Reidel Publishing Company.
78
A.J. KRENER
y.
= (t,x) and
79
EI;
3)
be
k(l;~
Intrinsic Geometry
. (E b. yj
J
E a Xl.J
l
. E. (a. b . g . yj +
I, J
I J Xl
X and
and
(~)
(2.2)
where
(2.3)
a
a~i
if
r kij
o.
is
A.J. KRENER
80
(2.4)
(2. 5a)
h(f,;)
(2.5b)
o~ for k
1
(2.6)
~-1
(-f)g, ad
~-2
'
(-f)gJJ>
(2.7)
81
ProQf.
indUC[i:n.ror
By
Row Span
and
Lk
f
= 1 , ,
l!I!.
a~
.
[d.
k.!.1
Row Span
k 1
Lf (dl/J)
(dl/J)
(2.8 )
Lf (d~)
Lk
f
(d~)
Mod (2.8)
(2.9)
therefore
g
al/J
(2.10)
a~
and
(2.11 )
Mod {g, , ad ~-3 (-f)g}
Now
p,O"
<
(2.12 )
al/J
-p
a~i
ad~-2(-f)gOL
(~-1)
ad
~-1
(al/J~
~l-
(-f)g
al/J.
~o
ad ~-2 (-f)g
L ~-2
~j
ad
(-f)g
by
82
A.J. KRENER
x
Y
Ax
Cx
(2.14a)
(2.14b)
(x
(2.15a)
x-x) dynamics
(2.15b)
x = (A + GC)x.
<k <~
k =
Then
r~j
i<
(2.16)
Ax + J(y)
(2.18a)
Cx
(2.18b)
83
We can add the same output injection to the observer to obtain the same
error dynamics
(A
x
(A +
GC)
x-
Gy
J(y)
(2.19a)
GC) x
(2.19b)
Under a change of
Clearly
~(~)
(2.20)
with
r~j = 0 we obtain a system of linear first order partial
differential equations for the ~i thought of as functions of ~,
-I:
r ij ,~m
k
't'
84
A.J. KRENER
The equations (2.20) and (2.21) are first order linear partial
differential equations for the desired change of coordinates. Moreover
we can address the solvability of (2.21) independently of (2.20). The
former are solvable if certain integrability conditions are satisfied
(the mixed partials must commute). These are given by
I: (r ir
k
m,r
(2.22)
0 where
q>:
in (2.22).
The
Ri~m are
'=
r ij _
k
r ji
k
(2.23)
i .
r~j = o. If this
can be transformed into (2.18) where the Chr i s toffe,l symbols are also
zero then from (2.4) we see that ~ is necessarily an affine function of
y; a~/ay = constant. Applying the argument of Lemma 1 we see that
g = li
ax
ay
a~
and so
(2.24)
must be a commuting frame. We call (2.24) the frame of basic vector
fields associated to the output map ~.
On the other hand if the basic vector fields (2.24) are a commuting
frame then we can choose state coordinates x so that they are a
coordinate vector fields. It is straightforward to verify in these state
85
W.
~1 ~ ~2
... ~ ~p
and ~1+"'+~P
(~1''''~)
is less than
~l
L (L k - 1 (,h.)
gJ f
"'I
<
<
~i
k = ~i
( '''.
"'I
) =
~j
>
~i
implies that
(2.26 )
A.J. KRENER
86
. -1
<di., ad J
1
(-f)gJ>
(2.2'7)
~. > ~i'
This is an underdetermined system of PDE's for
lolvable first of all the p dimensional column vectors
~or
To be
(2.28 )
should be functions of not ~ and hence define vector fields on the
output space. Then by the Frobenius Theorem i satisfying (2.27) exists
iff the distributions
C=span {yj:
ire involutive for i
~j > ~i}
=
1, ... p.
~ransformations
aai
if
Theorem 4.
(Krener and Respondek [2J) Let the dynamic observations
(2.5) be observable with indices ~1 ~ . ~ 1 . It can be transformed
into linear, output injected dynamlc observati~ns (2.18) iff
Formula (2.28A) defines vector fields on P and the distributions
(2.28b) are involutive. Hence special output coordinates exist.
(0)
(i)
~.-1
~.-2
(_f)gl, ad J
(-f)gJJ>
Hence r~j
P.
can be
r ij = 0 then the
k
{ad
.-k
.
(-f)gJ:
1_<
_<
p;
1 < k-,
-
87
~.}
is commuting.
References
[lJ
[2J
[3J
[4J
H. Fr i tz
Institut fur Systemdynamik
und Regelungstechnik
Universitat Stuttgart
D-7000 Stuttgart 80
West Germany
H. Keller
Institut fur Regelungsund Steuerungssysteme
Universitat Karlsruhe
D-7500 Karl sruhe 1
West Germany
ABSTRACT
*) This work was performed at the Institut fur Regelungs- und Steuerungssysteme of the University of Karlsruhe.
89
M,. Fliess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory, 89-98.
1986 by D. Reidel Publishing Company.
90
This paper only deals with single ou~put systems although the
design method can also be appl ied to nonl inear systems with more than
one output as shown in [3].
It has to be mentioned that this treatment was developed from an
engineering point of view and that it is rather an analytic approach
than a differential geometric one [4,5].
2. DEFINITION OF THE 'GENERALIZED OBSERVER CANONICAL FORM' (GOCF)
*( *
- a o xn '
.*
X.
(n)
, ~, ... , ~
* (*
x i - 1 - a i - 1 xn
(3a)
... ~n,-_~+l))
,~,~,
i=2, ... ,n ,
= c * (x * , u) .
n -
(4)
(y ,-u , -~ , .. ,(-~) )
<-1 - a i . .,l ( y
x*
n
c(y
,~)
(5a)
(.n,-~i+l))
,~,i!., ..
i =2, ... , n ,
(5b)
(6)
91
:...*
x.
I
(7b)
i=2, ... , n ,
that achieves compensation of all nonl inear terms and pole assignment
for the resulting I inear homogeneous differential equation
(8a)
- k e
on'
e.
i=2, ... ,n ,
(8b)
= x*
- x"*
The constant gain coefficients k., i=O, ... ,n-l, can always be determined so that system (8) possessks desired eigenvalues in the left half
plane. Thus, the error convergence to zero can be made arbitrarily
fast.
I inear mapping
pres~nted
(n-1) )
,~
G ,
(n-l)
~
which carri.es system (1), (2) into system (5), (6) and the corresponding inverse
x*
-( *
= ~~
,~,~,
(n-l) )
,~
(10)
(n-l)
~
92
u
;!.
(n-1 )
u
-
[~ ]
x * = ~ (:s"g) ,
and
-
.:s, = ~(.:s,
,;!.)
m
E
G,
( 11)
G*
(12)
The nonl inear vector functions v(x,u) and v(x*,G) are assumed to be continuous and partially differentiable with
to U, x and x*,
respectively. The total differentiation of (11) with respect to t and
the subst i tut ion of (1) 1ead to
respect
x. *
a~(.:s,,~)
ax
(.:s,,;:) +
a~(.:s,,~)
aG
(14)
93
By successive substitution of the i-th equation of (14) into the foregoing (i-l)-th equation of (14), i=n, ... ,2, and by appl ication of the
operator
" =
Mk Vi (~,~)
[a
[a
= Mn c(y,~)
n
L
n k
M -
k=1
n-j
(n-j))
-
L
k=1
v. ( x,u,u, ... ,u
J---
(k))
a n - k y,~,~, ... ,~
n- j-k
(16)
a n - k y ,~,~, .
(k))
,~
( 17)
if the relation
( 18)
v (x,u) = x* = c(y u)
n - -
'-
is taken into consideration. Thus, the transformation (11) is completely establ ished by (17) and (18) if the functions c(y,u),
(n-l) )
a 1 ( y,u,u, ... ,u
- -
, ... , a
(.)
n-
1 y,u,u
-
( )
xi+l ,
xn
(19b)
Xl
(20)
.:.
( 19a)
94
which hQs been introduced by Zeitz [2] and is a consistent general ization of the linear observabil ity canonical form. It can be generated
from system (1), (2) by the transformation (cf. [2])
x.
I
i -1
M h (~,~)
i =1 , ... , n ,
[~]
e G
(21)
[ah~~)r
rank
= n
(22)
[0-1 a:(~'~)
aM
is satisfied. Appl ied to the GOCF (3) , (4) this rank condition reads
ac*(x*,u)
nax*
n
0--0
rank
1//
ac*(x*,u)
nax*
n
n .
X--X
95
(18). If this condition is not satisfied the GOCF (5), (6) does not
exi st.
Now let system (1), (2) already be transformed into GOBCF (19), (20).
Then the GCE related to the GOBCF reads
o=
Mn c(y,~)
In
k=l
-n-k
(
(k)
M a n - k y.~.~, ... ,~
.
(23)
t(i,~.~ .... ~~
~Q.
has to
This will be
o = M2 c(y.~)
Mal
(y.~.~)
ao(y,~,~,id)
(24)
o
aal(Y'~'~)
3y
x2
If this condition is fulfi lIed the functions K2(xl ,u), Kl (xl ,u,td and
Ko(xl ,~,i,Q) are known and the three unknown functions c(y,~)~ al(Y,~,~)
and ao(Y,~,~,Q) can be determined from the three partial differential
equations
96
.
f-(-x,u,u,
. ... ,u
(n)) has to
t he structura I con d
Itlons
whIC h t he f unction
satisfy become stronger and stronger [8].
- --
IN ORIGINAL COORDINATES
In section 3 it has been shown how the observer can be designed in the
transformed coordinates. To determine the observer in the original
coordinates the inverse transformation (12) has to be calculated, related to the estimated variables x and x* and differentiated totally
with respect to t:
_(A* A)
a~ ~ ,~ A*
--a-~-*-- x +
--a-G-- ~
yield
;*
by the transformation
97
"
=
X.
- ("* A)
aV i ~ ,~
a~*
,,*
""
,,(n)
ib,y,~,~ )
]
(A ")
~ =~ ~,~
_ ("* ")
aV i ~ ,~
aU
"
fl(~'Y'~'~""~~)
,,(n)
read
= -
(n-i+1))
f.(x,v,u,u,
,_-L__ ... , _u
A"
"A
i{~,y,~,~)
(n-i+1)
v. 1 (~, u, ~, .. , -u
,- ---
) -
(n-i+1))
-a 1 ( y,u,u, ... ,u
+
--
,-
REFERENCES
[1]
[2]
[3]
[4]
[5]
[6]
98
[7]
[8]
Feedback Synthesis
and
Linearization Techniques
Henk Nijmeijer
Department of Applied Mathematics
Twente University of Technol~gy
P.O. Box 217, 7500 AE Enschede
The Netherlands
1. Introduction
During the last decade significant progress has been made in
the theory of input - ouput decoupling of nonlinear control
systems.
In fact
some of
Singh
& Rugh [29], Freund [6] and Sinha [21], where a solu-
tion
of
the
static
state
feedback
noninteracting control
[12]
for an
among others,
took
This,
almost
Hirschorn [8]
ten
years
before
Isidori
et
al.
[9]
this
and
solution of
It
101
M. Fliess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory, 101-119.
1986 by D. Reidel Publishing Company.
102
H. NIJMEIJER
but
the
interested reader may find them in [17,18]. For those readers, who are not too familiar with differential geometry we
refer to [2]. Some of the most important concepts, needed in
the paper are defined in an appendix.
Xo
y( t) = C(x(t
where x = (xl"" ,x n ) are smooth coordinates of a smooth ndimensional manifold M.A.B 1 Bm are smooth vector fields
on M and C : M ~ N is a smooth map from the state space M
into the smooth q-dimensional output manifold N. Moreover
the inputs u() = (u 1 (), ,um(T belong to the space of
admissible controls, i.e. they are such that the solutions
of the dynamical equation in (2.1) are well-defined. Because
we will concentrate in this paper on a local theory for decoupling of (2.1) one may think of (2.1) evolving on an open
neighborhood of Xo in IRn and the output values in an open
neighborhood of C( x O) in IRq. Let us firs t explain what is
meant by decoupling of (2.1). Suppose that the outputs in
C (x(t
p
that is C = (C 1 ' Cp )T and each Yi' i = 1 p a qidimensional vector (qi)l). Such an a priori subdivision naturally appears in various control systems. e.g. in a model
103
input
does
not
E:
pH
affect
j
Besides
this noninterac ting condition we impose an additional constraint in the input-output decoupling, namely that the inputs in the
'control'
the corres-
notion of decoupling,
is the
E:
E:
m.
vector
puts m equals the number of vector outputs p. Then it is required that each of the inputs u i only affects one of the
outputs. After a possible relabeling of the inputs this
means the i-th input only influences the i-th output. Clearly the output Yo does not explicitly depend upon u. DifferJ
entiating yo(jem = p) with respect to t we obtain
J
-
(2.3)
So
YJo(t) = LACo(x(t
J
1=1
LB Co(x(tu1 (t)
1 J
* i,
H. NIJMEIJER
104
(2.4)
LB. C.(x)
for
*i
oC.
dC .(x)
(2.5)
B.(x)
dC.(B.)(x) (where
J
_J(x
ox
Ker dC.(x) ,
* i.
Assuming that the inputs are sufficiently smooth we may differentiate (2.3) once more
(2.6)
..
y. (t)
LALAC(x(t
L [LALB C.(x(t
~=l
~ J
+ LB~LACj(x(tl u~(t) +
m
L LB LB C.(x(tuk(t)u~(t) +
k,~=l
m
L LB
~=l
~ J
C.(x(t~~(t).
~ J
0, LB LB C. (x)
k i J
LALB.Cj(X)
1
0, i
j, k
m.
So (2.6) reduces to
YJ.(t)
(2.8)
= LALA
LB LAC.(x(tu~(t)
~=l
+ LALB C.(x(tu.(t)+
j J
J
+ LB C.(x(t~.(t)
j
and for j
C(x(t
LL
~=l B~
LB C.(x(tu.(t)u~(t)
j J
J
(2.9)
Now by (2.7) and (2.9) we see
0, i
j.
105
(2.10)
L[B
(2.11)
[A,B.] (x)
[Bk,B.J(x)
n Ker dC .(x)
J
JFi
n
j*i
KerdC .Cx)
J
n KerdC.(x) ,
j*i
* i.
more and more awkward. Fortunately there is much more concise formulation.
For
the
x1 ,X 2
E VeRn)
o
1
define ad X X2 = X2 ' ad X X2 = [X 1 'X 2 ] and recursively
k
1
k-1
1
X2 ], k = 2,3, Associated
ad x X2 = [X 1 ,ad X
with
the
1
1
system (2.1) we define the following Lie algebras, see also
[ 19],
(2.13) Loi
and
the
Lie
denoted
by Loi,i E m. Note
that
by
which will be
definition Loi is
the
closed
[L 01.,L0 1 c
by
Loi
taking
the
Lie
product
with
Lo'
i.e.
106
H. NIJMEIJER
X E IRn
R (X)
Span{L (X)}
Ri (X)
Span{L 01.(x)}
i E
~,
X E IRn
Ri (X)
Span{L01.(x)}
i E
~,
X E IRn
(Z.14)
Note that the distributions Ro,R i and Ri , and i E mare inthey are closed under taking Lie brackets,
volutive i.e.
e.g.
i f X1 ,X Z E Ro then [X 1 ,X Z]
smooth
Ro for
vector
Jt1
(Rn.
m.
Span{L .(x)} , i
01
01
p+1
--
p+1,
X E
--
IRn
(Z.15)
becomes
(Z.19) R.(x)
1
n Ker dC.(x),
jfi
J
p,
IRn
and
(Z.ZO) RP+1 (x) c ~ Ker dC .(x),
j=1
J
Here (Z.ZO) means that the inputs in the (P+1)fh block have
no effect on one of the outputs.
107
also
y j'
want
E.r..
that
the
accessibility
dim RO(x)
n,
(3.1) is
strongly accessible if
the
= A(x),
x(O)
= x O).
two
different
means that
oC
rank ox (x)
(ii)
outputs
coincide. Mathematically
oe 1
oC
15'X (x) + .. + rank ~ (x),
this
x E Rn.
noninteracting
Xo
rank
therebye,
is chosen.
H. NIJMEIJER ,
108
p+ 1 , j
E!., x
Ri(x),
iRu
This
This raises
the question is it possible to obtain noninteracting by adding control loops to the original system. The control schemes we will allow for are static state feedbacks,
Le. an
(3.1)
u = a(x)
where
~(x)v
are
~(x)
= (~i/x
~(t)
A(x(t
A(x)
A(x)
L B.(x(tvi(t)
i=1 ~
where
(3.3)
L B.(x)~ .. (x).
j= 1 J
J~
Af
109
in (3.1) such
that the new dynamics (3.2) with outputs (2.2) are inputoutput decoupled (except for the earlier mentioned scalar
input scalar output case with m
is
(3.4b)
[B j ,Ri ] (x)
~f
~f
~f
R. (x), i E p+l,
Ri(x),. i E p+l, j E ~,
~f
E IRn
x E IRn
~f
[A,Ri ](x)
(3.5)
i E p+l
~f
[B j ,Ri ](x)
x E IRn
~f
iEp+l, j E ~, X E IRn
But
exactly
(3.5)
means
that
way
regular
defined
controllability
these
the
Af
distributions R.l are
distributions,
cf.
[11,
are
14].
also
But
these two observations open the way for tackling the problem
from the other side.
First recall the following results. Suppose that D is an
involutive distribution on IRn which is locally controlled
invariant for the system (2.1). So
110
H. NIJMEIJER
IR n
(3.6) {
Xo
Xo
(3.7) {
X E
A and
~. are as
1
E!!),
in (3.3),
0,
x EO,
controlled
invariant
distribution
D* c K, i.e. D*
satisfying
this
property.
Note
that at
this
is assumed. However
borhood
(3.8)
Then R* is
[11,14]
local
controllability
Furthermore R* c
contra11ibi1ity
D* c K is
distribution
in
K.
distribution,
see
the
supremal
local
Note
that R* automa-
111
to a
candidate
(3.9)
jH
E, KP+l
P
n Ker dC j ,
j=1
and
p+l,
--
local
E
controllability
p+l (here
constant
distributions
dimension
in
assumptions
Ri'
*
Ri* + Ker dC i
.
T IR n ,~E~"
hold,
then
the
invariant
a.(x) +
by
an
appropriate feedback u
~.(x)v,
~
[17, 18] (note that the same problem for linear systems was
treated in the early severities in [24, 12] and that these
papers serve as a motivation for the results of [17]). Our
noninteracting condition is as follows
Span{B 1 , ,Bm} n R7+ +
(3.10)
*
+ Span{B1 , ,Bm} n Rp
c~,
112
H. NIJMEIJER
all
the
distributions
R~.
rJ
R~ and (
rJ
R~)
rJ
Then
the
input-output
decoupling
problem
is
locally
The output
~ R~
i=l
contains
~L Ro*
i=1
TR.
j'i'i
R~
J
R: + Ker dC i
T~n. i E .E.
(3.13)
(iii) Because in principal the distributions Di* (or R*i ) are
directly computable from the system parame~ers A,
B1 , ,B m,C 1 , ,C p ' see [8,9,13] our noninteracting
condition (3.13) is rather easy verifyable (this in
contrast with the nonconstructive solution of the pro-
113
(so
m=p,
dim y. = 1, i
m) there
are
other
the
nonlinear
Falb-Wolovich
condition,
For
the
of
with
proof
ours
the
together
equivalence
with
other
of
see
this
[4,6,21)
condition
characterizations
we
ref er to [5,9).
(v)
diffeomorphism
(and
therefore
As
in
the
strong
linear
link
decoupling
between
the
theory
there
nonlinear
exists
input-output
in
the
fact
E..
that
the noninteracting
L R~
jE I
condition
is what is the
114
H. NIJMEIJER
canonical forms,
the facts
output
grouping
(2.2)
is
input-output
decoupled.
To
(4.1 )
E ~
span{~
*
*
R1 + R2
= span{~
(4.2)
each x,, i
~
span{;-}
x3
-3,
xl
0
o~
0
, ox '
2
o~ 3}
decoupled system as
(4.3)
and
1,2
{ ,:~].
(4.5)
Using
[A,R.]
c R.*~
[B j
c R.*
~
(4.4)
and
(4.5)
(4.6)
115
i = 1,2, j = 1,2.
one
easily
verifies
the
"ideal
so
0
R* = Span {-0- , ~--}
2
oX 2 oX 3
(4.2) - (4.6),
Combining
together
with
and
(4.7)
x2
A2 (x 2 ) + B2 (x 2 )u 2
x3
A3 (xl ,x2 ,x 3 ) +
Yl
Y2
C(X l )
i~lB3i(Xl'X2'X3)ui
C(X 2 )
"-
>2
the following
(4.8)
x
m
Am(xm) + Bm(xm)um
xm+l
Yl
Am+l(xl,,xm+l) + L Bi(xl,,xm+l)u i
i=l
Cl(x l )
Ym
. Cm(xm)
116
H. NIJMEIJER
5. Concluding remarks
In the previous sections under appropriate (constant dimension)
assumptions
the
solution
of
the
local
input-output
It is possible to relax the constant dimension assumption? In particular for analytic systems this seems an
attractive question, see for both topics [3].
(iii) Is
it
possible
to develop a
decoupling where
feedbacks
are
it
is
no
similar theory
longer
nonsingular
(i.e.
required
the
for
the
that
the
matrix
~(.)
in
If
the
static
problem is
not
state
feedback noninteracting
solvable,
try
to
find
control
other control
Appendix
Although all our definitions given here can be given on an
arbitrary manifold we will only do it on ]Rn. Given ]Rn,
dis tri bution on ]Rn is
point x
i.e. D(x)
a mapping
D which assigns
to each
smooth if
locally we
can find
smooth vector
117
oY
= ~x)
oX
E D. An important
distribution
coordinates
(x 1 x n )
where
in
this
on
such
coordinate
Rn
there
that D
system
locally
exist
= Span{~ ~}
Xl
Xi
(x 1 x n ) + x 1 x n ).(0 1.0 0)) (1 in the i-th
position). Finally for a smooth vector field X and a smooth
References
[I]
G. Basile lie G. Marro. "A state space approach to noninteracting controls". Ricerche di Automatica
1...
pp.
68-77. 1970.
[2]
W.M. Boothby. An introduction to differentiable manifolds and Riemannian geometry. Academic Press. New
York. 1975.
[3]
C.l. Byrnes. "Towards a global theory of {f.g} invariant distributions with singularities". Proceedings of the MTNS Conference. Beer-Sheva (1983). Mathematical Theory of Networks and Systems. P.A. Fuhrmann
(ed.) LNClS 58. Springer New York. 1984. pp. 149-165.
118
[4]
H. NIJMEIJER
[5]
.J. Descusse & C.H. Hoog, "Decoupling with dynamic compensation for strong invertible affine nonlinear systems", to appear in Int. J. Contr. 1985.
[6]
[7]
1.!.,
[8]
[9]
..!2.,
[10]
[11]
A.J. Krener & A. Isidori, "ad(f,G)-invariant and controllability distributions" in Feedback contro1 of
1inear and nonlinear systems
LNCIS~,
pp. 157-164,
1982.
[12]
[l3]
[14]
H. Nijmeijer, "'Invertibility of affine nonlinear control systems: a geometric approach", Syst. Contr.
Lett.,1.. pp. 163-168, 1982.
[16]
119
[17]
[19]
[20]
1-,
S.N. Singh & W.J. Rugh, "Decoupling in a class of nonlinear systems by state variable feedback", J. Dyn.
Syst., Meas. & Contr. pp. 323-329, dec. 1972.
[21]
1~,
pp. 95-116,
1972.
[23]
[24]
W.M. Wonham & A.S. Morse, "Decoupling and pole assignment in linear multivariable systems : a geometric
approach", SIAM J. Contr.
Optim.,~.
A. Isidori
Dipartimento di Informatica e Sistemistica
Universita di Roma "La Sapienza"
Via Eudossiana 18, 00184 Rome (Italy)
1. INTRODUCTION
In this paper we consider control systems described by differential
equations of the form
(lola)
f(x) + g(x)u
(LIb)
h(x)
= a(x) + S(x)v
' (x)
h(x)
tV
+ g(x)v
with
tV
f(x)
f(x) + g(x)a(x)
121
M. Fliess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory, 121-145.
1986 by D. Reidel Publishing Company.
122
A. !SIDOR!
'"g(X)
g(X)S(X)
(1. 3)
a(~.x) + b(~.x)v
c(~.x) + d(~.x)v
123
~l(x) = {w:
(w,v)
= 0,
Vv
~(x)}
(~
aX l
~~
where the superscript "T" denotes transposition. By means of this operation, it is possible to construct, from any codistribution n, a new codistribution denoted LTn and defined by
With any system of the form (1.1) one may associate a sequence of
codistributions defined in the following way
(2.la)
(2.lb)
no(x)
= span{dhl(x), ,dh~(x)}
where G(x)
= span{gl(x), ,g
g~
(~_~
ct(x)
(i)
(ii)
the dimension of
~(x)
124
A. !SIDOR!
(iii)
(~
the dimension of
*<
have seen, implies the convergence of the algorithm (2.1), in a neighborhood of xO, in a finite number of stages. The codistribution Qk*
will be sometimes denoted by the simpler symbol Q* and its annihilator
by
Ql
~(x)
=0
~(x)
~(x)g(x)
Clearly
de~ote
~(x)
there exists an (sk-rk)xsk matrix Yk(x), of full rank, whose entries are
still analytic functions, and such that
Yk(x)~(x) =
~(x)
125
yk(x)Mk(x)
The computation of Qk+l now requires that of
Lf(~
~)
and
will
~(x)
span{(LT(Y~(x): I 2 i 2 sk-rk}
(Lf(Y~(x)
i\+l(x)
(Lg
(Y~
-r Mk(x)
k
(L g (Y; -r Mk(x)
m
(L 2
'"11J.
(Y~
-r Mk(x)
k
126
A. ISIDORI
~+l
of ~+l(x).
The recursive use of the above procedure makes it possible to construct matrices MO(x),M l (x), .. whose rows span, at each x in a neighborhood of xO, the codistributions ~O(x)'~l(x) .
The maximal controlled invariant distribution algorithm is mostly
used in order to compute the distribution
(the annihilator of
~k*)
and this distribution plays an important role in the problems that will
be described in sections 6 and 7. However. there are also problems,
like the one discussed in section 3 (and again that described in section
6) in which one is also interested in keeping track of some data, or
facts, related to each of the intermediate steps of the algorithm.
As a matter of fact, it will be seen that the integers rO,r l .
which represent the ranks of the matrices (2.2) and that, in more geometric terms, may be characterized like dimensions of quotient spaces.
as
~k(x)
(2.3)
rk
dim -~-(-x.:;c)-n-(f'r'(-x-)
(2.4)
1.1 ,1
(r o times)
2,2, .. ,2
(rl-r O times)
k * +l,k * +l, ,k * +1
is said to characterize the zero structure at infinity (of
functions matrix) of a linear system.
The idea of associating a formal zero structure at the
for a nonlinear system was introduced in [41 (see also [5])
special class of nonlinear systems. Subsequently. in [6] it
a transfer
infinity
for a
was proposed
127
to take, for any nonlinear system of the form (1.1), directly the integers (2.3) as a basis for a formal definition of zero structure at
the infinity, and thus to assume exactly the table (2.4) as a formal
characterization of such a structure.
Even though the integers thus considered do not share all the
properties that one usually associates with the notion of zero structure
at the infinity in a linear system (see section 4) there are cases in
which they make it possible to characterize in simple terms the solvability of a synthesis problem. One of these cases will be discussed in
the next section.
3. MATCHING A PRESCRIBED LINEAR INPUT-OUTPUT BEHAVIOR
In this section we shall see how dynamic state-feedback can be used in
order to match a prescribed linear input-output behavior. We recall,
from section 1, that the composition of a control system of the form
(1.1) with a dynamical state-feedback compensator of the form (1.3)
defines a ~ew dynamical system with inputs vl""'v~, outputs Yl""'y~
and state X = (I;,x), described by equations which have the same structure as that of (1.1), i.e. are of the form
(3.la)
f(x)
(3.lb)
h(x)
g(x)v
with
f(l;,x)
h(l;,x)
a(l;,x)
b (I; ,x)
[
f(x)+g(x)c(l;,x)
g(x)d(l;,x)
= h(x)
(3.2a)
= Az
(3.2b)
Cz
+ Bv
(3.3)
y(t)
Q(t,(I;,xO
J CeA(t-T)BV(T)dT
o
A.ISIDORI
128
= f(x)
= Az
= hex) - Cz
+ g(x)u
+ Bv
:i:E = fE(x E)
e
letting x
(x,z) , u
E
f (x,z)
h (x,z)
E E
= h E (x E)
E
= (u,v) and
[ f(x)
Az
+ g (x )u
E
g (x,z)
g(x)
hex) - Cz
Moreover, one may compute for this system a sequence of structural indexes of the form (2.3), namely
E
dim
where
(3.4a)
(3.4b)
Qk(x)
Q~(x) () GEl-(x)
129
where Q*
1 < i < m
E
.E*] C Ll.E*
[f E + ~
L g.a.,
Ll
i=l
(ii)
~ ~
E ..
g.y
J
J~
E t:,.
E*
At; + Bv
= ~(x,t;)
+ y(x,t;)v
l<i2,.ll
A.ISIDORI
130
(P2)
min(~,m).
The integer
rO + 2(r l -rO)+" .+(k*+l) (r k * - rk*-l)
(i.e. the total number of zeros at infinity, counted with their
own multiplicity) is bounded by n-dim(~*).
(P3)
If
= rk*
n-dim(~
*).
= ~.
(P4)
(PS)
(P6)
= m.
hI (x)
Yl
Y2
h 2 (x)
131
x4
f(x)=
,gl(x)
hI (x)
xl
h 2 (x)
x 2-x 3
, g2 (x)
x3
0
o
o
o
(x) = sp
no n <f(x)
(1
OJ}
-1
0)
sp{(x3
-1
(0
n l (x) 2 no(x)+sp
no(x)+sp
O)}
{ Lf(x,
L (x3
gl
-1
0)
-1
0)
(x 3 0
-1
'
1
J
1) } T* lR4
x
0)
4
*
and /). (x) = 0 for all xElR.
(f
= 1
3
and therefore
rl - r 0 = 2
132
A. ISIDORI
the total number of zeros at the infinity - counted with mUltiplicity is 5. This shows that both (PI) and (P2) do not hold for the system
(4.3)
ul
Xl
x2
x4
.
x3
x3 u l
x4
u3
and therefore
= O.
xl(O) +
f Ul(T)dT
o
x 2 (O) + x 4 (O)t +
t
x 3 (o)exp
Thus,
Cf
f f U3 (T)dTdCf
o
(f UI(T)dT)
o
xl(O) +
J Ul(T)dT
o
x 2 (0) + x 4 (0)t +
f f U (T)dTdCf-X
Cf
3 (0)exp
(I
Ul(T)dT)
Xl (0)
Y2(0)
x 2 (0)
:1 2 (0)
133
Xs
Xs
u1
x=
f(x)+gl(x)~1+g2(x)u2+g3(x)u3
with x E lR S , and
f(x)=
Xs
x4
gl (x)
x3 x S
g2(x)
g3(x)
Now
"o(x)=
,p{
0 n <f-(x)
OJ}
(0
-1
0)
O(x)
"O(x) + SP{
l(x)
01 n J(x)
Thus k
(1
(0
(0
-x S 1
1
-X 3
)J
(x)
1 and
and the system is said to have two (formal) zeros at infinity of order
2. Moreover
-* > d~m
. /).*
dim /).
134
A. ISIDORI
true for k
dim(QO n
rank(~)
2 m, because
~*
~,
we have
rk
(5.2b)
- r
(5.2c)
k
k
sk - sk-l + r k - l
< s
0-
k > 1
- ro
k > i > 1
The first comes directly from the definitions and the second from (PI).
The third one follows from (5.2a) and repeated use of the inequality
si+l - si
(si - si_l)-(r i - r i - l )
i > 1
which depends from the fact that the new rows of M'+ l are at most
~
(s i- s i-I ) - (r i- r i-I ) Summing all (5.2) together, for k = k , one obtains
135
i.e. (P2).
The proof of (P3) relies upon similar arguments. The proof of (P6)
may be found in [ 7] 0
From this, we see that a nonlinear system in which the assumption
(AI) is satisfied, from the point of view of its zero structure at the
infinity, behaves very much like a linear one. We even conjecture that
under this assumption not only (Pl),(P2),(P3) and (P6), as shown, but
also (P4),(P5) and Proposition (4.2) might hold. As a matter of fact,
in the next section we will illustrate a special situation in which the
latter is true.
6. EXACT LINEARIZATION VIA DYNAMIC STATE-FEEDBACK
We return now to the synthesis of dynamic state-feedback control laws
and we show that appropriate use of the assumption (AI) makes it possible to set up a new interesting design procedure.
To this end, we recall that (see e.g. [1]), if a system can be
decoupled via static state-feedback and if, in addition,
(6.1)
IJ.
*=0
Pi
L h.(x)
g. f ~
h. (x)
~
a(x)
_A-l (x)b(x)
(6.2b)
S(x)
A-I (x)
136
A. ISIDORI
the feedback u = a(x)+S(x)v with a(x) and S(x) defined as in (6.2) fully
linearizes (in suitable local coordinates around XC) the state space
equations. As a matter of fact, a standard computation shows that the
system
(6.3a)
f(x)+g(x)a(x)+g(x)S(x)v
(6.3b)
h(x)
z.
A.z. + b.v.
(6.4b)
y.
c.z.
1.
1.
1. 1.
1.
1.
1 < i < m
1. 1.
where
0
b.
A.
1.
c.1.
o
o
1.
[1
0 ... 0]
and
Pi
1.
1.
1.
2. Moreover, let
(AI)
i=l
(L
gi
(AZ)
t:,
137'
*=0
Then, the system has exactly Z (formal) zeros at the infinity, of order
111 .:::. 11 Z and
(6.6)
Moreover, there exists a feedback u = a(x)+S(x)w, with a and S defined
in a neighborhood of xo, such that
(6.7a)
x=
(6.7b)
y = hex)
(f+ga)(x) + (gS)(x)w
(6.8a)
where
(6.8b)
t;..
h
Li - 1
(f+ga) j1
(6.8c)
n1
Li - 1
h
(f+ga) jz
t1
(6.9a)
(6.9b)
t;.Z
138
A.ISIDORI
(6.9c)
and that
When k
= PI'
nk
np
PI
PI
k_
_l(x)+sp{dL f hl}+sp{dL f hZ, .. ,dLiflz}
and
r
r
PI
Pz
Pl-l
p2-l
Pz
0
1
Pz
~Z =
~l
= Pl+l
and one
Pz
(by (A2, then (6.6) is true. Under feedback u = a(x)+S(x)w with a(x)
and S(x) defined as in (6.5) and the diffeomorphism (6.8) the system
takes the normal form (6.4), which is a special case of (6.9).
If the matrix A(x) is singular, the proof is less trivial, and
relies upon an idea suggested by Krener (see [81, pp. 539-540).
139
'"f(x)
f(x) + g(x)a(x)
fu~
np2 +k - l
P 2+k
+ SP{dL}
P~
expressions of the
h 2}
= k*+l,
nk *
140
A.ISIDORI
of orders
~l
and
~Z
*n
=0
(which is true
are
(6.10b)
Sex)
(6.l1a)
zll -ll -1
(6.l1b)
141
f c;~)
(6.l2a)
(6.12b)
g6~);
he;)
ll/~
still with two inputs and outputs, which evolves on an open subset of
This new system is one in which the input-output decoupling problem is
solvable (i.e. the decoupling matrix, now denoted A(x), is nonsingular)
and the condition corresponding to (6.1) (now written ~* = 0) hold.
Thus the system in question can be turned into a linear system, via
state-space feedback and local diffeomorphism.
In order to check these claims, one may take advantage of the
special structure of the canonical form (6.9), which is part of (6.12).
Setting
= (';1'",,';
,zl""'z
,nl, ... ,n) we have.
-T -
f (x)
-T gl(x)
~l
(';2""'';
g2(x)
~2-~1
,0,n 2 ,
h.
(x)
';1
h.
(x) =
nl
J2
,zl""'z
~2
-T Jl
~l
~2-~1
L- L_ h. (x) =
gj f ~
~2-l,
all i,j
~
L- Lgl f
~
L- Lgl f
Thus
XC;)
-1
2-1 h
= 1,2,
L- L_2
g2 f
L- L_2
g2 f
-1
-1
~*
is
142
A.ISIDORI
(6;13) Theorem. Suppose ~ = m = 2, dim G = 2 and let (Al) and (A2) hold.
Design a dynamic state-feedback in the following way:
(i)
(ii)
and let w
].1 -1
L- L-
gl f
- h (x)
1
L_ hl (x)
f
].12_
h2 (x)
Lf
143
(7.1) Theorem. If a system is left-invertible, the scalar-input-scalaroutput noninteracting control problem is solvable. 0
We do not insist on the proof of this result, that may be found in
[9], but rather describe the constructive algorithm on which that proof
is based.
cedure.
(i)
From the given data (f,g,h), compute the decoupling matrix A(x).
(ii)
(iii) If the deco~plin~ matrix has a rank p < m, find a square and nonsingular matrix Sex) of analytic functions defined in a neighborhood of xc, such that
A(x)
A(x)S(x)
* *
1
A(x)
* *
*
*
*
* *
Let jl, ... ,jq denote the indexes of the columns of A(x) in which
at least two entries are not identically zero. Note that q > 0
because by construction all rows of A(x) are not identically zero.
144
(iv)
A. ISIDORI
S.(x)v.
this feedback with the original system (4.5) defines a new control
system with state
= (xl, ... ,xn,zl, . ,Zq) and input (vl, ,vm),
(v)
= f(}c)
h(x)
+ g(x)v
REFERENCES
[1)
A. Isidori, A.J. Krener, C. Gori Giorgi, S. Monaco, Nonlinear decoupling via feedback: a differential geometric approach, LEEE
Trans. on Automatic Control, AC-26 (1981), pp. 331-345.
----
[2)
[3]
C. Commau1t, J.M. Dion, Structure at infinity of linear mu1tivariab1e systems: a geometric approach, Decision and Control Conference (San Diego, 1981).
[4]
[5]
[6]
H. Nijmeijer, J. Schumacher, Zeros at infinity for affine nonlinear control systems, IEEE Trans. an Automatic Control, AC-30
(1985), to appear.
[7]
[8]
145
controllability distributions, SIAM Journal on Control and Optimization, Vol. 23, (1985), pp. 523-549.
[9]
M. D. Di Benedetto
Dipartimento di Informatica e Sistemistica
Universita di Roma "La Sapienza"
Via Eudossiana 18, 00184 Rome (Italy)
1. INTRODUCTION
Consider a nonlinear system of the form
x
f(x) +
(1.1)
g.(x)u.
i=l
hex)
with state x E X eRn, input u ERm and output y E R P ; f and gl, ... ,gm
are analytic vector fields on X and h is an analytic function.
This system will be referred to in the sequel as (f,g,h), where g
is the nXm-matrix whose columns are gl,g2, ... ,gm'
It is well-known that with the triplet (f,g,h) one may associate
the "invariant subdistribution algorithm", denoted ISA [ 1], (or "maximal controlled invariant distribution algorithm"), which defines a
sequence of codistributions as follows [2]:
dh
(1. 2)
where dh
+ iL Lgi (r2k- l n
cf)
rk
. (
'
d1m
"k"- )d1m(r2
k n
I)
k > 0
M. Fliess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory, 147-157.
1986 by D. Reidel Publishing Company.
148
M. D. DI BENEDETTO
~*
(1. 4)
[ (gS).
, ~]
C ~
l, ... ,m
distributions which are contained in (dh)l and satisfy ~1.4), but with
a matrix S which might be singular. A distribution ~ belonging to this
class is said to be locally controlled invariant with singular control. Clearly, we have
1
li (f,g,(dh) )
s
1
I(f,g,(dh) )
and algorithm (1.2) does not always converge to the maximal element of
I (f,g,(dh)l), if it exists. This class is relevant because some syn-
t~esis
f 0
= dh
fk
= f k- l
Lf(f k _ l n
~)
~k
149
for all k ~ O.
The class of systems for which (AI) is satisfied is indeed significant in that it includes a large share of the systems of interest.
Firstly, it contains systems which can be input-output decoupled via
static state-feedback. It also contains a more general class of systems,
those which can be made linear, via static state-feedback, from an
input-output point of view ([6] ,[7]). Moreover, if A* = 0, systems
which satisfy (AI) can be made diffeomorphic to linear systems via
dynamic compensation ([8]).
This paper has a twofold objective. First, to prove that, under
the assumption (AI) introduced in [8], the maximal elements of the two
classes I(f,g,(dh)l) and I (f,g,(dh)i) coincide. Second, to introduce
s
a new assumption (A2), weaker than (AI), under which the structure at
infinity of (1.1) may be computed by means of the simplified ISA. It is
reasonable to forsee that such assumption leads to a number of further
types of simplifications.
2. CONTROLLED INVARIANCE WITH SINGULAR CONTROL
In this section, we will show that, under assumption (AI), the maximal
elementsof the two classes of distributions I(f,g,(dh)l) and
Is(f,g,(dh)l) coincide.
Proof. By definition
Suppose
A~ax
(Qk)l. Let
~.
one-form in Qk n
L
W
giai k
Therefore,
(L
gi
Lg.a.~
~
A~ax
and
(since
~)a.~
(~,g.~
E Lg. (Qk n
~
~)
)da.
(L
gi
~)a.~
~)
150
M. D. DI BENEDETTO
and
LL
w..Erl
. 1 g.a. k
1.=
T E
Moreover,
1.
1.
~a
max
(by hypothesis)
c (rl )1 . Thus
k
i~l( Lgiai~
>
= ( L f +gawk
Lf + ga
>
T> -
(wk '
[f+ga, Tl >
(wk '
(2 .. 2)
T>
(since
(w k '
T>
0)
By definition of ~a
max
[f+ga ,
1 E ~amax
Therefore
(~
, [f+ga , Tl >
T >
Proof. Let
*=
max I
I
(f,g,(dh)-)
be an element of I
I
(f,g,(dh)-). Then
151
*
1
t. E Is(f,g,(dh) ).
I(f,g,(dh)l), we
k > 0
We will show in this section that (3.1) holds under a weaker assumption than (AI). Before stating this assumption, we need to introduce a recursive procedure which defines a sequence of codistributions of interest.
With each codistribution Q. defined by the algorithm (1.2) we
1
(3.3)
Q.1
and
in
Q?,
1
Le.
S.1o C G1 .
~.
Choose, in
~,
a direct summand
S? of
1
M. D. DI BENEDETTO
152
Step j
1, ... ,k*-i-l)
(j
Set
n.l.+J.
(3.4)
+ LfSJl.:- l
n~l. c
(3.5)
n.+.
l. J
~,
Choose, in
a direct summand
in
S~l. of n.+.
l. J
dl.
i.e.
(3.6)
and
s~l. c ~.
l.
codistributions S~ are not unique. Nevertheless, one can prove that the
l.
-j
l.
l.
with S., S.
C (j
S~l.
n.+. $
l.J
S~l.
+S l...
j-j
and S.
l.
I
(j
and differentiating
Therefore
den
..
l.
l.+J
d-
j < k -i-I. 0
along
153
defined.
We will now show that the assumption (A2) is feedback invariant.
.
tV
tV
.
More prec~sely, let us set f := f+ga and g := gS where a ~s an m-vector
of smooth functions and S an mXm invertible matrix of smooth functions.
Consider now the sequence of codistributions defined by the P~ocedure
3.2 where f and g in (3.3) and (3.4) are replaced by } and g. Let
n~ = ~~
~
mod.if-
Proof. Let us first note that, since S~~ C ~ for 0 .:. j < k *-i-l
S~~ c
(3.9)
n .. 1
~+J+
for
Therefore
Since
S~+l
c n..
+ LfS~ , (3.9) holds for j+l.
~
~+J+l
L
(3.10)
and differentiating
~)
~) = n.
?t? = n?
~
+ LtV(n. n
g ~
~)
154
M. D. DI BENEDETTO
s? = S?
1
I
By (3.10), L s. C L (Il. 1 n G').
Then
g 1
g 1+
?l~1
J)
for 1 < j
Il~1
S?1+1
that
inc~uction,
n~ C Il~
(3.11)
+ { Sj-k
k=l i+k
k *-i-I.
..
?t~ =Il . . 6l ~~
1
1+J
'k
I S~-k
k~l 1+
Il~ +
1
j.
ell . . + S~ +
1+J
ho1~s
for j. By
j
'k
+ I S~-k
k=O 1+
~.
S~ C
(3.12)
~j+1
. k
I
S~- C G'
k=O
1+k
which, by Lemma
choice of ~~ We have
1
':I.-j+1 _
~j
H
- Il. . +1 + L'\iS.
1
1+J
f 1
~j
':I.-j
Il. . 1 + LfS. + L .s.
1+J+
1
g 1
j
j-k j
j-k
Il. . 1+ I LfS. k + I L S. k
1+J+ k=O
1+ k=O g 1+
(by (3.12.
By (3.10),
j
j-k
I
\ L S. k C L (Il . . 1 n G') C Il. '+1
k~O g 1+
g 1+J+
1+J
and, by (3.4),
L S j-k C nj-k+1
f i+k
"i+k
ll i +j +1 6l Si+j+1
n
6l Sj-k+1
"i+j+1
i+k
155
~~+l
c~ 1+J+
.. 1
1
i S~-kk+l
+ Lf S J1: +
k=l 1+
S?1+J+. 1 c ~~+l
+jIls~-kk+l
1
k=l 1+
(by (3.4
'\
j-k
L Si+k
k=l
the (3.11) imply that
o
~. =
1
0, since
"'0
~ .
1
~~1
~~1
if
1
G . 0
Therefore,
.
~~ +
1
",.
c- = ~~1
(3.13) Remark. Lemma 3.8 allows to say that assumption (A2) is invariant for system (1.1) under static state-feedback transformations. 0
We can now prove that, for a system of the form (1.1) which satisfies (A2), the structure at infinity may be computed by means of the
simplified ISA (see 1.5.In fact,the following result may be stated.
= dim r
- dim(f
r1k
mod.
for k < k .
At step 1 of algorithm (1.2), one can write, using (3.6)
~l
I
0
~O+ Lf(~On~) + So
r 0 + Lf(r 0 n
fl + rl
fl
d-)
1
mod. G
S~
(since ~O = fO)
(with fl = SO C G1 )
156
M. D. DI BENEDEITO
r.1. + ri
Q.
1.
JI.=O
r Sf G1
j=O
mod. G1
r.
1.
Then
= Q.1. + Lf(Q.1. n
= Q.1. + Lf(r.1. n
Since Qm
s?1.
~)
ct)
+ L ri +
f
s?1.
S j+l
.
1.
for
= r i+l
i
r i +1
i-JI.
r r Si c ct.
JI.=O j=O
with
Qi +1
s?1.
= r i +1
Thus
mod.
ct
k , we obtain
ct)
+ J) - dim ct
r k = dim Qk =
dim(~
dim(~
. (r k + G1) - dl.m
. G1
= d1.m
= dim r k - dim(r k n
cf).
to
157
(3.16) Remark.
REFERENCES
[1]
[2]
[3]
H. Nijmeijer, J.M. Schumacher: Zeros at Infinity for Affine Nonlinear Control Systems. IEEE Trans. Automat. Contr., Vol. AC-30,
566-573, (1985).
[4]
[5]
[6]
[7]
[8]
Christopher I. Byrnes*
Dept. of Electri cal and
Computer Engineering
Dept. of Mathematics
Ari zona State Uni versi ty
Tempe, AZ 85287
USA
1.
Introduction
Despite (or perhaps because of) the extreme complexity of the most
general nonlinear control systems, there has been considerable recent
interest in the control of certain nonlinear systems by a relati vely
straightforward application of techniques analogous to those used in
linear control theory (e.g., [5J-[9J, [15J). Among the most elegant of
these recent methods is the theory of linearization via nonlinear
feedback developed by Hunt-Su-Meyer [7J (see also [2J, [8J, [17J) and
vastly extended in a spate of more recent contributions. Finding
application, for example, in the design of helicopter autopilots [7J and
the control of robot arms [18J, linearization techniques owe their
popularity to their advantage on the one hand of being conceptually
appealing and on the other hand of allowing for the application of
classical control intuition in a nonlinear context. It should be
stressed that the apparent ubiquity of applications is not due to the
genericity (even inIR 2 --see [1J, [3J) of linearizable systems, but
rather to the way nonlinear systems are actually designed. One can be
sure that, in practice, nonlinear control systems are not desi gned so
that certain distributions are involutive. Rather, systems are often
designed so that there is an independent control for essentially
everything in the system that moves. Not only does this principle
account for the applicability of linearization techniques (at least
locally), but also hints at a rather broad extension of these techniques'
retaining two important advantages: conceptual appeal and retention of
classical control intuition. Explicitly, it is perhaps reasonable to
assume that many systems of interest will have a stable inverse or, if
one prefers, that the system is "minimum phase."
M. Fliess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory; 159-179.
1986 by D. Reidel Publishing Company.
160
2.
161
In this section we lay down sane of the basic framework underlying our
frequency domai n theory of nonl inear systems. Expl ici tl y, we have two
goals in mind. First, we want to formulate a "frequency domain package"
for nonlinear systems, containing the analogues of frequency domain
notions such as system zeroes, minimum phase systems, relati ve degree,
etc., in terms of which we can formulate a design philosophy similar in
breadth to classical control theory. Second, in order to analyze the
closed-loop systems resulting from such designs, we need an analytic
theory similar in scope i f not in simplici ty to the well-known graphi cal
methods of classical control.
Our principal techni cal resul ts yi el d a state-space version of
root-locus theory and are based on asymptotic results which form a blend
of singular perturbation results and the method of Newton-Puiseux
series. That singular perturbations were related to the analysis of
high gain feedback systems (by setting e: = 11k) was pointed out in [21 J.
In this section we want to pursue this relationship further, so as to be
able to capture more of the root-locus design philosophy (e.g., lead-lag
compensation) than just the root-locus plot itself. In this way, we are
able to apply root-locus methods directly in the nonlinear case.
2.1
(2.1.1a)
hex)
(2.1.1b)
f(x) - kh(x)g(x)
(2.1.1)'
to (2.1.1).
x
y
-ky , k
ell-
(2.1.2)
Ax
cx
bu
(2.1.3a)
(2. , 3b)
162
c(sI-A)
-1
b = 0
Re(s) < 0
(2.1.4)
(2.1.5)
then the feedback strategy (2.1.2) for k> > 0 renders (2.1.1)'
asymptotical1y stable. In state-space form, the assertion is that the
matrix A - kbc is "stable" for k > > 0, provided (2.1.4) and (2.1.5)
hold.
Our goal is to construct an asymptotiC theory of systems (2.1.1)'
powerful enough to contain al1 the standard assertions, including the
assertion above, whi ch one deri ves for 1 inear systems via root-locus
methods. Of course, even in the linear case we want to derive these
assertions independent of direct appeal to root-locus techniques
(compare [21J). To this end, note that, by setting = 11k, (2.1.1)' is
a special case of a nonlinear, singularly perturbed system
~
= f(z,y) , f(O,O) = 0
(2.1 . 6a)
(2.1.6b)
Setting
g(z,y,O)
(2.1.7)
for roots
y
y(z)
(2.1.8)
=
y(zo)'
163
dy
dt
(
g Z 0' y, 0 ,
h
Tychonov's Theorem
(cf.
(2.1.10)
t/ , YO: Yo
[10J)
asserts that,
provided Z :
is
(2.1.11)
-y
-tIE Yo+O()
(2.1.12)
o
Figure 2.1.1
<
fixed.
164
E:
< 1/1Yol.
Note
that for linear systems (cf. section 2) E: < < .. can be chosen
independent of the initial data.
Our contention is that, suitably generalized, Tychonov's Theorem
and its corollari es in the linear case form the state-space version of
root-locus theory and therefore can be regarded as the nonlinear version
of root locus methods. In the next subsection we illustrate this in the
linear case, while in the following subsection we use this as a starting
pOint for the design and analysis of frequency domain methods for
nonlinear systems.
2.2
By
Linear Tychonov Theorem 2.2.1
in t and suppose
165
Proof of 2.2.1
Regarding A 21 z as the output of the stable system
(2.2.1a), we first estimate
1< y, A21 z > 1$ c 1 IIYl12
by the lemma in the appendix of [19J.
sol ution of Lyapunov's equation:
Ax
cx
bu
(2.2.4a)
(2.2.4b)
wi th transfer function
g(s) = c(sI-A)-1 b = p(s)/q(s)
(2.2.4)'
166
(2.2.5b)
of the closed-loop system.
see that
-cby
-cby
Yo = Yo
(2.2. 6a)
(2.2.6b)
Example 2.2.4 (r;:: 1) In order to stabilize minimum phase systems of
higher relative degree, a classical strategy (lead-lag compensation) is
to place the compensator zeroes in the left half-plane in order to
167
"attract" the infinite branches of the system poles and then to render
the compensator proper and stable by placing the same number of poles
sufficiently far off in the left half-plane. The overall stability
rests on the following inductive step:
Suppose the feedback law (2.1.2) stabilizes the system (2.2.4) for
some fixed k o . For E sufficiently small, the compensator
u(s)
~
yes)
1 +ES
(2.2.7)
stabilizes (2.2.4).
The closed-loop system (2.2.4) - (2.2.7) may be realized as
Ax + bu
-u - koY
EU
Setting E
(2.2.8a)
(2.2.8b)
0, we obtain
x=
(A - kobc)x
ko cx
168
where the
~j'
P.
q.lp.
the leading term in a fractional expansion for lis in terms of J J
This asymptotic expansion is the frequency domain analogue of the
asymptotic expression (2.2.6) for stable systems, and it is from this
point of view that one can see that stability of the boundary layer
system is also necessary for stability of the overall system.
2.3
f(x) + ug(x)
(2.3.1a)
hex)
(2.3.1b)
Lg hex)
and
169
Definition 2.3.2
Proof
'"
d(Lih)
i=O
and therefore
'"
(2.3.2)
By definition,
170
To say
In
(ii)
(iii)
span {g}
e;
U;
" ;
axn-1
0
span {a! }.
n
y = h(x )
n
In the light of the third equation, the second equation may be replaced
by
f 2 (z,y)
where, of course, f2
expressed as
ug 2 (z,y)
+
=
Lfh and g2
L h.
g
(2.3.4)
By direct analogy with the linear Tychonov Theory, one defines the
(n-1 )-th order system,
To give
171
* 0 , for all x
L h(x)
g
EO:
h- 1 (0)
f(x)
to lie on h- 1 (0).
.!.
O.
EO:
-1
(0)
(2.3.6)
where
a = Lf
Vco
h/Lif
For v co
v co-1
> 1 we define a as
h.
172
we will say that (2.3.1) has s left half-plane zeroes, u right halfplane zeroes, and c purely imaginary zeroes, in analogy wi th the linear
case. Note that
s + u + c = vf
Applications:
desired output function y(t), one finds a control function u(t) giving
rise to y(t) by first integrating (2.3.4a), obtaining z(t), and then
solving (2.3.4b) for u(t). We are interested in the construction of a
global inverse for systans having strong relative degree one, and also
in the stabili ty of a global inverse when the original systan is minimum
phase . Evidently, the local inverse systan constructed above is locally
stable whenever (2.3.1) is locally minimum phase. Our approach will be
to find a global version of the canonical form (2.3.4) fran which a
stable inverse systan can be constructed as above.
We also make the following technical assumption:
I f B = l/L h, then Bg is complete.
g
so that if (z,y)
xw.
(3.1.1b)
173
z
is globally stable on the leaf
Since L_h(x)
L h g
g
~
complete.
s:l~
)(f-~M
via
S(x,t)
<l>t(x)
Q.E.D.
174
(3.2.1 )
x 2 +u,.y=x
L h(x) ;:: e:
(H2)
a is
>0
complete
ucH<-n, there exists kU such that for all k ~ kU and all Xo e: U the
sol ution xt of the closed-loop system tends to O.
Proof By Theorem 3.1.1, there exists coordinates (z,y) in which the
closed-loop system (2.3.1) - (2.1.2) takes the form
where e:
11k.
Y L h(z,y) = 0
g
which imply, by hypothesis,
y = y(z)
175
Yo
Zt
O(l/k)
Yt
O(l/k)
lim Yt
t-+oo
A Global Adaptive
Relati ve Degree Two
Q.E.D.
In
this subsection we propose an adaptive stabilization scheme,
universal for the class of systems described above, composed of <3.3.1)
with k = kl
k2' together with the tuning law
176
(ii) lim
0; and
t-+co
(iii) lim
t-+co
Wo
y .. y
w .. ky - kw
where we have incorporated the zero polynanial in an (augmented) openloop system. Suppose, for fixed initial data, that k t -7+ co and
consider the corresponding time-varying system
Letting k(t)
k(t)y - k(t)w
-+ co,
w .. 0
y
177
Since
spec (A 11) = Zeroes (g (s))O Zeroes {s+ 1} ,
the dynamics is stable.
and therefore
lim
t-+co
kt = k co
k(t)y - k(t)w
we deduce wt e: L 2 (O,co).
that Zt "
L'.
[~a
[tl "
we alfJo deduce
and hence
L'.
178
lim
t~oo
as claimed.
QED.
References
[lJ W. M. Boothby, "Some Comments on Global Linearization of Nonlinear
Systems," Systems and Control Letters 4 (1984) 143-148.
[2J R. W. Brockett, "Feedback Invariants for Nonlinear Systems," Proc.
VII IFAC Cong., Helsinki (1978) 1115-1120.
[3J C. I. Byrnes, "Remarks on Nonl inear Planar Control Systems whi"ch
are Linearizable by Feedback," Systems and Control Letters (to
appear) .
[4J C. I. Byrnes and A. ISidori, "A Frequency Domain Philosophy for
Nonlinear Systems, with Application to Stabilization and to
Adapti ve Control," Proc. of 23rd IEEE Conf. on Dec. and Control,
Las Vegas, 1984, 1569-1573.
[5J R. Hirschorn, "Invertibility of Nonlinear Control Systems," SIAM J.
of Control and Opt. 17 (1979) 289-297.
[6J R. Hirschorn, "(A,B)-invariant Distributions and Disturbance
Decoupl ing of Nonl inear Systems," SIAM J. Control and Opt. 19
(1981) 1-19.
[7J L. R. Hunt, R. Su and G. Meyer, "Global Transformation of Nonlinear
Systems," IEEE Trans. Aut. Control 23 (1983) 24-31.
[8J B. Jakubczyk and W. Respondek, "On Linearization of Control
Systems," Bull. Acad. Polon. Sci. Ser. Sci. Math 28 (1980) 517-522.
[9J A. Isidori, A. J. Krener, C. Gori-Giorgi and S. Monaco, "Nonlinear
Decoupling via Feedback, a Differential Geometric Approach," IEEE
Trans. Aut . Control AC-21 (1981) 331-345.
[10J P. V. Kokotovic, "Applications of Singular Perturbation Techniques
to eontrol Problems," SIAM Review Nov. 1984.
[11J R. Marino, "Nonlinear Compensation by High Gain Feedback," to
appear.
179
M. Fliess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory, 181-226.
1986 by D. Reidel Publishing Company.
182
(0. I)
D.CLAUDE
q(t) = AO(q) +
yet)
i=1
u.(t)
1.
= h(q)
= ul(q)
1 = I, ... ,n
S~
(i,l
(Si)
1
~o(q) +
(0.2)
n.
i=1
l!.'
v.(t) ;41.(q)
1.
yet) = h(q)
where
~o
AO +
l.
1=1
u l Al
Fn (t) +
(0.3)
yet)
l.
i=1
I,oo.,n
u.(t) Gi
1.
= Hn(t)
183
the system and its behaviour as a black box. It is then merely a matter
of retaining only the linear part of a Taylor-development. It is important to note that the two methods don't necessarily give the same tangent linear system (cf. Csaki [19]).
The experience of automatic control engineers and the progress made
in the study of linear systems with deterministic or stochastic controls
have permitted approximate linearization techniques to have numerous
applications in the most varied fields of engineering. However,yet the
tangential aspect of the method may lead in particular to unacceptable
structural control incompatibilities such as the loss of controllability (cf. Bonnard [3]) for mechanical systems or offers solutions which
do not perform very well (cf. Gauthier, Bornard et al. [27] for the
study of distillatio~ columns). If one adds to that-the fact that modern technologies require fast actions and that it is necessary to take
the nonlinearities into account one can be quite certain that neither
the use of non autonomous linear systems (time-dependent coefficients)
nor the techniques of adaptative control can provide final solutions.
As the same time, if one goes so far as to use output-feedback nonlinear compensators before an approximate linearization (cf. Rugh [54],
it is possible, through more general transformations, to change a nonlinear system into a system with an adequate input-outpu~ linear behaviour.
b) Linearization of a nonlinear dynamics through diffeomorphism and
feedback
Linked to the state-space-approach and geometrical considerations, the
comparaison of the respectives dynamics of the two systems L and A is
.a natural question. Since Hermann [30] and Lobry [44] we know that the
controllability of a nonlinear system is expressed by means of the Lie
algebra generated by the vector fields AO, ,An . The Lie brackets of
the vector fields inevitably appear .even for RN-systems. The Lie bracket
of vector fields is invariant by diffeomorphisms. Thus one can, for exampl~ to follow a trajectory in state space, hope to change the expression
of the vector fields in a system of local coordinates in order to simplify the calculations. From an historical point of view it is interisting
to see how the problem has evolved since Stenberg [58] and implicitely
Poincare. [52], through the work of Hermann [31] as completed by Guillemin
and Stenberg [29] and the work of Sedwick and Elliott [56] via Krener [39]
who, from the point of view of nonlinear automatic control, tried to
transform locally the dynamics of L into the dynamics of a linear "SystemA
by changing the state coordinates. The problem was then well set up in the
setting of.automatic control and the use of more general transformations
to classify linear systems led Brockett [5] to ponder over the conditicns
for a nonlinear system L to have linear dynamics up to a transformation from
the group lJ of operators on systems, consisting of changes of coordinates
and the feedback laws of the kind (*). Necessary and sufficient conditions
for the existence of such linearizing transformations which are linked
to the Lie brackets of the vector fields AO, .. ,An have been given by
Jakubczyck and Respondek [38], and in a slightly more elaborate way, by
184
D.CLAUDE
Su, Hunt and Meyer [33, 34, 59]. It must be noted that these results
are being applied to aeronautics (cf. Meyer, Su and Hunt [47]) even
though they are of a local nature (cf. Boothby [4]) and not generic
(cf. Tchan [61, 62]).
c) Approximate linearization of nonlinear dynamics by state feedback
and coordinate change
This concerns an approach (cf. Reboulet and Champetier [53] and Krener
[40]) which includes both the idea of approximate linearization and
the idea of linearization of nonlinear dynamics by diffeomorphisms and
feedbacks. Its object is to look for the diffeomorphisms and feedbacks
which will make it possible for the system - in its new coordinates to have an approximate linearization around the operating points that
will be independent from those points (cf. [8, 53]) or an approximation
to a given order (cf. [40]) - with the possibility of having both aspects simultaneously. An industrial application of that method can be
found in an asynchronow current feed machine (cf. Mouyon, Reboulet and
Champetier [50]).
d) Immersion of a nonlinear system into a linear system
The linearization of the dynamics of a ~ system has the conceptual
drawback of ignoring the outputs and consequently also of ignoring
the input-output functional aspect of the system Thus Isidori and
Krener [35] look for a transformation from the group b which separates
the state of the system~ into two parts q = (qI,Q2) so that the yet)
output verifies y = h(q2) and the q2 state is the result of a linear
dynamics. Nevertheless, since the study of the decoupling of nonlinear
systems, it is known that the existence of singularities are obstacles
for such decompositions and a functional approach is preferable
(cf. [II, 13]). The problem is then a matter of finding type (*) regular feedbacks which make it possible for a nonlinear system ~ to have
the same input-output behaviour as a linear systemA. This notion of
similar input-output behaviour is formalized by the concept of immersion (cf. Fliess [22] and Fliess and Kupka [23] ) which has the fundamental advantage of leaving alone the original state manifold Q which
serves to describe the system ~ and thus it is no longer necessary to
use diffeomorphisms. The solution given by Claude, Fliess and Isidori
[16] in the continuous case and by Monaco and Normand-Cyrot [48] in
the discrete time case offers perspectives for fruitful practical applications (cf. Claude, D.R.E.T. - Contrat nO 81.492, Determination
de lois de commandes pour Ie pilotage des helicopteres ; Claude and
Bernard-Weil [14], Claude, Glumineau and Moog [17], Freund [261,Monaco
and Stornelli [49]) or other developments (cf. Levine [43], Marino [45]
and Marino and Nicosia [46]). Even though immersion under feedback
cannot be globally considered in general (cf. Rugh [54]). Moreover the
linearization of a dynamics by diffeomorphisms and feedbacks is a particular case of the technique of immersion into a linear system (cf [12]).
This result also highlights that there are complications involved attached if one wants to stay on the starting variety and only wants to
consider the linearization of the dynamics of a system.
185
e) Volterra linearization
In a Volterra series aPfoach an autonomous linear system is described
by two kernels : the w 0 kerI}el of order 0 which corresponds to the
drift of the system and the w~l) kernel of order I. In this case w(o)
1
is known to have
case for example
Ruberti [36]) to
the study of the
of a system defined by the kernels w(o) and w(~) only, where the
Volterra kernels
w(~)
1
f~rm
as a Volterra kernel
T :
Q I -+Q 2
(q) - have
Example 1.1,
q=
from statistical physics (cf. Fliess and Kupka [23]):aq - bqP + uI(t)q
(1.2) L2 f]Z(t)
y(t)
with
p ~
2 and QI = lR - {O}
=o
b(p-l)n l + (I-p) (a+u I (tn 2
n2
with QZ = JR
186
D, CLAUDE
so that the closed system L (O.Z) has the same input-output behaviour
1\
(1.3)
v;e
Jo
J o "" ,J v =
Jv
C C
,
s
.h Iq(O).c . c.
Jv
= 1, ,
(s
Each
J o.
r)
/\
,1\0
l\t1.
0
1
n
= tA
, ,A--} = {C , c , ... ,C }
the
(1.4)
f s In(O)=H s (I-Fa
In.
)- .(n(O)+ L G1 a.) (s = I, ,r)
i";l
the~-linear
:/\
form defined by
1\
= ~(q(O
- have the
to = {Ltohs I s
= 1, , r ; v ~ O},
(i) ~
187
is of finite dimension
&,
o
the LA'
A1
a) Let's suppose there is an immersion T of L into A. These systems respectively initialized at q(O) and n(O) = T(q(O -' have the same
generating series :
hlq(o).I+
I
v~o
n
j
, .. , j =0
J0
=H(l-Fa) -1 (n (0)+
o
i=1
Ga.)
1
Q=
t A
&,
o
&0 .
1\
&o :
1\
&.
0
(")
.
"
cond 1t10n
11 an e I ement G1,1"\
E ~ corresponds to ~J . i. 1W1th
every h 1S
tt A
A
s
associated an element H of the bidual
& of & , i.e. an element
0
0
t
tS
of Q. If we take H
(H , ... ,H ), the result is a (F, G1 , H) linear
l
= vi
(~=I,
&o
0 and
is considered
188
D.CLAUDE
41 = qZ+ZqZq3+ZQZuI+ZqZuZ
42
= Q3+ u I+ u Z
43
= Q4- Q3-qS+u I
qs
= u2
Y1
= QI-q2Z
YZ
= q3
Z 2
(l.S)
hl(q)
= h (q)
Let
(1. 6)
q = AO(q)
+ UIAl(q) + u 2 AZ(q)
where q = (ql,q2,q3,q4,qS)
with
(1. 7)
Take the u l
~-vector
= vI
and u 2
= v2
&0
to be the
189
A1 .h I .. 0
(I.8)
2
A .h I
1
A .LAoh I
1
A .h 2 = 1
1
A .LAo h 2 = 0
1 2
A .LAo h 2 = 0
.0
2
A .LAoh I
2
A .h 2 .. 0
A2 .LAo h 2 = 0
2 2
A .LAo h 2
and conditions (i) and (ii) of proposition 1.1 are thus verified.
The construction given in the demonstration of proposition 1.1 is
applied. Let
/\
(1. 9)
Fn + u l G + u 2 G
Hn
where n
(nl,n2,n3,n4,nS)
T :
T (q)(~) = ~(q)
Q ~~, defined by
for
/\
eo
Thus we have :
(1.12)
222
T: (qI,q2,q3,q4,qS) ~ (QI- 2Q 2;Q2;Q3;Q4-Q3-QS;QS)
(1. 13)
- 2Q 2
1
- 2Q S
1
- 2Q 3
0
~~ ~
is defined by
190
D.CLAUDE
2
LAoh2' LAoh2 - by the matrix.
(1.14) F
"
[~
0
0
0
0
0
0
0
0
0
0
0
~l
et
et
From which
(1.16)
o
o
nl
n2
n2
n3 + u l + u 2
n4 + u l
n3
(1.17)
n4
YI
nS
u2
nl
Y2
n3
nS
into the
191
The existence result of proposition 1.1 does not directly give the
A
feedback la~vs which guarantee the i!"!"ersability of the syste!" '- into
the linear system A. Nevertheless if the methods that have appeared
in the study of decoupling (cf. [II, 13] ) are used they also offer an
explicit way of calculating interesting feedback laws.
Let us recall that the characteristic numbers
are the integers defining by :
(I.18
VI, 1 = 1, , n
1 ~s
nd 31 A .LAohs f. 0
Vv, 0 $. v <
(s
1, , r) of r
~s
1 ~
(A .LAoh s )
(al ) ;
192
D. CLAUDE
= (8 1) ;
(-L~Ih
AO s
s
T :
Q ~~ defined by :
~I
\l
= r +
m
Ds
(1.21)
where
.~s
Ds
s=1
m+1
(s = 1 , r ; O~m<~) ;
Ds
s
n
f O + L v. (t) fi (s
I r)
1
s
s
i=1
Ys
nO
fO
s
I
k=1
IPk
r ck
j=O s.j ~
if gO
s
k=1
~k
I cks.j
j=O
LAO\.
fi = gsi (s = I , r ; i = I, . n)
s
For the case that instead of feedbacks one uses output injection.
cf. Krener and Isidori [4I] for linearization by diffeomosphism
results.
and
193
+ c 3 (X+'Y-m)
+ ul
(1.22)
+ c 3 (X+'Y'-m)
+ U2
=uI
y = u2
Y1= X-'Y ; Y2 = X+'Y-m
where
k~, k~,
~
1,2,3).
We have Q =E 4 and
j
i=1
c. (:r:+'Y-m) ~] '\:
~
Ow
3.
.
+ [ 'i' k! (X-'Y) ~ + c! (X+'Y-m) ~]
(1.23)
i=1
..2..
(l'Y
(1. 24)
Thus, n
I
2
I 0
-I ;
A .h l
A .LAoh l = A .h I <= I
2
I
A .h2 =
A .h 2
3
[(k.-k!) hi + (c.-c!) hi]
LAOhl
I
2
~
~
I
~
~
i=1
3
hi + (c.+c!) hi]
LAOh2 = I [(k.+k!)
~
~
I
2
~
~
i=l
=[~ -~]
be solved :
D. CLAUDE
194
n. [::] =
(1.25)
n.
(1.26)
ul = a l +
described by :
(1.27)
L 8~
i=1
vi
1 = 1,2
122
I I
I
a 1 = - (AI +A 2 )h 1 + '2 (A I +A 2)h 2
2
- L
I
I I
122
a 2 = - (A 2-A I )h 1 + '2 (A 2-A I )h 2
2
- i=lL
i=1
3
I
I 2
8 1I = '2 (gl+gl)
122
82I = '2 (gl+g2)
I
I I
-g )
821 = -2(g
21
I
2 2
822 ='2 (g2 -gl)
4
i
[k i (h1)i + c i (h 2 ) ]
[ki, (hi) i + Ci,(h 2 )i]
given by
(X-'!:l, X+'!:l-m)
n2
Al n + A2 n + glI vI + gl2 v
2
I 2
I l
Al n + A2 n + g2I vI + g22 v
2
2 2
2 1
YI
n1
Y2
n2
nl
(1.28)
195
This means that for short periods of time and small inputs the system
(1.22) - initialized in (X(O),'Y(O),X(O),Y(O - has the same inputoutput behaviour as the linear system (1.28) initialized in
(X(O)-'Y(O), X (O)+'Y(O)-m)
Example 1.4 : The (1.5) system is reconsidered.
According to the relations (1.8) we have ~1 = 1 and
~2
o where
The solution of the system (1.19) gives a type (*) feedback defined by:
(1.29)
13 11
gO - LAoh2
2
2
gl
; 13 1
gO - h2 _ gO + LAoh2
1
2
C!2
2
= g2
13 21
I
2
gl - gl
2
2
gl - g2
13 2
2
nl
n2
(1. 30)
where the
Tl2
n3
f O + fl VI + f2 v 2
1
1
1
fl
+ f2 v 2
fO +
2 VI
2
2
YI
Tll
Y2
Tl3
(1.19) are the data which make it possible to find the linearizing
feedbacks. Thus, for the systeni (1.5) we must solve:
196
D.CLAUDE
(1.31)
If one requires
(1.32)
= 0
1 ,
(1.33)
The necessary conditions (1.33) are here sufficient as has been shown
in a direct study of the system (1.5).
II - LINEARIZATION OF A NONLINEAR DYNAMICS BY DIFFEOMORPHISM
AND FEEDBACK
Jakubczyk and Respondek [38] as well as Hunt. Su and Meyer ["34] look
for a local diffeomophism T : lRN ..... lRN which - by means of a type (*)
feedback - transforms the dynamics D of a L type system into the dynamics - given by a Brunovsky canonical f.orm [6] - of a controllable
linear system i\.
This means that. if K1 K denote the second invariants of the pair
(F. G) (Popov - Kronecker'sninvariants). for the system i\ written as :
(2.1)
n = Fn + Gu avec u = (u 1 un ) t
has the following block decomposition (cf. [6] and [20])
G = (G )
s
Jj
with
F'
[~
0
x
IJ
(2.2)
G
Fjs
[0
1 x
[x
197
f. s
K xK.
s J
K xK
s s
where s, j
I, ... ,n.
Ks xn
where the x's indicate entries which may be any real number.
Similarly the state n is decomposed as :
(2.3) {
(nl, ,nn)
with
oK-I
(n , . ,n s ) , s
n
ns
I, ... ,n.
If the data in (2.1), (2.2) and (2.3) are then taken together with
y = nO (s = 1, ,r), we obtain a linear system IT of type (1.21). The
l~nk w~th c.i.f. linearization is now clearly established and the proposition (1.3) makes it possible to state:
Proposition 2.1 (cL [12]) :
=n
s=1
lP
wi th dim Q
=N
EN -+ EN
defined by
.
(1)1
(l)n.
198
D.CLAUDE
~AO+BIB
.it = AO +$
m O=$
~,:F ]
E $(
and by induction
j-l
m j =
< [.it.m
D E j)
[D.F]
],
m j-1 >
E :F
(iii)
199
Example 2.1 :
We take the dynamics studied by Krener [39], with time dependent coefficients, in its usual form :
qo
(2.4)
=u
ql
uqo
q2
We have :
Cl
= --Clql
+ q
Cl
--0 Clq2
h(~,ql,q2)
=0
A .h l
and
A .LAOh ~ 0
(CIa ql ,q2)
:]R
-+ lR
Cla q I- q2
defined by
(2.5)
.0
n
.1
n
n2
= u
Moreover these dynamics has the same behaviour as the dynamics of the
following linear system
(2.6)
f.t2
=u
lR
-+]R ,
defined by :
D. CLAUDE
200
dQ2
whence
m2
m 1]
m1
= mI.
K -I
on
n}
An
,ad(A
)A , ,ad n o
(A )A
generates a distribution of dimension N with N =
(ii) For j
CJ
i=l
Ki Here
K.-2
,An,ad(AO)An, ,ad J (Ao)An }
are involutive (Le. the < C.>
J
(iii) For j
= l n.
C.
is equal to the
201
Conditions (i), (ii) and (iii) of proposition 2.3 imply that the sets
e k.
I
0
I
Kj-k 0 I 2
0
2
K.-k 0 2
{A ,ad(A )A , . ,ad
(A)A,A ,ad(A )A , .. ,ad J (A )A
K.-k
,An,ad(AO)An, ad J (Ao)An }
1, . , n)
sinq2
sinq3
3
43 = q4 + u l
3 10
qs + q4- q l
44
42
(2.7)
4s= u 2
We have Q =~s and
A1
a
=_.
aq 3
verify<.pl~2+2
1. We know that
D.CLAUDE
202
= __d_
dq3
E-linearly independent.
2. Because [AO,A I ]
=1
and A2
d
dqS
are
cosq3 __
d_ and [AO A2]
dq
,
1
= _ __
d_
dq
4
are :JR.-linearly
3. Since
ad 2 (A o )A I = cosq2 cosq
the vector fields AI, ad(AO)A I ,ad 2 (AO)A I ,A2, ad(AO)A 2 ~re :JR.-linearly
independent if q2 "
(f) 1 =
2 and
(f) 2 =
1.
A h2
==
0 ; A h2
==
0 ; A LAo h2 0 or A LAo h2 O.
We can take h 2 (q) = q4' but there are many more solutions.
We now calculate a function hI' We must have
and
203
. 2 d hI
dh l
S1n q ---- + sinq3 cosq2
2 dq l
dql
We can take hI (q) = ql' but that is not the only possible solution.
With the choice of hI (q) = ql and h 2 (q) = q4 the matrix
=[
COSq~COSq3
~J
IT
+ kn}
in invertible on E =ES - {q2 = 2"
+ kIT, q3 = .!!.
2
-+
( QI,Q2,q3,q4,qS )
E5,
.
3 10)
( QI,s1nQ2,s1nQ3cosQ2,Q4,QS+q4-ql
I
111
I
111
111
.2
111
I
2
fO + flv
l + f lv2
I
112
.0
I
112
I
112
fO + f 2I v I + f 22v 2
2
YI
111
.0
(2.8)
with
f~
J
cst (i,j
= 1,2)
state 11. The v.(i = 1,2) denote new inputs that appear (by definition)
l.
when the feedback loop is closed :
u.(t)
1
a.(Q) +
1
j=l
sJ1'(Q)
v.(t)
J
204
D.CLAUDE
with
(2.9)
2 -
where
with
f~
g.
(i ,j = 1,2).
It we take fi
0 (i
1,2); gl2
g2 = 0 ; glI
g22 = I we find to
same results as Hunt, Su and Meyer [34], the application T being a
local diffeomorphism.
Moreover conditions (ii) and (iii) of proposition 2.3. can be verified.
We have :
and
ad(Ao)A 2 }
CI
{A I, ad(A o)A I,
We find : CI n C
CI
et
C2 n C
A2 ,
A2 ,
C2
{AI,A2}
ad(Ao)A 2 }
= C2
A2] =
and if q3 of n
2" + kn,
d!2 > c
205
r +
with
and
= Q ~m
(f)s
s=1
given by :
(f)1
(f)r
linearizable we have
ll~12'"
-1
-n
vJ
--I
-n
-1
0 -
12
, ,ad
m- 1
0 -I
m- 1 0 1
(A )A, ,ad
(A)A m}
-I
-n
= <A , ,A
0 -I
0 _12
ad(A)A , ,ad(A)A , ,
1 , , n.
: V
Q, defined by
D. CLAUDE
206
X.
and ~ 1 denotes
ni
the integral curve associated with the vector field X. such that
x.
1
~01 (q) = q (i = I, ,N).
(Z.IO)
qz
1/3
2/3
-I + qz u l + (ql + qz - 3 qz )
2/3 U
I + 3 qz
1/2
1/2
1/3
ljz
q = AO
+ u l Al +
Uz
Uz
>
1/2
1/2
- ( Zq lq3 + ZQZq3 )
0 and q3
>
Uz
O} ;
AZ
with
(Z. II)
We have
and,for all q E Q,
IZ
I
det (A ,A , ad(Ao)A )
=-
4/31/2
q3
6qZ
~ 0
207
= A2
We take CI
AI , C2
As [A I ,A2]
(l-q2 )A
-1/3
'~2
[A I , ad(A)A I ]
= ad(A)A I .
and C3
- _a_
-I
aql
- 3 (ql+q2)q2 A
X3
a ,
-a-
q3
We have XI E ~I' X2 E ~2 and X3 E ~3
Fix q
is a bijection defined by :
(2. 12)
ql
- n l - n2 + n3
q2
n2 + 1
q3
(n 1 + 1)
All the conditions of proposition 2.4 are thus verified and hence there
exist a diffeomorphism and a feedback which transform the (2.10) system into a linear system in the Brunovsky canonical form.
We now apply the immersion tech~iques to the system (2.10).
]j2
208
D.CLAUDE
: Q
-+]R3 defined by
1/2
III
.0
I
III
.I
III
VI
Il Z
,0
Vz
YI
III
(Z.13)
YZ = Il Z
The v. (i = I, Z) indicate the new inputs that appear via the feedback
~
transformation in
u.
with
al
L s~
j=1
a. +
q}+qz
=~
-~3
qz
I, Z
-2/3
-qz
a Z =-3-
(Z.14)
Sl
v.
=
6Z
}
ql+qz
---3qZ
Sl = 0
Z
e: ]R311l3> O}.
{(Il}, Il Z' 11 3)
209
The examples mentioned above clearly show the difficulties of considering only the dynamics. They also illustrate the fact that, in addition to the rather rigid constraints imposed by looking for a diffeomorphism, calculating the feedbacks laws necessitates first the solving of partial differential equations which is not strictly necessary
in the method of the immersion of a system with its dynamics and its
outputs.
For the study of more general systems the reader is referred to van
der Schaf t [63].
III - APPROXIMATE LINEARIZATION, AFTER FEEDBACK AND
COORDINATE CHANGE, OF NONLINEAR DYNAMICS.
Approximate linearization directly applied to a strongly nonlinear system gives very poor practical results and generally depends on the operating point that is considered. Two new alternative approaches can
be considered : the first one - pseudolinearization - whose aim is to
give a control structure that is independent from the operating points,
and a second one which aims at determining the transformations in automatic control that are commonly used in order to obtain a transformed system that approximates the behaviour of a linear system as accurately as possible.
1-
Pseudolinearization
Reboulet [53] cleverly suggests-within the framework of regulators looking for transformations from the group 13 (local diffeomorphisms
and feedbacks) so that the linearized version of the transformed nonlinear dynamics is independent of the operating points. That process called pseudo linearization - makes it possible to have a synthesis of
the control laws. Using it on an asynchronow current feed machine is an
example of an interesting industrial application (cf. Mouyon,
Champetier and Reboulet [50]).
Let us recall(cf. Choquet and al. [IO], von Westenholz [65]) that if
V indicates an affine connection on Q and
N
N
C1 = \ C1 (q) __d__
C2 = \ C2 () d
l..
k "aqk
l..
k q "qk.
k=1
k=l
a
are two vector fields on Q, then the following expression
(3.1)
C2
C1
L (L
k=l 1=1
C1l ,V l C2k ) d
dqk
dC 2
k
dq l
Vl C2 = - - +
m=l
rk
2
1,m C1
i, k
1 , ,N
D.CLAUDE
210
on lRn.
Since the dynamics of L are given by the vector fields
j
(3.3)
O,I,oo.,n,
an operating point
is defined by :
(3.4)
i=1
----
The state S =
].I. (t) = u. (t)
1.
1.
~l
L\'
(a k +
i=1
u.a 1.)
k
1.
qa~
(q,u 0) = - \ q
Clql
L u .
i=1
1.
(ii) For i
211
m1.
K.-2
An u
K.-2
,\11.
An
c>1
u:F
nC('\)')
L
i=1
for i
u.A
1.
I, ... ,n and k
= I,Z, ..
The bar l:Findicates restriction to :F and C('\)')denotes the set of vector fields on Q that generates T'\)', the tangent space to'\)'
Remark 3. I : In fact Champetier and al. [8] consider more general systems that are of class C~ and have a q f(q,u) type of dynamics.
mi
(3.6)
ql
ul
q2
Uz
ql + q3 u 3
Z
ql - Zq Z + q Zu 1
q4
. g1.ven
.
by
The set of operating points :FclR4 x lR 3 1.S
q3
u {q~
qz
q3
u 0l
u2
:Fl U ~
and
'\)'=
o Z 0
{qOlqO E lR4 and (ql)
-Zqz = O}
O}
212
(3.7)
D.CLAUDE
tI
111
~2
112
~3 = E; 1
~4
+ u 3 E; 3 + q3 113
0
Thus
(3.8)
FE; + I1IG
2
3
+ 11 2 G + 113 G
with E; E
:m. 4
with
We have
:m. 4
(3.10)
o FG 2
= G1 G G2
G G3
= G1 G G2
G FG I G FG 2
if q30
and
:m.4
" 0
if q30 = 0
and there result the Popov - Kronecker invariants (cf. Descusse [20]).
KI
(3. 11)
=2
and
=1
if
q~
" 0
o
if q3 = 0
3
The system (3.7) is always controllable since r K. = 4 but its
i=1 l.
structure varies depending on whether q~ is different from zero or not.
, K2
=2
,. K3
=0
3.1
to the two
213
Thus
e (91)
Cl = _0_ + ql
Clql
(3.14)
We have
m=
I 3
<B ,A >
aq
2
C2
'" Clq3
C3 = _0_
Clq4
with
BI = -Cl- + q -Cl- +
q2
Clql
I Clq2
Cl
and [B I ,A3 ]
aq
4
nl
n2
(3. I 7)
.
n3
n2
vI
=
v2
n4 = v3
It is amusing to note that the methods of paragraph II can be applied
to that example.
The system (3.6) is first considered around the operating points of
:FI
(0
0
0 0
u3
{q
,u 0) ql+q3
uJ '" cst} .
=0
; q30 .L
r 0;
ql0) 2 - 2q 20
=0
; u 0l
0, u 02 '" 0
214
D. CLAUDE
We find
(3.18)
41
)11
42
)12
with)1.
43
u.-u? (i
1.
1.
1,2,3),
1.
44
or
(3. 1 9)
u. (t)
1.
1 (q)
with q
where
(3. 20)
~1 + ~2 + ~3 +
4.
;
q3
~2
2
(ql
= 0
If we take hI = _1__ q4
h 3 = q3 , we have
h2 = ql
6
q2
2
3
I
A .h 1 = 0
1)
A .h I =-1._ q 5 Oon~
A .h 1 = 0
2
2
A0 .h I
2q2) ;
1
A .LAo h l
and ~ 1 ~ 1 on ~1
2)
1
A .h 2
3)
A1 .h 3
2
A .h 2
5
2
A .h 3
A3 .h 2
A3 .h 3
and
~2
q3 and
=0
~3 =
For the equation (3.18) the feedback (3.16) gives the following
feedback :
215
q = ~(q)
(3.22)
YI
Ql
= '6 -
i=1
vi(t)
~i(q)
where q
(QI,q2,q3,q4)
Q4
with
2
- 2q ) a
1
2 aq4
*2 = _a_ + ql -aa- +
*1 =1- -aq2
2 aq2
aQI
q2
10
(3.23)
(q
*3
aq
4
a
aq3
b= ml , b2 , b3) where
b1 = hlt - (qi -
(3.24)
b2
h 2 t + a 2
b 3 = h 3 1 + a3
Thus is we initialize at the operating points (q , u ) E 3} we have
0
btiqO
(3.25)
b2 iqo
b3 iqo
h1iQo.l + aoa l
h2iqo.l + a 2
h4iqo.1 + a 3
Those last three generating series are those of the linear system of
dynamics (3.17) and outputs Yl = n 1 ; Y2 = n2
Y3 = n4
2 - Approximate linearization by feedback and coordinate change.
According to Krener [40] one is concerned with the matter of finding
a local diffeomosphism T and a type (*) feedback so that the dynamics
of the transformed system can be approximated to a given order by some
216
D. CLAUDE
(3.26)
[" - T(q)
n(t)
where
l.
= q - q
and
\.1
= u - u
; with q and
T)
elements of~N.
k=l
k=l
1 < k
1, .. ,n> .
217
riants K1, ... K if and only if both the following conditions are satisfied :
n
(i) The ~k distributions admit as a local basis in a neighbourhood of
qO the family of vector fields
...
In fact Krener shows that there exist functions hl, . hn such that the
diffeomorphism T is given by :
T(q) =
(3.27)
(hl(q),,L:~-lhl(q),,hn(q),,L:~-lhn(q))
K.-I
K "'I
o
~
0
n
, ,nn,,nn
= (nl,,n l
)
(3.28)
I
i=l
S'" I , ,
o~
m < K.-I
~
i
0
0 p+1
g v. + o(q-q ,u-u )
s
with gO
o.
Thus we see that the diffeomorphism T is given by means of the same set
of functions as the one that defines the immersion (1.20).
IV - "VOLTERRA" LINEARIZATION.
We know that the input-output behaviour of a (0.1) type systemL admits
a Volterra functional expansion (cf. Lesjak and Krener [42] ; Crouch
[18]) given by
y(t)
w(o)(t,q(o))+ nL
i"'l
(4.1)
.... +
nL
ft
0
It ITI.. IT p-Iw.( p )
.
. 1 0
~ I" ~p=
. (t, T 1 ' , T
~I""'~
q (0) )
218
D. CLAUDE
w~p)
~I"'~p
rating series of the system (cf. Fliess, M and F. Lamnabhi [24]). They
admit a series expansion of the following type
00
(4.2)
w(o)(t,q(o))
(4.3)
wi
(4.4)
(I)
I
k=o
k
k
t
LAoh!q(o) k!
00
(t,TI,q(O))
(p)
w.
Y.
k 2 , k1 =0
k
k2
I
LAO LAi LAO h!q(o).
. (t,T1, ,T ,q(O))
p
11""'~p
The idea of Isidori and Ruberti [36] is to look for type (*) feedback
A
laws such that the closed systemL (0.2) admits a Volterra expansion involving only the kernels ~(o) and ~~I) (i=l, . ,n). Moreover, as in an
~ A(l)
A(l).
(~=l,
,n) where
(4.5)
Ai.Lm h
~o s
= cst
ned this is also the case for the closed systemL and the bahaviour
related to that output is characterized by its 0 order kernel only.
Thus we shall only consider systems whose characteristic numbers are
all well defined.
219
if O"m
t.0 s LAOh s
t. i .Lmt.0h s =
(4.6)
t.
lP
L sh
*0
with i
if
i
gs
ete
~'~s
"m <IPs
1 , ,n and s = 1, , r
T1 (q)Tk (q)
To (q)T k_ 1 (q)
o
o
O. T (q)
o
k
]
k
with Tk(q) = [ A1 .LAOh(q),
,An .LAOh(q)
,
can be carried out by means of the Silverman algorithm [57, 64] often
used in the study of linear systems.
Proposition 4.2 (cf. Isidori and Ruberti [36]) :
A necessary and sufficient condition for a (0.1) type nonlinear system
:t to be "Volterra linearizable" is that - for all k ~ 0 - the Toeplitz
matrices ek have their~~rank equal to the~(ew) - rank, where~(ew)
is the quotient field of the eW(Q) ring of real analytic functions on Q.
220
D.CLAUDE
~ 1 (q)
(4.8)
with m < r
~=
~m(q)
n defined by
1 = 1, . , n, k
= I, .. ,m
(4.10)
with 6 and
as in (1.19).
(/)s
LAO h s or e 1se
This means that in equations (4.6) we have gs0 :: 0 1'f T s
(/)s+1
.
0
the function gs - LAO hs 1S a real linear combination of the functions
(/)k
LAohk is a function that appears in the matrix ~.
LAo ~k where ~k
\U
given above with the one considered in (1.21). The latter ensures that
theE-vectorial space
O
A(1)(.1= 1 n )
Moreover, t h e or d er 1 k erne 1 s wi
0f
t h e c 1ose d system ;
n = Fn
(4.11)
z(t) = Hn
i=I
GiV i
with
zEE r
r>(q)
(4.12)
Ii
221
= Fn
e(~)=
h(q) - Hn
One can then consider the tracking of the initial system!: thanks to the
relation y(t) = e(t) + z(t).
CONCLUSION
Our title is ambitious but we do not pretend to be exhaustive. We simply wanted to survey the numerous results concerning linearization of
nonlinear systems. Both as the level of concepts and the level of computations we have tried to indicate the respective and complementary
contributions of algebraic and geometric methods. When one considers
the application of the proposed methods to the study of linear systems
it is easy to convince oneself - compared with the typically linear methods - that much remains to be done. Nevertheless the variety of applications considered - in industrial as well as in medical fields-shows
indiscutably that the large amount of collective work that our scientific community has devoted to the subject cannot be neglected in
future technological studies.
BIBLIOGRAPHIE
[1] Richard E. BELLMAN (1920-1984), In Memoriam, IEEE Trans. Automat.
Contr., AC29, 1984, p. 961.
[2] E. BERNARD-WElL, Identification and control of the model for the
regulation of agonistic antagonistic couples. Medical applications
in the field of neurosurgery and cancerology, in Mathematical
Modeling in Immunology and Medicine, G.I. Marchuk, L.N. Belykh,eds,
North Holland, Amsterdam, 1983, p. 267-275.
[3] B. BONNARD, Controlabilite de systemes mecaniques sur les groupes
de Lie, SIAM J. Control Optimiz., 22, 1984, p. 711-722.
[4] W.M. BOOTHBY, Some comments on global linearization of nonlinear
systems, Syst.
Control Lett. , 4, 1984, p. 143-147.
[5] R.W. BROCKETT, Feedback invariants for nonlinear systems, Proc.
Vll th I F A C World Congr., Helsinki, 1978, p. 1115-1120.
[6] P. BRUNOVSKY, A classification of linear controllable systems,
Kybernetica cislo, 3, 1970, p. 173-188.
222
D. CLAUDE
C.I. BYRNES, A.J. KRENER, On the existence of globally (f,g)invariant distributions, in Differential Geometric Control Theory,
R.W. Brockett, R.S. Millman, M.J. Sussmann,eds, Birkaiiser, Boston,
1983, p. 209-225.
[8]
[9]
223
[20] J. DESCUSSE, .Les invariants fondamentaux d' une paire (A, B), leurs
liens avec certaines formes canoniques, in Outils et modeles mathematiques pour l'automatique, l'analyse de systemes et Ie traitement du signal, vol 1, I.D. Landau,ed., CNRS, Paris, 1981, p. 65-8~
[21] M. FLIESS, Fonctionnelles causales non lineaires et indeterminees
non commutatives, Bull. Soc. Math. France, 109, 1981, p. 3-40.
[22] M. FLIESS, Finite-dimensional observations-spaces for nonlinear
systems, in Feedback Control of Linear and Nonlinear Systems, D.
Hinrichsen and A. Isidori, eds., Lecture Notes in Control and Information Science 39, Springer-Verlag, Berlin, 1982, p. 73-77.
[23] M. FLIESS and I. KUPKA, A finiteness criterion for nonlinear inputoutput differential systems, SIAM J. Control Optimiz., 21, 1983,
p. 721-728.
[24] M. FLIESS, M. LAMNABHI and F. LAMNABHI-LAGARRIQUE, An algebraic approach to nonlinear functional expansions, IEEE Trans. Circuits
Syst., CAS 30, 1983, p. 554-570.
[25] C. FOULARD, S. GENTIL and J.P. SANDRAZ, Commande et regulation par
calculateur numerique, Eyrolles, Paris, 1977.
[26] E. FREUND, Fast nonlinear control with arbitrary pole-placement for
industrial robots and manipulators, Int. J. Robotics Research 1,
1, 1982, p. 65-78.
[27] .J.P. GAUTHIER, G. BORNARD, S. BACHA, M. IDIR, Rejet des perturbations pour un modele non lineaire de colonne a distiller, in Outils
et modeles mathematiques pour l'automatique, l'analyse de systemes
et Ie traitement du signal, vol. 3, I.D. Landau,ed., CNRS, Paris,
1983, p. 659-673.
[28] F. GEROMEL, J. LEVINE, P. WILLIS, A fast algorithm for systems decoupling using formal calculus, Lectures Notes in Control and Informations Sciences
63
,Springer-Verlag, Berlin, 1984, p. 378-390
[29] V.W. GUILLEMIN, S. STERNBERG, Remarks on a paper of Hermann, Trans.
Amer. Math. Soc., 130, 1968, p. 110-116.
[30] R. HERMANN, On the accessibility problem in control theory, in
International symposium on Nonlinear Differential Equations and
Nonlinear Mechanics, Academic Press, New York, 1963, p. 325-332.
[31] R. HERMANN, The formal linearization of a semisimple Lie algebra of
vector fields about a singular point, Trans. Amer. Math. Soc., 130,
1968, p. 105-109.
224
D.CLAUDE
[32] L.R. HUNT, M. LUKSIC and Renjeng SU, Recognizing linear systems,
Proc. 23rd CDC, Las Vegas, NV, 1984, p. 413-415.
[33] L.R. HUNT, Renjeng SU, G. MEYER, Global transformations of nonlinear systems, IEEE Trans. Automat. Contr., AC 28, 1983, p. 24-31.
[34] L.R. HUNT, Renjegn SU, G. MEYER, Design for multi-input nonlinear
sys.tems, in Differential Geometric Control Theory, R.W. Brockett,
R.S. Millman, H.J. Sussmann,eds, Birkhauser, 1983, p. 268-298.
[35] A. ISIDORI and A. J. KRENER, On feedback equi lavence of nonlinear
systems, Syst.
Control Lett., 2,1982, p. 118-121.
[36] A. ISIDORI and A. RUBERTI, On the synthesis of linear input-output
responses for nonlinear systems, Syst.
Control Lett., 4, 1984,
p. 17-22.
[37] A. KASINSKI, J. LEVINE, A fast graph theoretic algorithm for the
feedback decoupling problem of nonlinear systems, 8th MNTS conference, Beersheva, 1983, p. 550- 562.
[38] B. JAKUBCZYK and W. RESPONDEK, On linearization of control systems,
Bull. Acad. Polon. Sci.(Math.), 28, 1980, p. 517-522.
[39] A.J. KRENER, On the equivalence of control systems and the linearization of nonlinear systems, SIAM J. Control, 11, 1973, p. 670-676.
[40] A.J. KRENER, Approximate linearization by state feedback and coordinate change, Syst.
Control Lett., 5, 1984, p. 181-185.
[41] A.J. KRENERandA. ISIDORI, Linearization by output injection and
nonlinear observers, Syst.
Control Lett., 3, 1983, p. 47-52.
[42] C. LESJAK and A.J. KRENER, The existence and uniqueness of Volterra
series for nonlinear systems. IEEE Trans. Automat. Contr., AC23.
1978. 1090-1095.
[43] J. LEVINE, These d'Etat. Universite Paris IX Dauphine, nov. 1984.
[44] C. LOBRY. Controlabilite des systemes non lineaires, SIAM J.
Control, 8, 1970. p. 573-605.
[45] R. MARINO, Stabilization and feedback equivalence to linear coupled
oscillators, Int. J. Control., 39, 1984. p. 487-496.
[46] R. MARINO and S.NICOSIA, Linear-model-following control and feedback-equivalence to linear controllable systems. Int. J. Control.
39, 1984. p. 473-485.
225
[47] G. MEYER, Renjegn SU and L.R. HUNT, Application of nonlinear transformations to automatic flight control, Automatica, 20, 1984,
p. 103-107.
[48] S. MONACO and D. NORMAND-CYROT, The immersion under feedback of a
multidimensional discrete-time nonlinear system into a linear system, Int. J. Control, 38, 1983, p. 245-261.
[49] S. MONACO and S. STORNELLI, A nonlinea;r attitude control law for a
satelli te with flexible appendages, 24th CDC, Fort Lauderdale, dec. 85.
[50] P. MOUYON, C. CHAMPETIER and C. REBOULET, Application d'une nouvelle
methode de commande des systemes non 1ineaires - 1a pseudo-1inearisation - a un exemp1e industrie1, Analysis and Optimization of
Systems, Lectures Notes in Control and Information Sciences 63,
Springer-Verlag, Berlin, 1984, p. 327-338.
[51] H. NIJMEIJER, State-space equivalence of an affine nonlinear system with outputs to a minimal linear system, Int. J. Control, 39,
1984, p. 919-922.
[52] H. POINCARE, Oeuvres, Tome 1, Gauthier-Villarset Cie,eds, 1951,
premiere these, p. XCIX-CV.
[53] C. REBOULET, C. CHAMPETIER, A new method for linearizing nonlinear
systems : the pseudo1inearization, Int. J. Control, 40, 1984,
p. 631-638.
[54] W.J. RUGH, An input-output characterization for linearization by
feedback, Syst.
Control Lett., 4, 1984, p. 227-229.
[55] W.J. RUGH, Design of nonlinear compensators for nonlinear systems
by an extended linearization technique, Proc. 23 rd CDC , Las
Vegas, NV, 1984, p. 69-73.
[56] J. L. SEDWICK and D. L. ELLIOTT, Linearization of analytic vector
fields in the transitive case,J. Diff. Equ., 25, 1977, p. 377-390.
[57] L.M. SILVERMAN, Inversion of mu1tivariab1e linear systems, IEEE
Trans. Automatic., Contr., ACI4, 1969, p. 270-276.
[58] S. STENBERG, Local contractions and a theorem of Poincare, Amer.
J. Math., 79, 1957, p. 809-824.
[59] Renjeng SU, On the linear equivalents of nonlinear systems, Syst.
Control Lett., 2, 1982, p. 4S~52.
[60] H.J. SUSSMANN, Lie brackets,rea1 analyticity and geometric control,
in Differential Geometric Control Theory, R.W. Brockett, R.S.
Millman, H.J. Sussman,eds, Birkaiiser, Boston, 1983, p. 1-116.
226
D.CLAUDE
Hi
system to be feedback linearized and output block decoup1ed it is necessary and sufficient that the algorithm is executable. This algorith~ is
novel in the sense that there are no constraints on the number of inputs
and the number of outputs of a system.
1.
INTRODUCTION
M. Riess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control-Theory, 227-241.
1986 by D. Reidel Publishing Company.
T. J. TARN ET AL.
228
Hi
ker(dh j ), i=l, ... ,k, are linearly independent, Tarn, Bejczy, Isidori and
Chen [3], [4], [5] obtained necessary and sufficient conditions for a
given nonlinear system to be locally, around a given point, feedback
equivalent to a linear controllable system and simultaneously output
decoupled and applied it to the design of robot arm dynamic control.
In this paper we derive an algorithm for computing the nonlinear
feedback control law and the diffeomorphism transformation for a given
nonlinear system to be externally linearized and simultaneously output
decoupled. Such algorithm has been given in [3], [4], [5] for the case
where in each decoupled block the number of outputs is less than or
equal to the number of inputs. Here we extend the previous result of
[3), [4], [5] to the case in which there are no restrictions on the
number of inputs and outputs.
2.
f(x) + ~
h(x)
(1)
12k
iii
(x ,x , ... ,x ), each x -(xl' ... ,xn ),
i
iii
12k
(Yl'Y2'" "Yr) - (y ,y , ... ,y ), each Y -(Yl''Yr)'
~
i=l
ni;=n;
Li-l r.-r.
~
229
Aiz i + Biv i
i
Y
hi(zi).
i=l ... k.
(2)
Hi
tribution contained in Ki .
[3],
[4],
[5].
H)
Hi)
S
s
D~ is nonsingular. involutive and [gj.Di]CDi+G.
j-l.2 ... m;
1
D~+l
_ R,(l([f.D~] + ~
1
1
1
L
[
DS ] + G)
.
j-l gJ'" 1
(3)
To facilitate the presentation of our algorithm. we need the following two lemmas which are proved in [8] and [18].
Lemma 2 Let K be an involutive distribution.
codistributions inductively as
We define a sequence of
230
T. J. TARN ET AL.
~lrJQk*
If
We
(5)
If G,
E and GflE are nonsingular, then R is the largest controllability distribution contained in E.
Lemmas 2 and 3 are algorithms for computing the largest (f,g)invariant distributions and the largest controllability distributions
respectively.
Combining them, we can compute the largest controllability distribution contained in a given nonsingular involutive
distribution.
3.
AN ALGORITHM
+~k=T(U).
~i
= Sp (
8:~
Let
T:y~x
Jacobian matrix J.
~
Yl
~
Yl
_8_ in
8Yn
__8__ ) = J I = J.
8Yn
A:[ = Sp{
~
~
a~
Yj
and
x
A~
= Sp{
_a.
ax~
I j=l, ...
231
,n~).
This
ai C Sp{ ~I
aXj~ j=l, ... ,n.).
~
aYj
fact implies that J has the block diagonal form with nixn i blocks, and
T*(A:~)-A~.
Hence
It follows that T*
i
i
hence x depends only on Y . 0
Now we define a map Pi:Tx(U)~Tx(U) as follows.
Definition 2
Assume Al , ... ,Ak are simultaneously integrable distributions, Al +A 2+, ... ,+Ak=T(U), and f(x)cV(U).
For each point xcU there
exists a unique decomposition
Pi:Tx(U)~Tx(U)
pointwise as follows.
Pi(f(x
Proposition 5
tions,
f
i
= v (x), i=l, ... ,k.
Proof.
If we fix a set of parallel coordinates x, then it is
obvious that
i
. i
T
p.(f(x=(O, ... ,O,fl(x), ... ,f (x),O, ... ,0) .
~
ni
(6)
Thus, what remains to prove is the right hand side of (6) is a well
defined vector field, i.e. it is independent of the choice of the parallel coordinates.
To prove it, let y be another parallel coordinates and T:x~y be the
coordinate transformation on U. Then we must show that
J l J2
T + T +
(7)
T. J. TARN ET AL.
232
where
+ denotes
Therefore
T*(Pi(f
D
i
i
T
)
ni
Hi
.
ker(dhJ ).
i=l, ... ,k. Assume peM is a regular point, i.e. there is a neighborhood
of p, say U, such that all the nonsingularity requirements in Lemma 2
and Lemma 3 are satisfied. Let Ri be the largest controllability distribution contained in Ki . As in Theorem 1, assuming that Rl , ... ,Rk are
linearly independent, we have the following algorithm.
Algorithm 6
Step 1.
Check Rl , ...
,~
R.=T(U).
i=l ~
A
Step 3.
Step 4.
n.
Construct F.:V.cIR ~~M according to
~
... 0Fk(y
(i)k
(i)k
(i)i
(i)i
1' .. ,y
~)OFi(Y
1' ,y
ni)(p), i-l, ... ,k.
233
1
2
y(l)l
(2)2
(7)
Step 5.
Compute
Step 6.
Compute qi(f), qi(g ), i=l, ... ,k; j=l, ... ,k, and note that
-j
o
qi (gj) = { [
,j
~i J,
(8)
-i )1 J'
Set -i
g - qi(g
J'
z -z (p), jfi
.' _ [
G
O ] '
T. J. TARN ET AL.
234
.
(~i,-gi)
. d (2
i to l'1near1ze
F1n
Ti,a 2i , (32)
~
an d set
S tep 9 .
=[
Then there exists (3 1 such that
Set
]'
T = T2 Tl
a
(3
(31
o Til + a 2
0
Til) (32
Then system (1) can be locally feedback linearized and output block
decoupled around pM by (a,(3,T).
The following theorem assures the executability of the above algorithm.
Theorem 7 Under the assumptions of Theorem 1, the local feedback
linearization and output block decoupling problem is solvable, if and
only if, the conditions i) and ii) of Theorem 1 hold and the condition
iii) is replaced by the following condition iii)':
iii)'
235
Lemma 8 If Rl' ... '~ are linearly independent and system (1) can be
locally feedback linearized and output block decoupled around pM, then
it follows that
" i
Ri + R{'G,
are involutive, where Ri
Proof.
involutive .
.. , ,
1,
l:
R ..
jfi J
D~=Ri(j~
is involutive.
Thus one may
conclude that
Ri+R.n~
Hence
is involutive.
CJ
as a basis of GflR ..
~
Thus
Then we have
jfi.
gj9 - [
Im
l
0
].
+
+[
~
0
]
Thus it follows that
236
T. J. TARN ET AL.
Sp{ _8_
8x li
8x~
'~
ni
-i.
Since g
(9)
'J.'
of z j , JrL
-1
-k
Now {gl' ... ,~J and (g , ... ,g J are both the basis of G, so there
-1
-k-
(10)
237
mi
poa.
-i
-i
0
Now both qi(f)+qi(g )a i and qi(f)+qi(g )oa i are independent of zj,
-i
D
Lemma 10
Thus
Then
'
238
T. J. TARN ET AL.
Ri+R~fl~
X~l:Rin ~,
is involutive.
is
This fact
ni-m i
, s=l, ... ,m i .
n. om.
1. 1.
-i.
-i -i
g =g I .
.
s s zJ=zJ (p) ,Hi
,ffi i
The other
par~
is trivial.
is a basis of GflR i .
c:J
Let
-i
Then on some neighborhood of p, gs' s=l, ... ,m i is
m.
1.
-i
~i
Therefore
J.
J.
=Z
(p) ,jfi
239
CJ
Lemma 12 In step 7, Ii is independent of zj, jfi .
Proof. Recalling equation (10) in the proof of Lemma 9, we know
that if the system can be linearized and output block decoupled then
there exists an a?1 such that q.1 (f)+q.1 (Xi)a?1 is indepe,n dent of zj, jfi;
'0
-i
Thus
-i
-i
ni-m i components of qi(f) are independent of zj, jfi and remain the same
under feedback.
In Algorithm 6 step 4, if Xj ' i=l, ... ,k; j=l, ... ,n i are group
commutative, i.e.
[X;,X{l-O, s=l, . .. ,n i ; t=l, ... ,nj ; i;;ij ,
then we can define
T. J. TARN ET AL.
240
-1
2.
3.
4.
5.
6.
7.
8.
9.
241
An Introduction, Springer-
William M. Boothby*
x - f(x)
+ u(t) g(x)
X e M
(L2)
f.j = { g. adfg
. .. ,
adj-1
f gl is invo1utive.j
k
Here adfg - g. adfg - [f.g] and adfg - [f. adk-1
f g]; and (
1.2 ... n.
lp denotes
satisfies (L) and there is a ; e C~(M) such that d; is not zero at any
peM but annihilates An_1 everywhere: (f.n _1 .d;)-O.
This is equivalent to the requirement that ; be without local
extrema and be constant on every connected integral manifold of A l'
Although the conditions (L) are very strong. they by no means guRrantee
global integrability. However. global integrability is easily seen to
*This material is based upon work supported by the National Science
Foundation under Grants" ECS 8306789 and ECS 8518832.
243
M. Fliess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory, 243-256.
1986 by D. Reidel Publishing Company.
.
w. M. BOOTHBY
244
Proposition.
Yi = Li-l~
f ~,
.
~
f M into Rn
- 1 , ... ,n d e f~nes a COO mapp i ng y = F (x) 2-
F*(~J.)
(L
aYn_j+l
~y
).
<P
with
i-I
i-I
g, dYj> = <ad f
j-l
g, dL f <p>,
(_l)n-i<ad~-lg,d<p> when
n
<adn-l
f g, d<p> is the product of
the cross diagonal elements, and thus could vanish only if d<p = O.
Therefore this matrix is never singular and so, from (Ll) dYl' ,dY n
are everywhere linearly independent as claimed. If X
~j'
then X is a
. 1
~j
is spanned
hence the mapping F:M~Rn it determines is not unique. But given UcM a
union of leaves of ~n-l transverse to an open are, then any ~Coo(U) with
the same properties as <pIU satisifes ~ - AD<pIU where A is a COO function,
A:<p(U)~R,
zero.
1.4
245
n.
, a ij , b i constants. We shall
i+l and zero otherwise and b=(O, ... ,l)t, i.e. F*(f) = Y2
~ + ...
Yl
__
8_ This is a sort of weak linearizability: we
8Yn'
are not requ~r~ng that F be either one-to-one or onto. We shall say that
F is global linearization of (*) only if it is a diffeomorphism of M
onto Rn , this being what one would ordinarily want to mean by
linearization of (*).
2. STATE FEEDBACK
We continue to consider a system (*) satisfying (L) on M. Define the
feedback group
w on
M to be the set W -
P) -
l/~).
((Q,~)
(a + ~ a, ~
Q,~
P).
= (t,g) = (f + ag,
~g)
(*)
= f(x) + u g(x) to a new system x .= f(x) +
g(x) by feedback of
the form: u = Q(x) + V ~(x). Clearly this action defines an equivalence
relation on the systems (f,g) on M with (t,g) ~ (f,g). The following
facts are well known:
If (f,g) satisifies (L) so does every equivalent (t,g),
(FB1)
in fact fl. l ,
(FB2)
state from
Xo
x=
f + u g taking the
w. M.
246
BOOTHBY
(FB3)
If the sys!;em (f,g) is globally integrable, then so is
any equivalent system (f,g).
2.1
t:
Coo(M) a function
o.
Similarly,
LII/! - < I, dLII/! > - < f + ag, dLfl/! > = < f, dLfl/! > =
<adfg,dl/!>
L~I/!
since
i 1
i-I.
0 , etc. So that L - I/! - Lf I/!, ~ - 1,2, ... ,n.
It then follows immediately from the local theory [7), [13) that we
have:
2.2
t:
not vanish at any point. Then the system (r,g) in the equivalence class
of (f,g) given by the feedback law:
-<f , dLn-l.L>
f 'I'
<g,dLn-l
f I/!>
Ii-I
<g,dL f
I/!>
247
fields and define f=Y2Xl+Y3X2 + ... + ynXn - l and g=Xn Then F*(f) = Y2
+ ... + Yn
~ and F*(g)
Yn-l
integrable system with
a~l
aa
mining F.
2.4
connected open set and let F : M~Rn be the inclusion. M can be chosen in
many ways.
(*)
on M will linearize to a
regular CeQ maps F : M-+R2. The following procedure gives, in a sense, all
examples of locally linearizable systems on M=R 2 . Let D be an open
simply connected domain of the complex plane C , H : R2-+D a CeQ
diffeomorphism of R2 onto D and w(z) a complex analytic function on D
such that
displays R2 on R2 ,
identified with the complex w-plane. This idea was used in [1) and will
be further discussed below. As will be seen below it gives a rather
complete picture of the nature of systems (*) on R2 satisfying the local
linearizability conditions (L).
In summary, given a system (*) on M satisfying the local
linearizability conditions (L) there are two obstacles to global
linearization: (1) the distribution ~ -J may not have a global integral,
~, i.e. may fail to be globally integPaole, and (2) if it is globally
integrable, then in most cases the linearizing map F : M-+Rn is neither
1 : 1 n~r onto. These difficulties occur already in full force in the
case M-R ; and it js not too difficult to see that merely choosing
another integral ~ will not remedy (2). The best we are able to get in
W. M. BOOTHBY
248
M is a discrete
249
and Reeb [16] for a discussion of these questions). In the n=Z case it
was shown by Kamke that a codimension on foliation on RZ will have an
integral ~ defined on any relatively compact open subset. Haefliger (see
[15]) showed that an analytic codimension one foliation ~n an analytic
manifold M satisfying very light restrictions has a C integral ~
defined on any relatively compact open subset. It appears to the author
that this may be adequate for many applications.
Finally we mention that on R Z if g is a polynomial vector field,
then a theorem of Markus [11] guarantees the existence of an integral ~
of the corresponding curve family. It appears possible that this could
be proved for n>Z in the context of systems satisfying (L).
results. A C
. Z
family on R
foliation
of R
is a family of curves on R
~l' ~Z
on RZ are Cr - equivalent if
Cr _
~l
that:
'
~Z
is a
closed set,
i.e.
~ to within
cO
~
(topological) equivalence,
there corresponds a complex
W. M. BOOTHBY
250
I ~; I > 0
factors :
naG .
G and Ware
curve family
suppose the system (f, g) on R2 is such that the integral curves 'Of g
define the (oriented) curve family
that they have a COO integral u. Then u,v=Lfu give a map F into the
uv - plane. The family
which u must take the same value u o and hence all are mapped onto
251
and D' may cover part of B'. Of course on 3, which is not drawn,
there is no overlapping. Fl :
RZ~3
av
.V
M
Figure
Integral
curve.
of
It
mapped
by
onto
A2
and to give a
qualitative
be
W.M.BOOTHBY
252
by a complex analytic function w(z) -u+iv on D such that w goes onto the
curves u=constant, (f,g) goes onto (-v
a~
, a! ),
D~C.
The
globally
. 2
the set of reachable points is all of R . Then the system is
integrable
and
R2~R2 is a
o:
W~(a,b)
(a,b) such
U-
C over all cw
253
3.3 Lemma. If the system x-f+ug QllR2 satisfies the conditions of the
theorem. then on each integral curve C of the vector field g there is
exactly one point at which f and g are linearly dependent. The set of
all such points is a smoothly imbedded image T of R which intersects
every integral curve and intersects it transversely.
Proof. Let C be an integral curve of g, then C lies in a saturated
open set U on which a COO integral of the foliation is defined. Define
the COO function a on U by a-L ~. Alternatively a- <f+ug, d> = <f,d>
since <g,d> = O. Now a is monotone along each curve of w since
L a = L
g
<[g,f], d>
0 .
'
C is a single
point.
Now consider C,U as above, and p-T
U~R2
8
8
F*(f+ag) - Y2 -8- and F*(g) - F*(pg) - -8- Since a
Yl
Y2
<f ,d>
, is F
-1
({(Yl; O)})
. Hence T is a
U , the
smoothly
w. M. BOOTHBY
254
is
and C 2
'
12 =
0.
Both II and 12
' but this contradicts the fact there is only one such point on each
curve. Thus
imbedding
~(p(t-t
lemma and shows that (f,g) is globally integrable and more: each
integral curve of g has exactly one point on the curve pet)
T . It follows at once that the linearizing map F
(~(p),Lf~(P
the family
p~
, i.e. on
(Yl(P)'Y2(P
carries pet) onto the Yl-axis and the curve thru pet) of
onto an open interval (a(Yl),b(Yl
Remark. The converse of the theorem does not hold. To see this let
2 aYl
,g =
---aa
Y2
form on R2 and let UcR 2 be the open set defined by removing the ray
For y=f+ug restricted to U there is no solution curve
from
Xo -
(1,0) to xl
255
It is also
Xo
R2 and T>O ,
to xl in time T.
I-
Boothby
of Mathematics
University
Missouri 63130
References
[1]
[2]
[3]
[4]
[5]
256
W. M. BOOTHBY
(6)
(7)
(8)
(9)
W. Kaplan, Regular curve families filling the plane II, Duke Math
J., 8 (1941), 11-45.
[10]
[11]
[12]
[13)
[14]
~! +
(a(x,y)
~ =
Topolog~
8 (1969), 47-57.
116.
[15)
[16]
[17]
[18)
[19]
1. INTRODUCTION
In the last fifteen years a theory for nonlinear control
systems has been developed using differential geometric
methods. Many problems have been treated in this fashion
and interesting results have been obtained for nonlinear
equivalence, decomposition, controllability, observability,
optimality, synthesis of control (with desired properties:
decoupling or noninteracting), linearization and many others.
We refer the reader to Sussmann [28] for a survey and bibliography. We want to emphasize only that in most of the papers
devoted to nonlinear control systems (using geometric methods)
only a local viewpoint is presented. This is due to two kinds
of obstructions: singularities of the studied objects (functions, vector fields, distributions) and topological obstructions for the global existence of the sought solutions.
We want to present an approach to global aspects of some
of the above problems, namely linearization, controllability
(via equivalence to polynomial forms) and decomposition,
based on a certain global theorem. This theorem, which has
been introduced to control theory by Krener [17],[18] and
Sussman [29] (and has been given in a more abstract setting
by Nagano [23]), states that an analytic nonlinear control
system is determined uniquely by the Lie algebra generated
by the vector fields which correspond to constant controls,
provided that the state space is simply connected and the
vector fields are complete. The precise formulation will be
given later on. Therefore if we study the question of when a
nonlinear control system is equivalent to a particular form
(linear, decomposed, polynomial controllable) then the above
theorem says how the Lie algebra of the original system should
loook like. Based on this observation we obtain in Section 3
conditions for the global linearization under a pure feedback
[4] and/or change of coordinates in the state space. In
257
M. Fliess and M. Ha;zfwinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory, 257-284.
1986 by D. Reidel Publishing Company.
w. RESPONDEK
258
= f(~,u),
(2.1)
[h 1 ,h 2 ], hi E VW (M),
given by
ah 2
[h 1 ,h 2 ](O = ay(Oh1(~)
u E lit
Denote by
8h 1
- ay(Oh2(~)
V (M)
generated by all
vector fields of this form. A
typical element of L is a finite linear combination of
elements of the form [1, [fk-1,fk] . ], where fi(~)=
= f(~,ui)
E lit
~(t,~),
dt~(t,~)
h(~(t,~
~(O, ~)
259
it is complete.
A connected submanifold N of M is called an integral
submanifold of L if at every I; EO: N the tangent space to
N equals L(I;). In the analytic case we have
Theorem 2.1 (Hermann [10], Nagano [23]). There exists a
partition of M into maximal integral submanifolds of L.
The dimension of L(I;)
is constant on each submanifold an~
equals to the dimension of that submanifold.
The importance of the above result follows from the fact
that given I; EM then the integral submanifold of L through
I; carries all the trajectories of (2.1) passing through 1;.
Therefore in what follows we will assume that dim L(I;) = n
at every I; EO: M. This property is sometimes called the controllability Lie rank condition or (local) weak controllability (Hermann and Krener [11]).
Consider now two analytic control systems
i
.i i i
~ : I;
= f (I; ,u),
i = 1,2,
where I; i E Ni and in both systems we have same class Um
of admissible measurable controls with values in n. Let
Li denotes the Lie algebra of ~i, i = 1,2. Assume that
we are given initial states I; i E Mi , i = 1,2.
~1
is called to be
homom~rphic(1)
to
~2
if there
a(tp~(t,I;~
tp~(t,I;~),
~2
if
is a diffeomorphism and
~1
is locally
(1)
= 1,2,
~2
i.e.
260
W. RESPONDEK
o }.
We have the following
Theorem 2.2.
[29]). Let ~i, i = 1,2, be analytic, complete systems satisfying the controllability Lie rank condition. Then
(i)
~1
which carries
(ii)
(iii)
Moreover if
lates ~2.
L 1
So
M1
into
L 2.
So
~1
simu-
~2
isomorphically
F;~
L22
F;o
onto
(j): M1 -+ M2
feomorphism
A
L1
261
such that
(j)*.
Recall that
by the formula
(j)*
h 1 , ... ,h n E V (M)
[hi,h j ] = 0,
12.i,j~n
= TF;M.
Then M is diffeomorphic to lR n
feomorphism (j): M -+lRn such that
(j)*h i
where
Cl
ax,'
(x 1 ' . ,x n )
such that
1, .. ,n,
lRn.
Remark. The same result remains true under Coo-assumptions. In this case it does not follow from Theorem 2.3.
However one can observe that globally defined 1-parameter
groups (j)i(t,F;)
generated by hi'
i = 1, .. ,n define a
transitive action of n-dimensional abalian Lie group on M.
This and the fact that M is simply connected imply the
above result in the Coo-case. This argument was used by
Dayawansa et al [6].
Proof. Denote
L1 = {h 1 , ... ,hn I LA
M,
and
L2
M2
{Cl
ClX
lRn.
Take
Cl
262
W. RESPONDEK
m
L:
i=1
u.g.(I;),
1
I; EM,
(3.1)
M.
We
start with the problem of when (3.1) is isomorphic to a linear controllable system of the form
x=
j
adfg
vely
Ax +
L:
i=1
u.b.,
1
(3 .2)
f, g E VW (M)
we will denote
. j-1
~ [f,ad f
g].
and inducti-
(A2)
[ad~gi' ad~gk] = 0,
o .::. p
1;0'
O<j.::.n-1} = n,
for any
and
+ r .::. 2n - 1
i= 1
and (A2)
263
holds for
O 2 P + r 2 Ki + Kk - 1.
Coo-case.
In the analytic case (A2) can be replaced by the condition stating that any Lie bracket involving at least two
gis vanishes at ~o
(Respondek [26], Sussmann [32]).
We state now a global ~inearization result. Theorems
2.2 and 3.1 lead us to the following
Theorem 3.2. System (3.1) is isomorphic to a controllable
if, and only if, there exist (after a reordering the inm
>0,
L: K. = n,
such
puts) Kronecker indeces K 1 -> ->K mi=1 ].
that
(3.2)
(A 1)
dim span
at every
(A2)
(A3)
(A4)
{ad~g i (~): 1 2 i 2 m,
~
2 Ki - 1 }
M,
[adigi, ad~gk] = 0,
o -< p + r < K. + Kk - 1,
].
the vctor fields
complete.
0.::. j
for any
1 2 i, k<m
and
i = 1 , ... , mare
and
is simply connected.
(-1)jA j b ..
].
W. RESPONDEK
264
algebra
-n.
1
1<j<X1
= ax
We have
g. =
1
h ..
lJ
ij
a
aX i 1 '
1, ..
,m.
Compute
= [ [ f , -a-x-.-'----
l,p
f (E;o) = 0
], ax
a
k,r+1
imply that
J.
f .. ,
lJ
the com-
ponents of f
in the considered coordiantes are linear
homogeneous functions. Hence f
is linear vector field in
x.. coordinates. 0
lJ
Consider now
E;
= f ( E; )
+ ug ( E;),
E;
EM
(3 .3)
xE]Rn
x = x(E;)
(3.4)
(change of coordinates)
265
E~.
~o
W. RESPONDEK
266
as a pure feedback without allowing any change of coordinates in the input space (Dayawansa et al. [6] call it to be
restricted feedback). We want to emphasize that this class
of feedback has the possible application to G~rsanov measure
transformations [4],[22].
Theorem 3.4. (Brockett [4]). System (3.3) is locally
equivalent to a controllable (3.4) under x = x(~)
and
v = a(~) + u,
around ~o and 0, respectively, if, and
only if, i t satisfies around
{ad~g(~o)
(C 1 )
dim span
(C2 )
r-1
q
p
[adfg, adfg] cD
~o
O~j <
n -
1}
= n,
for any
O~q~p~r
and r = 1, ... ,n-1.
for any
0 ~ q ~ p ~ n - 1
Lh tp (0
3tp
= "IT
( ~ ) h ( ~)
n-1
L
.
~
y.adfg.
i=O ~
267
(1,0, ,0)
and
(3 .7)
Thus
(_1)n-1_d_
dX
mod on-2
(-1)
y n-1
and
mod o j-2 ,
hence
We have
(_1)j-1_d_
dX,
n-1
x n- 1.
n-1
j
L f Yn-1
(-1)
n ..... 1
Lf
Y n-1 .
We will denote by
j = 1, .. ,n.
x n- 10'
-J
0, ..
,n-2
(_1)n-1 ex . c
f,
i.e.
""
f = f - exg,
( ii)
( iii)
(02)
[g,ad2g]
" f
= 0,
for
= 1,3, .. ,2n-1.
are easily
268
W. RESPONDEK
algebraically verified. A standard condition which guarantees (03) is Ilf(OII,llg(OII <C(1+III;II)fora suitable constant C.
Proof. We show the implications (i) ~ (ii) ~ (iii) ~ (i) .
(i) ~ (ii).
If (3.3) is linearizable to a controllable
form under a pure feedback v = a + u then, as shown by
Brockett [4], (01) and (02) are satisfied. If (3.3) is
globaly pure feedback linearizable then g and modified
(iii).
g
[adfg, adfg]
cD
r-1
for any
0 ~
and any
r = 1 , .. ,n -1 .
(3.8)
r-1
1
J.
rE a .adfg
c 0
j=O J
for suitable
Compute
because
w
a j E C (M),
r-1
.
[ f, Ea. ad~g]
j=O J
r-1
j+1
r-1
.
E a.ad f g + E (Lfa.)ad~g.
j=O J
j=O
J,
a r _ 1 =: 0
g
p]
r-2
[ adfg
,adfg c D ,
and we get
Now observe that (3.8) and (01) form the set of original
conditions of [4] implying pure feedback linearization.
Therefore we can apply Lemma 3.5 and we get a, defined by
(3.6), such that s =
diffeomorphism. Hence
of Theorem 3.1).
""
f
= f(E;)
269
+ vg(S)
s =
f (S)
En.
+ ug ( S)
f (S)
s EM,
+ ug ( s) ,
(4.1)
where the state space M is an n-dimensional simply connected analytic manifold and f and g are analytic vector
fields on M. For such systems Zeitz [33] introduced a concept of a nonlinear canonical form, which is the following
representation
f 10 (x n )
0
x 1 + f 20 (x n )
+
x
x
n-1
+ f O(x )
n
n
x EEn.
(4 .2)
It can be seen that if (4.1) is isomorphic to its canonical form (4.2), then it is globally equivalent under
x = xes)
and v = a(s) + u to a linear controllable system
(3.4). Thus the class of (4.1) which admitt canonical form
is a proper subclass of globally linearizable systems and is
described by the following
Proposition 4.1. System (4.1) is isomorphic to (4.,2)
if, and only if, it satisfies
(E1)
(E2)
(E3 )
[ g , ad~g]
0,
j
the vector fields
[ad~g,ad~g] = 0,
0::.p+r::.2n-2.
270
W. RESPONDEK
j g=(-1)j--aad f
a x j +1
J. = 1 , ... ,n - 1 .
n-1
One easily sees that g,.~.,adf 9 satisfy (E1)-(E3)
and
since these conditions are invariant under diffeomorphisms
they are necessary.
As for the converse define
and
a
g = aX1
= 1, ... ,n.
where
Let
f
a
= -a-'
Xj
1, ...
,n,
f.-ai=1
~ xi
~
En.
We have
(-1)j[f,adt19] = (-1)j [f,(-n j
[ax.,f]
-\!.]
J
j = 1, ... ,n-1.
The
271
(4 .3)
f 1 (x 1 ' , xn )
k -1
i
E x 1 f 2 ,(x 2 , , x )
i==O
+ u
k
n-1
n-1
-1
xn- 1
where
k 1 , ... ,k n - 1
i=O
1 f 2i(x )
n-1
-1
Ex
'i=O
n-
1f,(x)
nl
(4.4)
= g
and inductively
hj
+1
(4.5)
w. RESPONDEK
272
k 1 , .. ,k n _ 1
(F2)
[h. , h .] = 0,
(F3)
(F4 )
hi
< i < n}
-
for any
(f;):
= n,
1.2 i,
at every
such that
a. ECw(M),
EM,
j.2 n,
there exist
f;
1, .. ,n,
[hi' f) = a i + 1 h i + 1
for any
i=1, .. ,n-1.
= g
= _a_
c =1
1
and
j=2, ... ,n
c. 1=(c.) J(k.)!,
J+
h 1 , ... ,h n
k .-1
[h.,f]=[c-a-,f]=c.(k.x. J
J
J Xj
J J J
Since
h. = c.-a- ,
J
J Xj
and
aX 1
= 1, .. ,n-1.
k.-1
+
i-1
a
ix. f. 1 . (x -::-a-i=O
J
J+,~
n
Xj+1
J
l:
'-a-'
J Xj
(4.6)
h 1 , . ,hn
M is diffeomorphic
(x 1 ' . ,x n ) on mn
273
c. 1
J+ dX j +1
h.J+ 1
Recall that
h.
J
k.-1
x.J +
J
_d_ f
and thus
._d_
J dX j
k.
(c .) J (k .) ! d
J
J
Xj+1
i=O
dX. j+1
J
x.f. 1 .(x
J J+
,1
k.
adh~ f.
J
(k .) ! .
Hence
).0
n
L
i= 1
y.h.
1
W. RESPONDEK
274
11.l.
h. I sand
l.
are equivalent.
Remark 2. One can observe that any nonlinear system satisfying the above Theorem with k 1 = ... =k n _ 1 =1
is globally
pure feedback linearizable. To see this use the following
(n-1)-steps transformation. In the
(j+1)-st step introduce
new coordinates
x n-]. 1 = f n-]. ,
where
n-]
is the
xl.' = xl."
i f n - j - 1,
(n-j)-th component of
after j-th
g=ax-'
1
f1 (x)=a(x)
and
x,
x = (x 1 ' ... ,x n )
on
JRn
we have
k k.-1
j
]
i
f. 1(x)-x. + E x.f. 1 .(x),
]+
]
i=O]] + ,l. n
_
j=1, . ,n-1,
where
f 1 , .. ,f n
denote components of
f.
275
where
c 1 , ... ,c n
Assume that
d_
h. = c. __
k JdX.
mod Dj - 1
are de(4.8)
d
h1 =c 1dX1
= ad j = ~
d
mod Dj .
h.
j+1dX. 1
h j +1
J
J+
We prove now that a can be expressed by (4.7) or, in
other words that (4.7) gives f 1 , the first component of
d
n-1
f. From (4.8) we have h n = c nox
-~-- mod D
and hence
tat ion
n-1 i
x
i=O n-
1f
. (x
n,1.
For
we have
n -
-1
Xn -
n-1
Yn
n-1
because of (4.6). Assume
k. 1-1
k 1-1
nh j + 1 . L f Lhn _ 1 Yn
Xj+1
c j +1
L J+
compute
k.-1
L
J
hj
x. 1
( ....l:!:...! )
c j +1
k.-1
L J
h.
J
(c.)
-c---,
n-1
Lh
k.
(----(x. J
c. 1
J+
k.-1
J
k.-1
+
c j +1
because of (4.6). Thus for j=1
k 1 -1
k n _ 1-1
x
Y = __
L~'1 .. LfL h
1 = x
!J
n-1
n
c1
1
x.f. 1 .(x )
J J + ,1. n
x.
x. ::;; -2,
J
cj
we get
(k.)!
i=O
276
W. RESPONDEK
and hence
L f x 1 =a,
'"
f = f - ag
takes in
(x 1 , .. ,x n )
coordif 1 :: O.
Therefore the vector
satisfy (F2)-(F4), because of Theorem 4.3.
h 1 , ,h n
f ( ~)
h 1 , . ,h n satisfy (F2)-(F4).
(see Remark 1) then the system
+ vg ( ~ )
(~)
+ ug (0
+ U
5. DECOHPOSITONS
In this section we discuss briefly how Theorem 2.2 helps us to
study the decorrposition problem for nonlinear systems. We consider c;eneral nonlinear system (2.1) and analyze the problem of when
it is simulated b~ or isomorphic to a decomposed system.
.
Assume now tnat we are given two nonlinear systems E~,
i = 1,2,
of the form
Let
= u
E2
1
(~
and
.
I;
n=
h ( I; , u) ,
g (I; , n , u) ,
I;EN
n EP
(5.1)
h(l;,u),
I;EN
g(n,u),
nEP
(5.2)
277
D(j)t'
the differential of
(j)t
maps
I (so)
into I (s 1). Let g E I. Then a standard result from
differential geometry (see [28]) implies that
I(s1)'
because
dim I(s )
1
<
-
dim I(s ).
0
diffeomorphisms generated by vector fields of L is transitive we conclude that locally dim I(so) = dim I(s1). The
set of s EM such that dim I(so) = dim I(s)
is both open
closed and this gives the first assertion of Lemma.
Fix t
and consider the diffeomorphism (j)t() ~ (j)(t,)
defined by h E L. Denote (j)t (so) = s 1. Let I (s ,L'I)
denotes
the integral manifold of L'I, the distribution spanned by I,
passing through s. To prove the second assertion of' Lemma
we show that (j)t mpas diffeomorphically I(so'L'I)
onto
I(s1'L'I). Since dim I(s) = const.
then L'I is of a constant
rank, say k. Observe that L'I is involutive thus in suitable local coordinates
(x 1 ,x 2 ) we have
d
sp{-,,-} ,
oX 2
278
W. RESPONDEK
where
x 2 = (x 21 , ... ,x 2k ). From the ideal property it follows that in the above coordinates
~t
into
I(~o,6)
If
L= 11 Ell 12
then there exists a parallel cascade (5.2) which simulates (2.1)
(ii)
at
Ii'
i=1,2.
(i) Let
6,
Ell
I2(~o)
holds
integral submanifolds
(~o)
I(~o,6i)
~o
EM
and define
I(~o,61)
(,u)
+ f
(,u).
E1,
is evolving on
h,
the
restriction of
evolving on P
tion of
cade on
f1
to N and the second one,
with the dynamics given by g,
f2 to
Nx P
n
P.
279
E2, is
the restric-
h(r,;,u),
l;
EN
g(n,u),
nEP
(5.3)
It is clear that the Lie algevra generated by (5.3) is complete, satisfies the Lie rank condition and is homomorphic
to L, the Lie algebra of (2.1). Observe now that the isotropy subalgebra of (5.2) at (r,;o,n o ) is mapped into Ll;o.
Thus from Theorem 2.2 (i) we conclude that (5.3) locally simulates (2.1). Now we take, if needed, the simply connected
cover of (5.3) and get a decomposed system which simulates
(2.1) (see Remark 1 following Theorem 2.2).
(ii). It is obvious from the way of constructing (5.3)
that N x P is n-dimensional, provided L (l;o) =I 1 (l;o) Ell I2 (l;o)
One easily sees that in this case the isotropy subalgebra of
(5.3) at (r,;,n)
is isomorphic to L~. Therefore if M
o 0
~o
is simply connected then the constructed parallel cascade
(5.3) is isomorphic to (2.1) due to Theorem 2.2 (iii).O
A similar result has been obtained by Krener [18]. However the decomposed system he constructed does not satisfy
the Lie rank condition and therefore we cannot conclude
whether (5.2) is isomorphic to or simulates (2.1). Fliess
[7],[8] ~tudied the problem of when there exists a decomposed
system which is homomorphic to the given one and solved it
using the idea of the formal Lie algebra corresponding to L.
Also from his paper the existence of isomorphism cannot be
deduced because the decomposed system he obtained lives on a
higher-dimensional manifold than the original one.
In the case of the cascade decomposition we have the
following
Theorem 5.3. Consider the system (2.1) and assume that
its Lie algebra L is complete and satisfies the Lie rank
condition
(i)
If L is the semidirect sum
L
K Ell I
W. RESPONDEK
280
( ii)
L(~o)
every I;;EN=~(~o,r)
we can restrict f(,u)
to I(~,~).
Denote this restriction by g(l;;,n,u), where for any fixed
I;; EN by n we mean points of I (I;; ,~). Lemma 5.1 implies
all the I(I;;,~)
are diffeomorphic to P. Thus we define
the following cascade
.
~
h(l;;,u)
g(l;;,n,u)
I;; EN
n E P.
(5.4)
281
Witold Respondek
Institute of Mathematics
Polish Academy of Sciences
ul. Sniadeckich 8,
00-950 Warsaw, Poland
w. RESPONDEK
282
REFERENCES
[ 1]
[2]
[3]
[4 ]
[5]
[6 ]
[7 ]
[8 ]
[9 ]
[10]
[ 1. 1 ]
[12 ]
[13 ]
[14 ]
[15 ]
[16 ]
D.Aeyels, 'Local and global controllability for nonlinear systems', Systems and Control Letters, ~(1984),
19-26.
W.M. Boothby, 'Some comments on global linearization
of nonlinear systems', Systems and Control Letters,
~(1984), 143-147.
W.M. Bothby, 'Global feedback linearizability of locally
linearizable systems', this volume.
R.Wt Brockett, 'Feedback invariants for nonlinear systems' in: Proc. IFAC Congress, Helsinki 1978.
P. Brunovsky, 'A classification for linear controllable
systems', Kybernetica, 2(1970), 173-188.
W. Dayawansa, W.M. Boothby and D.L. Elliot,'Global state
and feedback equivalence of nonlinear systems', Systems
and Control Letters, 2(1985), 229-234.
M. Fliess, 'Cascade decomposition of nonlinear systems,
foliations and ideals of transitive Lie algebras', to
appear in Systems and Control Letters.
M. FI.i:ess, 'Decomposition en cascades des systemes automatiques et feuilletages invariants', to appear in
Bull. Soc. Math. France.
J.W. Grizzle and S.I. Marcus, 'The structure of nonlinear control systems possessing symmetries', to appear
in IEEE Trans. Automat. Contr.
R. Hermann, 'On the accessibility problem in control
theory', Internat. Symp. on Nonlinear Differential
Equations and Nonlinear Mechanics, Academic Press, New
York, 1963, 325-332.
R. Hermann and A.J. Krener, 'Nonlinear controllability
and observability' ,IEEE Trans. Automat. Contr., 12
(1977), 728-740.
R. Hirschorn, 'Global controllability of nonlinear systerns', SIAM J. Contr. Opt., 11(1976), 700-711.
L.R. Hunt and R. Su, 'Linearequivalents of nonlinear
time varying systems', Int. Symp. Math. Theory of Networks and Systems, Santa Monica, 1981.
L.R. Hunt, R. Su and G. Meyer, 'Global transformations
of nonlinear systems', IEEE Trans.Automat.Contr., 28 (1983)
A. Isidori, A.J. Krener, C. Gori-Giorgi and S. MOllaco,
'Nonlinear decoupling via feedback: A differential
geometric approach', IEEE Trans. Automat. Contr., 26
(1981), 331-345.
B. Jakubczyk and W. Respondek, 'On linearization of
control systems', Bull. Acad. Polon. Sci, Ser. Sci.
Math., 28(1980), 517-522.
[ 17]
[ 18]
[ 19 ]
[20 ]
[21]
[22]
[23 ]
[ 24]
27]
28]
29]
30]
31 ]
283
284
W. RESPONDEK
Wilson J. Rugh
Department of Electrical Engineering and Computer Science
The Johns Hopkins University
Baltimore, Maryland 21218
1. INTRODUCTION
A typical approach to nonlinear systems problems is to linearize about a nominal constant operating (equilibrium) point. Then linear design methods are used,
yielding a system whose linearization has the desired characteristics. So long as the
resulting system is operated in a region sufficiently close to the nominal operating
poin t, the linearization should accurately reflect the behavior of the nonlinear system. Of course, simulation is used to verify the effectiveness of this procedure.
When a system must be operated over a region that is not sufficiently close to a
single operating point, this process can be repeated at anumber of different operating
poin ts. The resulting linear designs then can be pieced together by so-called gain
scheduling. This is usually an ad-hoc process by which parameters in the design are
varied according to sel~cted system variables in a way th at yields the correct linearized design when the system is operated near one of the operating points.
Recently there has been rapid development of a systematic, analytical reformulation, and extension, of this practical approach, called, variously, design by extended
linearization, or design by pseudolinearization. The given nonlinear system is assumed
to have a continuous family of constant operating points, and then is represented by
the corresponding continuous family of linearizations. (Compare with the gainscheduling approach, where a finite number of distinct operating points, and the
corresponding finite number of linearizations, are the basis for the design.) Applying
methodologies for lin'ear design to this family of linear systems yields a continuous
family of linear designs for a specified objective. Finally, and this perhaps is the
major innovation, a nonlinear design is computed so that when the overall nonlinear
system is linearized about any operating point in the family, the desigIied lineariz!Ir
tion is obtained.
In effect, the nonlinear design provides scheduling of its own linearization family, according to the operating point, such that the linearized system haS the desired
characteristics. Relying on the accuracy of linearization, so long as the nonlinear system is operated in a region sufficiently close to the 'surface' in the state/input space
formed by the family of operating points, the behavior of the system should meet
the design objectives. Again, simulation is an important tool for verification, just as
285
M. Fliess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory, 285-309.
1986 by D. Reidel Publishing Company.
286
W.l. RUGH
2. LINEARlZATION FAMILIES
The mathematical developments in this paper involve real functions, and real
vector fields, that are assumed throughout to be continuously differentiable a finite typically small - number of times. As has become customary, precise accounting of
differentiability hypotheses will be abandoned for simplicity. The following
differentiation notation will be used. If f(x):R n -+ RP, then af lax denotes the
p Xn Jacobian matrix whose (i,j)-entry is the partial derivative afdaxj' For n =1,
a will be replaced by d, and an overdot will be used if the variable is time t. If
f(x,y):R m XR n -+ RP, then af lax denotes the pXm matrix with (i,j)-entry
afdaxi' and af lay is the p Xn matrix with (i,j)-entry afdaYj' Evaluation of a
derivative is indicated in the customary fashion, for example, af lax(xo,Yo).
The nonlinear state equations to be considered can be written in the form
i(t) =f(x(t),u(t))
y(t) =h(x(t),u(t))
(2.1)
where f(,):R n XR-+Rn , h(,):Rn XR-+R, f(O,O) =0, and h(O,O) =0.
(Existence and uniqueness issues will be ignored, though the standard smoothness
287
assumptions imply at least a local solution.) Thus the system has a single input, single output, and has a (nominal) constant operating point corresponding to zero
input, zero state, and zero output. Of course, the location at zero is simply a matter
of convenience. The linearization of the nonlinear system (2.1) about the constant
operating point will be written in the form
af (O,O)xo(t)
ax
+ af (O,O)uo(t)
Yo(t) =~(O,O)xo(t)
ax
+ ~(O,O)uo(t)
au
xo(t)
au
(2.2)
The assumption that (2.2) represents a controllable and observable linear system,
natural from the viewpoint of many applications, will occur repeatedly. For technical
reasons, it will be assumed also that af /ax(O,O) is invertible, though in many problems this is obviated easily by applying a preliminary (output or state) feedback.
To develop the notion of extended linearization, it is fundamental to consider
constant operating points that correspond to nonzero constant inputs. Such an
operating point, written [E, X(E), y(E)], is defined by
f(x(E),) =0,
y() =h(x(),)
(2.3)
With the invertibility assumption above, (2.3) defines a local family of constant
operating points, corresponding to sufficiently small in absolute value, by the implicit function theorem. Of course, in many situations there will exist a constant
operating point family for all in a larger interval (nonlocal family)' or, less often,
for all real E (global family). The function x( ):R -+ R n can be difficult to compute,
but in many practical situations, the constant operating point family is reasonably
apparent.
Linearizing (2.1) about each operating point in the family [, x(), y()] gives a
family of linear systems
xo(t)
af (x(),)xo(t)
ax
ah
yo(t) =a;-(X(E),E)Xo(t)
=
+ ~(x(),)uo(t)
au
ah
+ "fu{x(),)uo(t)
(2.4)
where, for fixed , uo(t) =u(t) - E, xo(t) =x(t) - X(E), and yo(t) =y(t) - y(E) .
. The basic information about the nonlinear system that is used in the extended linearization approach is contained in this family of linear systems. It is clear that this
information is accurate only i{ the nonlinear system is operated in a region
sufficiently close to the surface of constant operating points in the state/input space.
In order to proceed, it is necessary to extend the hypotheses on the linearization
at zero to the family in (2.4). This can be done explicitly, though simplicity of exposition encourages a local view. That is, by continuity arguments, the controllability,
o bservability, and invertibility hypotheses on (2.2) extend to (2.4), at least for
sufficiently small E. This view will be adopted here, though it is not difficult to work
with nonlocal hypotheses..
2.1 Example: The standard notions of accessibility and observability in nonlinear
system theory do not suffice for the controllability and observability hypotheses on
288
W.J.RUGH
x(t)
Xl(t)+U(t)
[2X2(t) + u(t)
X3( t)
x(t) =
-E
, y(E) =0
i,(tl ~[~
yo(t)
~ ~lX,(tl + [H~tl
[0
1]
x o( t)
E,
This example can be used to illustrate another hypothesis that will arise whenever the extended linearization method is applied to problems where only the output
of the system (2.1) is available. In this situation it is crucial that the constantoperating-point output 'observe' the constant-operating-point input, a property not
guaranteed by observability of the linearization family. Technically, from a local
viewpoint, this is accomplished by requiring that dyjd(O) =i' so that the inverse
function y-l(y( E)) =E exists in a neighborhood of E =0. Clearly this requirement is
violated by the example above. However, in general, the controllability and invertibility hypotheses do give dxjdE(O) =i' since, by differentiation in (2.3),
''(0,0) dx (0)
ax
dt
aj (0,0) =0
au
Thus for some component of x(), say the first, the inverse function Xl1(Xl(E)) =E
exists, at least in a neighborhood of f =0.
2.2 Example: Consider the n-dimensional bilinear state equation
x(t) =Ax(t)
+ Dx(t)u(t) + bu(t)
y(t) =cx(t)
Assuming that A is invertible, there is a family of constant operating points, at least
in a neighborhood of E =0, given by
x() =-(A
+ Dtt1bE,
y(E) =- c(A
+ DEtlbE
289
+ D f) x o( t) + [b
i o(t)
(A
YO(t)
ex o(t)
- D (A
+ Dft
le~
1b
f] U O(t)
controllability
and observability
]=n
A n-l
y(t)
00
=-
L; e(-AD)jA-1bfj+l
j=O
y(t) = L;
n=l.
J ... J hn(TV'"
dTn (2.5)
H(sv .. ,soo)
where
(2.6)
290
W.J. RUGH
00
H n( Sv ... ,Sn ) =
00
81 11
. .
=1, 2, ...
( 2.7)
Of course, convergence issues arise in the interpretation of (2.5) and (2.7). Since
technical details are deemphasized here, any convenient set of convergence assumptions can be adopted, for example, those in [3]. However, for concreteness in the
sequel, it will be assumed that each Hn( Sl1 ... ,sn) is a symmetric, proper, rational
function that is bounded on Re [Sl], ... ,Re [sn] ~ 0. Throughout the following
developmen t, the requisite con vergence properties of the various series will be
assumed without comment.
The representation (2.5), (2.6) describes an initially-relaxed system that has a
constant operating point for zero input and zero output, The linearization about this
operating point has the transfer function H 1( s). The appropriate definitions for
nonzero operating points and the corresponding linearizations can be derived as follows. (Related work for a different form of the Volterra series can be found in
[20].) It is natural to adopt a local viewpoint again, since (2.5) and (2.6) typically are
valid representations only for sufficiently-small input signals.
Suppose the input is the real, periodic signal
M
~ o:"e ikWI
u(t)=
k=M
t~O
(2.8)
where 0:0 =E', and 0:_10 is the complex conjugate of 0:" for k =1, ... ,M. Then,
assuming this input is sufficiently small, the steady-state response of (2.5) can be
written as
(2.9)
Considering the constant term in (2.9) when M =0, that is, when u(t) =E', leads to
the definition of a family of constant operating poin ts [E', y( E')] as
y(f)
00
(2.10)
n-=l
Now, the linearization of (2.5) is defined by isolating those terms in the response
(2.9) that would be present if (2.5) was, in fact, a linear system. The terms in (2.9.)
that are linear in 0:10, k ?'= 0, are given by, using symmetry of the transfer functions,
E
00
nE'n-1
n-=l
ik
wI
k~M
k;FO
This leads to the definition of the linearization of (2.6) about [f, y( E')] as the linear
system with transfer function
H.(s)
00
,0)
(2.11)
291
It can be shown that these concepts appropriately correspond to the usual notions of
constant operating point and linearization in terms of state equation representations,
at least in the case where (2.6) is bilinear realizable. [20]
Much as in the state-equation case, various assumptions will arise in the process
of using the operating point family (2.10) and linearization family (2.11) as surrogates for the nonlinear system (2.6). The boundedness assumption on the transfer
functions in (2.6) gives that the individual coefficients in the power series (2.10) are
well defined. In order that y() be invertible, it must be assumed that H 1(0) =1= 0
since dyjd(O) =H,(O) 1,=0 =H 1(0). That is, it must be assumed that the linear
subsystem has no zeros at 8 =0. Apparently it is difficult to explicitly state
hypotheses on (2.6) corresponding to controllability and observability of the linearization family, unless (2.6) is a finite sequence. Thus it will be assumed that H,( 8) is
a proper rational function of 8, with relatively-prime, constant-degree numerator and
denominator polynomials in 8, at least for in a neighborhood of =0. Since
H 1( 8) has no poles at zero, it follows that H,( 8) has no poles at zero, also for f in a
neigh borhood of zero, a property that plays a role similar to the invertibility assumption in the state-equation case.
2.4 Example: To illustrate the Volterrarseries setting, consider the system specified
by
n =1, 2, ...
where the sum is over all permutations 7r n of the integers 1 through n. An easy calculation shows that for If I < 1 this system has a constant operating point family
given by
f
=---
1-
Since
Hn(8,O} ...
}O)=~1
n.
E (8 (n-l)!
+ 1)
n 1
rn=O
n- m
=
8
1/{1- )
+1-
m=O
A:
=0
+1
E
00
j=O
(8
+ l)j
In this example the various assumptions mentioned above hold for the region of
definition of the operating-point and linearization families, that is, for If I < 1.
\
292
W.J. RUGH
3. SYSTEM INVERSION
The problem of system inversion has been studied from a number of
viewpoints since about 1950. For a system with input signal u(t) and corresponding
output signal y( t), a left (post) inverse should be such that if y( t) is the input signal
to the inverse system, then u( t) is the output signal. Because of the inherent
integration typically found in dynamical systems, this formulation is rather narrow.
This has been addressed by requiring that the 'inverse' system have response u( t) to
an input which is a certain time-derivative of y( t). Somewhat confusingly, the term
inverse system usually is retained without comment.
The nonlinear system inversion problem, as described above, can be posed in
terms of state-equation representations, or in terms of Volterra-series representations. Much of the early work was based on Volterra series, while more recent work
has focused on state equations. From either viewpoint, there are a number of
difficulties that arise. In the Volterra-series case, the form ulas for the kernels or
transfer functions for the inverse system become exceedingly complicated, and typically only the first three terms are reasonably handled. Also, effective implementation of the inverse system is hampered by the difficulty of computing realizations for
symmetric Volterra series, even series truncated to three terms. In terms of state
equations, severe smoothness assumptions, usually analyticity, are required. Even
then the 'invertibility manifold' is an open, dense subset of the :state/input space,
which might not contain the desired region of operation, and singularities in the
exact inverse system at the boundary of this set must be mollified in some fashion.
Therefore a complementary approach is suggested here for constructing what
might be called extended-linearization inverse systems. The basic idea is to assume
that the given nonlinear system has a family of constant operating points, and a
corresponding family of linearized systems. Then a nonlinear system is computed
that, for each constant operating point output of the given system, yields the constant operating point input of the given system, and has a linearization that is the
inverse linear system for the linearization of the given system. Therefore, relying on
the accuracy of the linearized representation in a neighborhood of any constant
operating point in the family, this new system should perform as an accurate inverse
for the given system.
For simplicity, only the case of zero relative order will be discussed here. This is
the case where the inverse system has output u(t) when the input is yet). The more
general case of (finite) positive order is treated in [26]. In the state equation case,
zero relative order corresponds to the assumption that the output map of the system
(2.1) is such that ah laue x, u) in not identically zero. Then, under the further
assumption that f('''), h(,.), and the input signal are analytic, there exists a local,
left (post) inverse for (2.1) on the open, dense subset M[ of R n X R defined by [14,
22, 28, 34]
M[ ={(x,u)
R n XR I ah/au(x,u)
o}
293
dy/df.(O) = and ah/au(O,O) = 0. These last two hypotheses guarantee that for
sufficiently-small It: I the inverse y-l(y()) =1' exists, and ah/aU(X(E),E) = 0,
respectively.
The goal is to choose, as an extended-linearization inverse system, a system of
the form
w(t) =c(z(t),v(t))
Ii I,
wb) =y-1b)
(3.2)
a(zb),-y) =0,
~:(zb)'/)= ~~(x(),)-
aj
ah
au-(x( 1'),1') a;(x( I' ),~)
ah
au-(x( 1') ,I')
b.
If~-lh)=Vb)
(3.3)
The right side of (3.3) can be recognized as the standard formulas for the inverse
linear system for (2.4), with an adjustment for the constant operating point specified
in (3.2).
To determine the extended-linearization inverse from these requirements, first
set zb) =x(y-1b)). It can be assumed that the first component of x() satisfies
dXdd(O) =
so that the inverses xl 1(.) and zll(-) =y(xll(-)) exist for
sufficiently-small arguments. Then it can be shown that
(3.4)
satisfy the conditions (3.2) and (3.3), although the calculations are not completely
straightforward. [26]
The smoothness hypotheses on the given system guarantee local existence and
uniqueness properties for the extended-linearization inverse system. These issues
aside, it is reasonable to expect that the extended-linearization inverse system will
w.J. RUGH
294
i(t)
X2(t)
[-Xl(t) - 2X2(t)
y(t)
2Xl(t) -
+ u(t)
X2(t)U(t) + u(t)
z( t)
w(t)
Z2( t) v( t) - 2z 1( t)
- Zl( t) - 2z2( t) +
()
1 - Z2 t
v( t) - 2z 1( t)
1 - Z2( t) .
af
a.;-(X(E),E)
[0-1
~(X(E),E)
[2
ax
-E],
~ ~ (x( E) ,E)
ah
-(X(
E),E)
au
[n
1
Thus, in (3.3),
Vb)
r(--y)
~ [_3 ry~
=
[-2
i(t)
.]
zkr)
Z2( t)
+ zl(t)Zz(t)
~3Z1(t) - 2z 2(t)
+ v(t)
295
An easy calculation shows that the given system has a linearization with stable
inverse for operating points with (' < 2, so this gives an idea of the region of opera;tion over which the extended-linearization inverse might be expected to perform
well. A comparison of the performance of the extended-linearization inverse system
and a linear inverse based on the system linearization about (' =0 is given in [26].
The performance of the extended-linearization inverse is good for small input variation, but degrades with larger variation, though not so rapidly as the performance of
the linear inverse based on the zero operating poin t.
From the viewpoint of Volterra series representations, the nonlinear system
inversion problem has been considered by a number of authors. [28], [3] A Volterra
series (2.5) has a left (post) inverse if and only if the linear subsystem, given by the
n =1 term in (2.5), has a stable inverse. Therefore, in terms of (2.6), an inverse
exists if and only if H l( s) has equal degree numerator and denominator polynomials, and all zeros with negative real parts. Note that (2.5), (2.6) describe an initially
relaxed system, and typically will be a valid representation only for sufficiently-small
input signals. The inverse Volterra series inherits similar features.
To develop the extended-linearization approach to system inversion from the
input-output viewpoint, the additional assumptions for Volterra series listed in Section 2 will be invoked. While it is possible to construct an extended-linearization
inverse exclusively in terms of Volterra series, a more often useful approach is suggested by recent work on pseudolinearization. [24] Using the invertibility of y('), the
conditions to be satisfied by the extended-linearization inverse system are easily
expressed in terms of the transfer function family He( s) and operating point family
[(', y( (')] for (2.6). Specifically, from consideration of the cascade connection of two
systems, the operating point and transfer function families for the extendedlinearization inverse must satisfy
G-y( s)
~--'---;-~
Hy-l(-y)( s)
(3.7)
wh)
y-1h)
(3.8)
G (s)
-y
sn
+ dh)
( 3.9)
where it follows from the assumptions that ao( ')') is nonzero, at least in a neighborhood of')' =0. Then it can be shown that the nonlinear state equation
.i(t)
zn(t)
an ( z( t), v( t))
wet)
c(z(t),v(t))
( 3.10)
296
W.J. RUGH
with
an(z,v)
=E
-aj_l( Zl l(zd)Zj
+ aO(Zll(zd)ZI(V)
j~
c(z,v)
j~
+J
v
ao(a)d(a) + coCa)
da
ao(a)
zky)
Z(')')
(3.11)
o
is an extended-linearization inverse system, that is, has transfer-function and
operating-point families that satisfy (3.7) and (3.8). Again, some of the calculations
involved in the verification are less than straightforward.
3.2 Example: Consider the system specified by
n=1,2,
sinh(x)
Pn xn
n~
(l/4)cosh()(s
(2s+1)
+ 4)
so that, from (3.7) and (3.8), the extended-linearization inverse system is specified
by
G.y(s)
_ 28(",2 + 1)-1/2
- I
_
s+4
8(')'2
+ 1)-1/2
z(')') =.:1.4
a(z,v) =-4z
c(z,v)
=-
+v
28(16z 2 + 1t1/2z
The interpretation of the extended-linearization inverse system from the inputoutput perspective is similar to that for the state-equation case. In particular, if the
steady-state output of the given system remains in a neighborhood of some constant
297
4. STATE OBSERVATION
The problem of observing (estimating) the current state of a nonlinear system
from the output signal is important in its own right, as well as a key component of
one approach to ou tpu t-feedback control. Existing approaches to the design of nonlinear observers can be described in terms of three broad categories. First there is
what might be called the Liapunov approach, where the form of the nonlinear
observer is chosen to facilitate the stability analysis for the resulting error equation.
[12, 18, 33] Second, there are restrictive classes of nonlinear systems where an
observer can be chosen to yield a linear error equation, with the result that linear
stability analysis can be applied. [13] Third, there has been some work on the application of linearizing transformations to the observer problem. [6, 19] However, it
seems fair to say that these approaches all suffer serious limitations, and represent
only first steps toward a theory of observer design.
The extended-linearization approach offers an alternative that is relatively simple technically, and can be applied, or at least tried, in very general situations.
(Perhaps it should be noted, however, that the approach cannot be applied to
observer construction for unforced systems.) The idea is to choose a nonlinear
observer so that the family of linearized error equations has eigenvalues that vary in
a specified way with respect to the constant operating point family of the given system. Again a local view is appropriate, and intuition suggests that the extendedlinearization approach should work well so long as the nonlinear system is operated
sufficiently near the constant operating point surface in the state/input space.
Consider the system (2.1)' where it is assumed that dy/d(O) ~ 0, and that the
linearization at zero is observable, with af /ax(O,O) invertible. In this situation the
observer is chosen to have the form
i(t)'
f(x(t),u(t))
+ g(y(t))
- g(y(t))
yet) =h(x(t),u(t))
( 4.1)
where x( t) and fie t) are the observer state and output, and g():R -+ R n , with
g(O) =0, is a nonlinear observer gain. Corresponding to the constant operating
point family for (2.1), this observer has a constant operating point family for
u(t) =, x(t) =x(), and fi(t) =y(t) =y(). Of main interest is the nonlinear
error equation for the system (2.1) and observer (4.1), which is given by
+ g(fi(t))
(4.2)
This error equation inherits the obvious constant operating point family, and a
straightforward calculation shows that the corresponding family of linearized error
298
W.J. RUGH
equations is
ddt [xo(t) - xo(t)]
[aaxf (x(),)
-#-(y())~(x(),)]
[xo(t) dy
ax
xo(t)]
(4.3)
The goal now is to choose the observer gain g(.) so that the eigenvalues of
(4.3) have the desired scheduling with respect to the operating point. That is, so that
the eigenvalues are specified functions of , at least in a neigh borhood of =0. It is
assumed here, and throughout the sequel, that the specified eigenvalues have nega,tive real parts and occur in conjugate pairs. A choice of scheduling that might be
appropriate in the absence of particular intuition is to fix the eigenvalues at specified
locations that are independent of . Such a choice means that the dynamics of the
error equation (4.2) should be similar near any operating point in the family.
The first step in solving this problem is to mimic the standard linear observer
theory, and consider the (transposed) condition
det
] =).,n + O'l()).,n-l+
rafT
ah T
lH
- ~(x(),) + ~(x(),)qT()
g(y)
Jo
q()-L()
d
( 4.5)
It is not difficult to apply the extended-linearization idea to the case of reduceddimension observers, again by simply following the well-known linear theory. [4]
However, it should be noted that a variable change typically is required, and this can
significantly complicate the construction.
The problem of where to place, and/or how to schedule, the eigenvalues is
difficult to address in general terms. Indeed, the determination of the observer gain
does not depend on the particular objective behind the choice of q (). Rather than
basing this choice on eigenvalue considerations, a parameterized, steady-state Kalman filter\ng problem can be used to determine q () since the solution of the algebraic Ricca'ti equation is analytic in the coefficient matrices. [1] Such an approach is
loosely related to the extended Kalman filter, though the loose relation is restricted
to a neighborhood of the' family of constant operating points.
4.1 Remark: Unlike system inversion, the extended-linearization observer construction yields a unique solution for the single-output case considered here. However,
the solution will not be unique for the multi-output case. Also, it is interesting to
note that the observer gain in (4.1) acts separately on the system and observer
299
outputs, and not on the output error. It is not hard to check that if the gain term
9 (y( t)- y( t)) is used in the observer, it is impossible to obtain the eigenvalue
scheduling in the resulting linearized error equation.
4.2 Example: The standard textbook problem of balancing an inverted pendulum
on a motorized cart leads to the following system description when a particularly con-
voni,n t
00,",( ,)
2 - cos2XI( t)
yet) = XI(t)
Here y( t) is the angle of the pendulum from the vertical, and u( t) is the force on
the cart. In this case it is easy to check that there is a family of constant operating
points given by
x( f) = [tan -ole f)] ,
-00<10<00
y( f) = tan-l( f)
and that the corresponding family of linearizations is
X6(t)
=[(102:1)3/2
:]X6(t)
210 2+1
Y6(t) = [1
0]X6(t)
5. STA'IE FEEDBACK
The difficulty of the feedback control problem for nonlinear systems is well
known. One aspect of this difficulty is the lack of meaningful design criteria, other
than formulation of an optimal control problem. Another, not unrelated, aspect is
the inherent complexity of the relation between open and closed-loop systems under
nonlinear feedbaek. These are two important reasons for the interest generated by
the recently-developed design approach involving linearizing transformations. [7, 10,
15, 16, 29, 31] If the problem can be made linear by preliminary state feedback and a
variable change, then standard linear criteria can be used, along with accompanying
linear design methods. However, there are a couple of difficulties with design by
linearization that can impede its application. First, the hypotheses required are
somewhat restrictive - in fact they are nongeneric in a precise technical sense. [32]
Second, the coordinate changes involved in the approach scra.mble together physical
W.J. RUGH
300
variables of the system, tend to mask implicit constraints on the model, and thus
tend to damage engineering intuition about system features. It is appropriate to keep
these issues in mind through the following development of the extendedlinearization approach.
( 5.1)
( 5.2)
-+
R satisfies
( 5.3)
j(x(t:)'t:) =0
t: =/3 - k(x(t:))
/3
=0). Then
(5.5)
xc(t)
(5.4)
301
where the deviation variables are xs(t) =x(t) - x(), ws(t) =w(t) - {3.
The idea behind the extended-linearization approach to feedback control is to
choose the family of linear feedback gains, ak lax (x( )), such that at each value of
the family of linear systems in (5.5) meets a specified linear design objective. While
various design objectives can be considered, attention here will be focused on the
eigenvalue placement/scheduling problem, which probably is the most basic. That is,
the objective is that the eigenvalues of (5.5) have desired values for each , at least
in a neighborhood of =0. This objective corresponds to specifying the basic
dynamical behavior of the closed-loop system in a neighborhood of any operating
point in the family.
Similar to the observer design, this objective translates into choosing a 1 X n
vector q() such that
det [AI- !!.L(x(),)
ax
(5.6)
where the right side is the desired characteristic polynomial for the closed-loop
linearization family. Again, (5.6) can be solved for q() in several ways, though the
Bass-Gura formula provides a convenient general solution.
In order to find a nonlinear gain k (x) that satisfies
~~(x())=q()
(5.7)
k(x)
q()
dd~() d +
C
lh
j=2
IS
qj(Xl1(Xl))[Xj- Xj(Xl1(Xl))]
( 5.8)
5.2 Remark: Again the issue arises of where to place or how to schedule the eigenvalues of the closed-loop linearization. One approach is to determine q () in (5.7)
via linear-quadratic optimal control problem for the linearization family. [1] As in
the observer case, this is possible because the algebraic Riccati equation is analytic in
the coefficient matrices.
6. OU'IPUT FEEDBACK.
Most of the recen t results on state feedback control of nonlinear systems have
not yet been extended to the output feedback control problem, and, in fact, appear
to have limited potential in this regard. The fact that extended linearization offers a
systematic design procedure for this very important problem is possibly tne major
contribution of the approach.
Following the linear theory, the basic idea is to combine the observer and state
feedback results of the preceding two sections. It is assumed that the system (2.1) is
such that the linearization at zero is controllable and observable, aj /ax(O,O) is
invertible, and dy/d(O) =1= 0. Using the nonlinear observer to generate the
302
w.J. RUGH
estimated
i(t) =J(x(t),u(t))
i(t) =J(i(t),u(t))
+ g(y(t))
- g(y(t))
y(t) =h(x(t),u(t))
y( t) =h(i( t),u( t))
u(t) =w(t) - k(i(t))
(6.1 )
( 6.2)
at least for f (and (3) in a neigh borhood of zero, under the standing assumption of
nonzero eigenvalues for the closed-loop linearization. The closed-loop linearization
family, in terms of the (customary) deviation variables xo(t) =x( t) - X( f),
xo(t) =x(t) - i(t), wo(t) =w(t) - (3, and Yo(t) =y(t) - Y(f), can be written in
the block-partitioned form
aj
aj
ak
-(x( f) ,f)- -(X( f),f)~X( f))
ax
au
ax
+
y,( t)
[ :~(X(f),f)l
~ [:~
(x( ,),')
w o( t)
')"}(t)
(6.3)
Following the standard linear analysis, it is clear that the eigenvalues of this family
are given by the eigenvalues of the linearized state-feedback system, and the eigenvalues of the linearized observer-error equation. Since both sets of eigenvalues can
be specified, by virtue of the choices of gains k() and g( .), the closed loop
303
This form for H.c1( s) can be derived from state-equation representations of the su bsystems by linearizing, computing the subsystem transfer functions, and then computing the closed-loop transfer function. Alternatively, from Volterra-series
representations of the subsystems, a 'harmonic balance' approach similar to that in
Section 2 can be used.
/3
W.J.RUGH
304
NH(<",S)
DH(<",s)
H ()
s = -:-----:----,,....
( 6.5)
where NH(<",s) and DH(<",s) are degree-n polynomials in s, with NH(<",O) and
DH(<",O) nonzero in a neighborhood of t =0. Then the following form can be
chosen for the compensator transfer-function families:
( 6.6)
Here the polynomials Ndt,s), N~t,s), and C(s) are degree-(n-l) polynomials in
s, corresponding to the reduced-dimension observer design. For convenience at no
loss of generality, the compensator families have identical denominators that are
independen t of t, and the fact that y( t) is known has been used to write the
numerator of Fy()(s) as a function of t, rather than y(t).
In terms of these definitions, the closed-loop linearization transfer function
(6.4) is given by
( 6.7)
The goal now is to show that Ndt,s) and N~t,s) can be chosen such that the
denominator polynomial of HcI(s) takes the form C(s)D(t,s). It is left understood
that C( s) has negative-real-part roots that occur in conjugate pairs, but is otherwise
arbitrary. The degree-q polynomial D(t,s) represents the desired closed-loop
characteristic polynomial. Then the closed-loop transfer function family will be
H
cl()
NH(t,s)
s - --:::'7------,-'D(t,s)
( 6.8)
C(s)D(t,s) =C(s)DH(t,s)
+ Nd.,t,s)DH(t,s) + NF(t,s)NH(t,s)
(6.9)
for given polynomials O(s), D(t,s), NH(t,s), and DH(t,s) now follows from standard linear system results. [9, 17] Specifically, the linear coefficient equations
corresponding to (6.9) have a unique solution at <" =0, and by continuity arguments,
therefore have a solution for all t in a neighborhood of t =0. This solution
specifies Gf(s) directly. Using the invertibility of y(t), NF(t,s) can be rewritten as a
polynomial in s with coefficients that are functions of y( t), so that FY(f)( s) is
specified.
In order to determine nonlinear compensators corresponding to these linearizartion families, the appropriate constant operating point families must be specified.
Comparison of (2.10) and (2.11) shows that consistent choices are provided by
g(t)
Jo Gq(O)
y( )
du,
f(y(t))
J
0
Fq(O) du
(6.10)
305
Gs)
(6.11)
sn+ an _ 18 n 1++ ao
with
g(c)
ao
dO'
(6.12)
Then it is not difficult to show that a system of the form (3.10) with linear dynamics
an(z,v)
n
=
aj_lzj
~ j=4
+v
(6.13)
c(z,v)
=-E gj_l(aOzl)Zjj=1
go(aozd v +g(v)
ao
( 6.14)
yields the linearization and operating point families in (6.11) and (6.12).
6.2 Example: The constant operating point and linearization families for the
inverted pendulum in Example 4.2 are described by
H8) =
82 -
h(c)
(c 2+1)h(c)
where
h (c)
_(c 2 +1)1/2
2c 2
+1
Writing
Nd,f.,8) =gl(C)8
+ go(c::)
0(8)=8+5
D(c::,8) =S2+ 48 +4
gives (6.9) as
(8 + 5)(8 2 + 4s + 4) =(8 + 5)(8 2 - (c 2+1)h(c::)) + (gl(C::)8 + gO(C::))(8 2 - (c:: 2+1)h(c::))
+ (j 1( c::) 8 + f o( ))
Equating coefficients of like powers of
go(C::) =4,
gl(C::) =0
h (c::)
gives
306
W.J. RUGH
G,(8)
_4_
_
F y (,) ( 8 ) -
8+5
f o( tany) - 5f l( tany)
--'---'--::---'--'--
8+5
+ f l( tany)
Clearly the family G,( 8) can be implemented by any realization of the linear-system
transfer function 4/( 8 + 5). For the compensator family Fy( ,)( 8), the corresponding
operating-point family given by (6.10) is
f(y) =
Jo -f
o( tan
5
y
0" )
dO"
it =-5z + v
1
w =[fo(tan5z) - 5fl(tan5z)][z - -v]
5
+ f(v)
7. CONCLUDING REMARKS
In order to implement the extended-linearization approach for problems involving all but the simplest systems, it is necessary to use the symbolic computation capabilities of programs such as MACSYMA and SMP. Indeed, a MACSYMA program
has been written to implement the state-feedback gain calculation described in Section 5. [1] The program prompts the user for the system equations, the nominal constan t operating point (not necessarily at zero), and the desired eigenvalue schedule.
It then computes polynomial approximations (with user-specified degree) to the family of constant operating points and corresponding linearization family, and then the
nonlinear feedback gain. Of course, the same basic program can be used for determining the nonlinear observer gain in Section 4. A portion of current work is
devoted to exploring the effectiveness of the approach for more complex problems
than the inverted pendulum discussed in [4]. Also, it should be noted that an application of the pseudolinearization approach to control of an induction motor is
reported in [21].
The results described herein for state observation and feedback control have
been generalized to the case of multi-input systems in [5]. This generalization is
rather difficult because the nonlinear gains are determined by total diff eren tial equar
tions in more than a single variable, and it must be shown that the Frobenius conditions for integrability can be satisfied. These results are presently restricted to anar
lytic systems because of the proof technique. The pseudolinearization theory also
307
has been extended to the multi-input case, including a treatment of the eigenvalue
placement problem using precompensators. [8, 24]
An important generalization of the nonlinear feedback work discussed here
involves the use of more general feedback of the form u (t) = k (x (t), w( t)) in place
of u(t) =w(t) - k(x(t)) . It is shown in [24] that this so-called implicit feedback
can yield simpler control laws. Also, preliminary work by W . Baumann on
observer/feedback approaches shows that results can be obtained under weak;r
hypotheses when more general feedback is permitted.
Finally, though the engineering intuition behind the extended-linearization
approach seems reasonable, an important topic for further work is more quantitative
analysis of the designs resulting from the method. For example, in the state feedback control problem, estimates of the regions of closed-loop stability with the nonlinear control and with the linear control based on the nominal operating point would
be interesting to compare. Of course, typically the bounds obtained from this sort of
analysis are very conservative.
8. REFERENCES
[1]
[2]
[3]
[4]
[5]
[6]
D. Bestle, M. Zeitz, "Canonical Form Observer Design for Nonlinear TimeVarying Systems," International Journal of Control, Vol. 38, pp. 419-431, 1983.
[7]
[8]
[9]
C. Chen, Linear System Theory and Design, Holt, Rinehart, and Winston, New
York, 1984.
308
W.J. RUGH
[11] P. Crouch, "Realisation Theory for Dynamical Systems," Proceedings of the Institute of Electrical Engineers, Vol. 126, pp. 605-615, 1979.
[12] I. Derese, P. Stevens, E. Noldus, "Observers for Bilinear Systems with
Bounded Input," International Journal of Systems Science, Vol. 10, pp. 649-668,
1979.
[13] S. Hara, K. Furuta, "Minimal Order State Observers for Bilinear Systems,"
International Journal of Control, Vol. 24, pp. 705-718, 1976,
[14] R. Hirschorn, "Invertibility of Nonlinear Control Systems," SIAM Journal on
Control and Optimization, Vol. 17, pp. 289-297, 1979.
[15] L. Hunt, R. Su, G. Meyer, "Global Transformation of Nonlinear Systems,"
IEEE Transactions on Automatic Control, Vol. AC-28, pp. 24-31, 1983.
[16] B. Jakubczyk, W. Respondek, "On Linearization of Control Systems," Bulletin
of the Polish Academy of Science, Science and Mathematics Series, Vol. 28, pp.
517-522, 1980.
[17] T. Kailath, Linear Systems, Prentice Hall Inc., Englewood Cliffs, New Jersey,
1980.
[18] S. Kou, D. Elliott, T. Tarn, "Exponential Observers for Nonlinear Dynamic
Systems," Information and Control, Vol. 29, pp. 206-216, 1975.
[19] A. Krener, W. Respondek, "Nonlinear Observers with Linearizable Dynamics,"
SIAM Journal on Control and Optimization, to appear, 1985.
[20] R. Lejeune, W. Rugh, "Linearization of Nonlinear Systems About Constant
Operating Points," IEEE Transactions on Automatic Control, Vol. AC-30, pp.
804-808, 1985.
[21] P. Mouyon, C. Champetier, C. Reboulet, "Application d'une Nouvelle Methode
de Commande des Systemes Non Lineaires - la Pseudo-Linearisation - a un
Exemple Industriel," Congres INRIA, Nice, France, 1984.
[22] H. Nijmeijer, "Invertibility of Affine Nc:mlinear,Control Systems: A Geometric
Approach," Systems & Control Letters, Vol. 2, pp. 163-168, 1982.
[23] C. Reboulet, C. Champetier, "A New Method for Linearizing Nonlinear Systems: The Pseudolinearization," International Journal of Control, Vol. 40, pp.
631-638, 1984.
[24] C. Reboulet, P. Mouyon, C. Champetier, "Recent Results on the Local Linearization of Nonlinear Systems," in this volume.
[25] W. Rugh, "Design of Nonlinear Compensators for Nonlinear Systems by an
Extended Linearization Technique," Proceedings of the Twenty-third IEEE Conference on Decision and Control, Las Vegas, NV, pp. 69-73, 1984.
[26] W. Rugh, "An Extended Linearization Approach to Nonlinear System'Inversion," Technical Report No. JHU IEECS-85 106, The Johns Hopkins University,
Baltimore, Maryland, 1985.
[28] M. Schetzen, "Theory of pth_Order Inverses of Nonlinear Systems," IEEE Transactions on Circuits and Systems, Vol CAS-23, pp. 285-291, 1976.
309
[28] S. Singh, "A Modified Algorithm for Invertibility in Nonlinear Systems," IEEE
Transactons on Automatc Control, Vol. AC-26, pp. 595-598, 1981.
[29] R. Sommer, "Control Design for Multivariable Nonlinear Time-Varying Systems," Internatonal Journal of Control, Vol. 31, pp. 883-891, 1980.
[30] E. Son tag, "An Introduction to the Stabilization Problem for Parameterized
Families of Linear Systems," in Lnear Algebra and ts Role in Systems Theory, (B.
Datta, ed.) AMS Publications, 1985.
[31] R. Su, "On the Linear Equivalents of Nonlinear Systems", Systems & Control
Letters, Vol. 2, pp. 48-52, 1982.
[32] K. Tchon, "On Some Applications of Transversality Theory to System Theory,"
Systems & Control Letters, Vol. 4, pp. 149-155, 1984.
[33] F. Thau, "Observing the State of Nonlinear Dynamic Systems," International
Journal of Control, Vol. 17, pp. 471-479, 1973.
[34] J. Tsinias, N. Kalouptsidis, "Invertibility of Nonlinear Analytic Single-Input Systems," IEEE Transactions on Automatic Control, Vol. AC-28, pp. 931-933, 1983.
1. INTRODUCTION
x=
f(x.u)
(1)
31.
M. Fliess and M. Hazewinkel (eds.). Algebraic and Geometric Methods in Nonlinear Control Theory, 311-322.
1986
c.
312
&on
(o.p.)-
REBOULET ET AL.
is
locally
M:
6x.= x.- x~
111
be
6><. =
approximated
by
i= 1 ..... n
Note that local coordinates have been needed to define the l.t.m.
However. under weak conditions (see below). an intrinsic description of
the local behaviour can be achieved. Futhermore. in the control design
using local linearization. the l.t.m. controllability must be assumed.
So. let us introduce the following definition:
Definition 1: The system (1) is said to be locally controllable if
linear tangent models are controllable.
its
Under this hypothesis. it can be easily shown that the set of o.p. & is
a submanifold of MxR m. In the sequel. we will assume that the projection
of & on H is a regular embedding. In this case. 'lJ is a submanifold of M
and the following result holds:
Proposition 1 (cf [6]):
Consider a locally controllable nonlinear
system (1) s.t. the projection of & on M is a regular embedding. Then.
there exists in the neighborhood ofCO' an operator field F on M and
vector fields Gi i=1 ..... m on M such that the l.t.m. can be written at
Xo as:
3. PSEUDOlINEARIZATION
of
finding
nonlinear
transformations
313
-change of states z=T(x), state feedback v=S(x,u)- such that the l.t.m.
in the (z,v)-coordinates does not depend on the o.p. anymore. Then, the
system is said to be pseudolinearized. In the theorem below are recalled
necessary and sufficient conditions of pseudolinearizability. Some
practical considerations will be given later.
pseudo
A locally controllable nonlinear. system is
Theorem 1:
linearizable iff:
(i) the controllability indices (v 1 ' . . . ,v m) of its l.t.m. are
constant on '\)
(ii) the distributions of vector fields:
v.-2
tJ..
{ G,
FG,
G } n T,\)
most
Indeed, in this case, rk tJ. i ~ 2. As dim'\) < 2, the c.ondition (ii) of the
the theorem 1 is automatically satisfied.
3.2. Pseudolinearization via Dynamic Precompensation [5]
When the condition (ii) of the theorem does not hold, a dy~amic
precompensator can be used for the pseudolinearization. Indeed, one can
show that the rank of the distributions tJ. i is given on
by:
rk tJ..
1,
to
the
following
Proposition 2:
A locally controllable nonlinear system is pseudolinearizable if the controllability indices of its l.t.m. are constant
along,\) and satisfy:
(2 )
C. REBOULET ET AL.
314
Xl =
X2
X3
U1 X l
k
+ T U2
(gU 2x l - ax 32 )
u1
1
T
: j [:::j [~:!
o -2Kax~
6x 3
o
k
T
K9X l
.
.
0
at the equlllbrium,
one out of the quantities xl'
x 02 ' x 03 ' u02
vanishes, so do the three others. The l.t.m. structure changes at this
point. It is why we restrict the design control to the operating set
[,11 =
x 3 0 =0 }. Let us define:
If
e\ {
ll
~
....L
kx2
i tplj!
k
At every point of
11,
the local structure corresponds to
the
controllability indices (v 1 ,v 2 ) = (1,2). From corollary 1, the system is
pseudolinearizable. Let
us
now
examine
how
to
compute
the
pseudolinearizing transformations. In the new coordinates, the l.t.m.
must be constant, for example in the canonical form:
315
leads
x2
k
- a.
t
That
(3 )
defined on '\Y*
a.
be
z 1=T 1 (x) on R3 s. t.
= dT 1'\Y
I*
a.
(4 )
2
tp
iii
Then:
a.
But
'\Y*
c. REBOULET ET AL.
316
= ..JL.
k 11
cp til
02
x 3
Hence:
11
cp
KYIl
02
x3
dX 3
dT 1nY
with
T1 = Ln(cpl
+ ---
KYIl
x3
In fact, any function of the form g(T 1 (xll can be chosen. We shall take:
z,
1
T1 (x I
2:,
z2
Then:
kUX 2
cp
-1 I
tl
- ..JL.
k 11
x2
with:
=
2
Y
points
so
that,
state
z3
2:3 = v2
4. PSEUDOCOHPENSATION
317
e,
along
that is, by finding functions Ti on M such that ai=dTi1e
Generally. an integrating factor ~ is needed and we must take:
x1
)(2
COS)(1)
length.
318
C. REBOULET ET AL.
with
---g-- ( 43 1 _ aml cos 2Xl ,-I
COSX ,
= - a cosx,
4 1
aml cos x,
41
one
~=,.
3a ~ Ln !secx ,
p2+~p+a.
~ tgx ,
a
( ~~ secx , - ml cosx ,
Note that in this case, the closed-loop static gain depends on the o.p ..
since the transfer function of the l.t.m. is given by:
o
- a cosx,
~
6u
2 0
41/3 - amI cos x,
+ ~p +
a cos x,
2
nonlinear
"implicit"
this
is
partially
measured,
the
pole
assignment
can
be
319
f(x,u)
h(x)
Assume that the system is locally controllable and observable on & .Then
it is easy to show that the l.t.m . can be parametrized almost everywhere
either in terms of the control values or the output values at the
equilibrium. Suppose that the input parametrization can be chosen. At
every point of & , a linear compensator Quo(s) can be constructed for
assigning the pole of the l.t.m. (cf figure). The integration in a
nonlinear controller -called a pseudocompensator- is carried out from a
particular state realization of Quo. To simplify, suppose that Quo is
proper:
s
m-1
+
6y
- ao(u ) 6z 1 - .. . - am_ 1 (u O) 6z m
bO(U O) 6z 1 + .. + bm_ 1 (U o ) 6z m
Among the infinity of nonlinear systems having Quo
function, the following can be chosen:
21
z2
2
m-l =
2
m
u
zm
where qI and
zl) - a 1 (qI(z1)) z2
qI ( z 1 ) + b 1 (qI(zl)) z2
1jJ(
satisfy:
as
l.t . m.
- a m_ 1 (qI(zl)) zm
+ bm_ 1 (qI(zl)) zm
transfer
C. REBOULET ET AL.
320
=
For output parametrization, the integration is performed in a similar
way.
Finally, let us indicate that the method remains available when an
unknown constant (or slowly varying) perturbation enters the system. In
this case, the pseudolinearization -as a number of linearizing methodsis unefficient, due to the fact that the o.p. is unknown.
Example:
Consider an object moving in an unknown
simplified equations of motion can be written as:
X,
x2
X2
1
( f
M
x1
stream
V.
The
+ u )
mass,
~he
=
s ( Ms +
the
2~
s
is calculated so
assigned to:
that
the
closed-loop
characteristic
polynomial
is
a (u)
321
MA2 - 2dJkiU\
b(u) = MA 1
c(u)
MAO
dIu) = A3 - 2JkiU\ 1M
The
integration
pseudocompensator:
method
Zl
z2
Z2
-d(CZ 1 )
a(cz 1 )
presented
z2
cZ l
here
yields
the
following
b - ad(cz 1 )) z2
5. CONCLUSION
[1]
[2]
[3]
[4]
[5]
322
[6]
[7]
[8]
[9]
[10]
[11]
[12]
[13]
C. REB()ULET ET AL.
CHAMPETIER C., MOUYON P., REBOULET C. 'Pseudolinearization of multiinput nonlinear systems', Proceedings of the 23rd IEEE CDC, Las
Vegas, 1984
HUNT L., SU R., MEYER G.
'Global transformations of nonlinear
systems', IEEE Trans. Aut. Control, 28, 1983, pp24-31.
JAKUBCYZK B., RESPONDEK W. 'On linearization of control systems',
Bull. Acad. Polon. Sci., Ser. Sci. Math. Astronom. Phys., 28,1980,
pp 511-522
KRENER A.J. 'Approximate linearization by state feedback and
coordinate change' , Systems Control Letters, 5, 1984, pp181-185.
MOUYON P., CHAMPETIER C., REBOULET C. 'Application d'une nouvelle
methode de commande des systemes non linea ires -la
pseudolinearisation- a un exemple industriel', Congres INRIA, Nice 1984.
NIJMEIJER H. 'State sPace equivalence of an affine nonlinear system
with outputs to a minimal linear system', Int. J. Control, 39,
1984, pp919-922.
RUGH W. 'Design of nonlinear compensators for nonlinear systems by
extended linearization technique', Proceedings of the 23rd IEEE
CDC, Las Vegas, 1984
RUGH W. 'An extended linearization approach to nonlinear system
inversion', this book.
Optimal Control
H. J. Sussmann 1
Department of Mathematics
Rutgers University
New Brunswick, NJ 08903, U.S.A.
1. Introduction
The theory of envelopes and conjugate points constitutes an important
chapter of the classical Calculus of Variations.
In Optimal Control theory, it is
often desirable to get better information about optimal trajectories than that
provided by the Pontryagin Maximum Principle.
To do this, it is useful to have
necessary conditions for optimality that exclude, for instance, bang-oang
trajectories that have too many switchings, even if those trajectories satisfy the
Maximum Principle. It turns out that, in a number of cases, this can be done by
means of a suitable generalization of envelope theory. The purpose of this note is
to outline one such generalization and to illustrate its use by proving theorems on
the structure of optimal bang-bang trajectories for certain problems in two and
three dimensions.
The paper is organized as follows. We begin by outlining the theory without
giving any technical details. In particular, we give a general definition of envelopes
for control systems, and we show that, for the classical Calculus of Variations, it
reduces to the usual concept. After this we proceed directly to some applications,
including the theorems on bang-bang extremals referred to above. We then give a
simple but nonrigorous derivation of the fundamental formula on which the theory
rests.
A complete rigorous development of the theory (including, in particular,
proofs that all the examples discussed here satisfy the necessary technical
conditions) will be given in a subsequent paper.
x'
=f (x,u),
x e M,
u e U,
M. Riess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory, 325-346.
1986 by D. Reidel Publishing Company.
(1)
326
H. J. SUSSMANN
J (r)
T+(f)
=J _
T (f)
L (X (t) , u (t) ) dt ,
(2)
where r denotes an ad mi ss; bl epa; r of the system (1) . and T - (r) , T + (r) are the
initial and terminal times of r (i.e. r is a pair (x(),u(, that consists of a
trajectory x ( )
and a corresponding control u ( ), both of them defined on a
compact interval Intv (r)
=[ T -
aH
ax
(3)
(4)
(If M is just an open subset of Rn, then x and ffx,uJ should be thought of as
column vectors, X as a row vector, and "<',' >" as the ordinary product of a row
and a column; more generally, in the manifold case, one should think of X ( ) as a
field of covectors along x (.) , i.e. a mapping that assigns, to each t , a covector
X(t) at x{t), and of "<.,.>" as the ordinary pairing between vectors and covectors.)
A multiplier A is nontrivial if the pair A (t) = (X (t) ,1') is nonzero for some
(and hence every) t E Intv (r). The multiplier A is minimizing if, for almost every
t E Intv (r) , the equality
H(X(t),v,x(t),u(t)
= Min
{H(X(t),I',x(t),v) : v
(5)
U} =c
[a,b]}
of
327
We let J ,T ,T ,x ,x
l'
l'
l'
l'
+
l'
denote, respectively, the cost, initial and terminal times, and initial and terminal
states of r . (The initial and terminal states of an admissible pair r = (x(), u('))
l'
l'
We let ~
l'
l'
l'
l'
l'
,~
+
l'
l'
l'
of
respect to 1', and "I" to denote differentiation with respect to t. Also, we let
l'
l'
'A = 'A (T ) ,'A = 'A (T ). Then, if all these objects depend "sufficiently smoothly"
l'
l'
l'
l'
v 'D J =<'A
1'1'1'
,~
1'1'
1'1'
1'1'
(6)
l'
l'
2. the trajectories
l'
dJ
l'
+
+
v'-=<'A ,~ >-<>.. ,~ >.
d1'
l'
l'
l'
l'
3. Envelopes
Let F = {r : l'
l'
(7)
[a,b]}
l'
l'
l'
l'
vT '
328
H. J. SUSSMANN
t T' t T
(i) the
'
T
normalized, so that v T = 1. An initial arc of F is, roughly, an arc that joins the
initial
points of the
r.
arcs
However, we want to
allow this
arc
to
be
= xT
= (0- (.)
for every T
,v -
(.
[a,b] . We will
is a trajectory
pair of the family F. The interval [a - ,b -] is the domain of the initial admissible
pair Il.
J (Il ) = J
b
_ L (0
(c de
(c), v
(8)
where we have used the chain rule to conclude that D 0- (T) D 0- (0- (T
=~- .
T O T
One can define in a similar way what is meant by a terminal arc of the
one-parameter family F, a control terminal are, and an admissible terminal pair.
We use the obvious notations 0 + , 0 + , v + ,Il+ , and it is clear that a formula exactly
analogous to (8) holds for the cost of an admissible terminal pair.
If we combine (8) and its analogue for terminal arcs with (6) we get the
+
following identity relating the costs along r , r ,Il and Il , if the family F consists
a
J(Il+) -J(Il-)
=Ja
+
T
(9)
i.e.
(10)
329
+
T
+
T
(11)
,r )dT.
T T
(12)
(13)
We now list a number of cases where the generalized envelope condition is
easily verified:
points of the
In this case we
can take a' = b' , and let a +, a be constant functions. Since every
point arc is trivially a trajectory corresponding to some control, the
existence of 11+ and 11
is trivial.
, trivially
vanishes as well.
2. F is a fixed-endpoint equal-time family. The reasoning here is as in
the previous case, except that the vanishing of R 2 now follows from
the fact that
a-13 is a constant.
+
T
(14)
and
330
H.J. SUSSMANN
+
+
+
H("A. ,1,0 (T),V
T
where (i) we let 0- (T) = X
(r
,(i i)
+
+
+
=H("A. ,1,0 (T),U (T
T
T T
),
v - (. )
(15)
T ~ X
T ~
+
T
is a pOInt
for an appropriate
If F satisfies the GEC, then (13) holds, and we can conclude that
(16)
and
T2=ra*A+.
(Often A-is
just a pOint trajectory.) If [a,b] is the interval on which A-is defined, then one
tries to construct extremals r
for each l' e [a,b] , in such a way that, if
l'
A-=(o-(')
0+ (T)
,v-( , then r
l'
obtains a new trajectory T 2 which has the same initial and terminal pOints as T,.
Moreover, this new trajectory has exactly the same cost as T . This implies that
the new trajectory is optimal if T, is. If one can then prove that T 2 cannot be
optimal, it will follow that T
point p to a point q. (The time interval [a,{3] is not specified in advance.) Assume
that the Lagrangian L is smooth and the Hessian matrix of L with respect to the x'
variables is nondegenerate at every point (x, x'). Then we can regard this situation
331
aL
au
(17)
r 'I'
1.
'I'
r 'I' =
(x
'I'
ro=r,
a.
8 (0) = 13,
b.
the curve S :
'I' -
'I'
(8('1')) is tangent to
'I'
(0) at
'I'
(8('1')) for
each '1',
c.
either
+
+
ii. S (,.)
Ei
q , but r
'I'
for '1'<0, or
is not tangent to
at q.
together with the unique corresponding control. and we can let A-be the constant
trajectory with value p. The GEC is then trivially satisfied. (The existence of
= v + ('I')
If S+ ('1') does not lie on r for '1'<0, it then follows that the given extremal
is not minimizing. Indeed, (16) implies that
H. J. SUSSMANN
332
(18)
If
r -e *l:.. +
r -e *l:.. +
would have to be an extremal. This, however, is impossible, because the EulerLagrange equations, together with the nondegeneracy of the Hessian matrix of L,
imply that an extremal is uniquely determined, locally, by the position and velocity at
a given time. In particular, since
r -e *l:.. +
and
q and are tangent there, it would follow that they coincide, contradicting the
r .,.
for .,.<0.
at q, then the
generalized envelope condition is trivially satisfied. This does not yet suffice to
prove that r is not optimal. However, if r' is any extremal defined on an interval
[a,j3'] with 13'>13, such that the restriction of r' to [a,j3] equals r, then r'
cannot be optimal. .Indeed, if r' was optimal, then all the trajectories r' .,. ,obtained
by following
r .,.
up to q, and then
r',
exactly the same cost as r'. Hence they would all be extremals,
impossible since they have a discontinous first derivative at q.
which is
4. Conjugate points
Conjugate points are, roughly, points for which the "first order conditions"
needed to construct envelopes are satisfied. It then follows, modulo suitable
technical hypotheses, that envelopes exist whenever p and q are sufficiently
"nondegenerate" conjugate pOints. We illustrate these ideas by referring again to the
classical Calculus of Variations situation considered in the preceding section.
Let r = (x ( 0) ,u ( 0) ) be a normal null-minimizing extremal that goes from p
to q and is defined on the interval [a,j3]. Assume that a =0 . Then r is the unique
solution of the Euler-Lagrange equations with initial conditions p, u (0) at time 0
and, moreover, the condition (17) is satisfied along r. Let 4> denote the flow of the
Euler-Lagrange vector field, that is, let t -+ 4> (y , v, t) be the solution of the EulerLagrange equations which goes through (y,v) at time O. (Then 4> takes values in x, u
space.) Fix y =P , and consider the x-component of 4> as a function of v and t. Let
the resulting map be denoted by t/>. Then q, is a smooth map that satisfies
t/> (u (0) ,13) = q. If we use D, D, D to denote, repectively, partial differentiation
v
.,.
v ,
with respect to
t and .,., we see that, when teO, D t/> vanishes, but D t/> is equal
v
t
to v, so that DDt/> is the identity matrix. Hence (D t/>)(u,t) is nonsingular for t near
v
r .,. = (x .,..,.
(0) ,U (0 , with x .,. (t)
r .,. extend from 0 to 8(.,.), where
th!i! curve .,. -+ t/>(v(.,.) , 8(.,.)) and t -+ t/>M.,. *) ,t) have the same tangent vector at
333
.,.
t.,.
are,
8(.,.)
(19)
aL
.,. = <(p , v (.,. > ,
L (p , v (
(20)
au
t -H (p, v (.,.
(21)
where H is the Hessian matrix of L with respect to the u variables and "t" denotes
transpose. Since we are assuming that H is nonsingular, (21) simply imposes the
extra requirement that the vector D .,. v (.,.) belong to a linear subspac~ S (p ,v (.,.) )
of codimension one of the tangent space at p.
=D .,. 8 (.,.)
.,.
(22)
If the mapping L (u (0) ,~) is nonsingular, then the solution of (22) with initial
u(O) , 8(0)
~ is unique.
Moreover, it is clear that vIr) EE u(0) ,
condition vIOl
8 (r) is ~
is a solution, which simply corresponds to letting r (for .,.<0) be the
+.,.
334
H. J. SUSSMANN
xEM,
UE
[-l,lJ,
(23)
where the vector fields f, g are smooth, and the state space M is an open subset
of Rn . We will show that, in certain cases, it is possible to exclude extremals that
are bang-bang with too many switchings. In the discussion that follows, it is always
understood that L51. In particular, all extremals are understood to be extremals for
the optimal time problem.
We use X, Y to denote, respectively, the vector fields f - g,f + g. We use
exponential notation for the flows of vector fields, and we write the exponentials
to the right of the points on which they act. (This is needed in order to make the
algebraic calculations involving the Campbell-Hausdorff Formula come out right.
Alternatively, since vector fields act on the left as differential operators on
functions, and there is an obvious duality between functions and points, it follows
that vector fields have to be regarded as acting on the right on pOints, and the
same should be true of their exponentials.) So, for instance, X Exp (tX) Exp (sY)
is the point obtained from x by following an integral curve of X for t units of
time. and then an integral curve of Y for s units of time. Any formal product of
exponentials of scalar multiples of X, Y can be regarded both as a name of a map
obtained from the flows of X and Y and as a formal power series in X and Y.
The precise relationship between the two is that the series gives an asymptotic
expansion for the map. If one considers any finite product of factors
P =rI
j=1
ordered from left to right, then the product P represents the map which to each
state x assigns the point obtained by starting at x and following the integral
trajectory of Z for t units of time, then that of Z for t units of time, and so
1
on. If the product P is formally expanded as a power series, using the usual
formal power series formula for the exponential, then one obtains a series S
whose terms are noncommutative monomials in X, Y. multiplied by scalars. The
series S is asymptotic to P in the sense that; for each positive integer N, each
compact subset K of M, and each smooth real-valued function t/J on M, there exist
positive constants C.a such that the inequality
I (SNt/J)
I S C' IP IN+ 1 ,
such that IP ISa, where:
(24)
(i)
obtained from S by omitting all monomials of degree >N. (i i) S I/> is the function
N
obtained
by
applying
the
partial
differential
operator
SN
to
1/>,
and
335
is time-
optimal. Let us assume that the vectors f (p) , g (p) are linearly independent and
let us shrink U so that fIx) , g(x) are linearly independent for every x in U. Then it
follows easily that G has to be a normal extremal. (Proof: Since r is time-optimal,
r has to be a null extremal. Let 11.( -) = ("A (-) , p) be a nontrivial null-minimizing
multiplier for r, and assume that p = O. Then the Hamiltonian H, evaluated along (r,A),
is just equal to <"A, f (x) + u- g (x) >. At the switching points of r the switching
function <A, g (x) > has to vanish. Then <"A, f (x) > also vanishes, since H:; 0 along
r. So, if x is a switching point of r, and t is the corresponding time, we see that
the linear functional "A (t) annihilates both f (x) and g (x). Since f (x) and g (x) are
linearly independent, we conclude that "A (t) = 0 , and so the multiplier A is trivial.
This contradiction shows that p';: 0.) Let A ( -) = ("A ( -) , 1) be a normalized multiplier
for r. Let a, b be the times when the switchings occur (so that a = t 1 ' b =t 1 + t 2 , if
we assume, as we certainly may without loss of generality, that the initial time of r
is 0). Then H(A(a), 1,x(a),v) and H("A(b), 1,x(b),v) vanish identically as functions of the
variable v.
We will say that two points x (s 1) ,x (s2)
on an extremal
are conjugate
along r if there exists a nontrivial normalized multiplier such that the corresponding
switching function vanishes at both points. In particular, the points x (a) ,x (b) are
conjugate. As we shall see, this imposes strong extra conditions on r.
The condition that x (a) , x (b) are conjugate implies that there is a nontrivial
"A ( - )
of
the
adjoint
solution
equation
along
r
that
satisfies
<"A (a) , g (x (a > = <"A (b) , g (x (b > =0 .
It
is
then
easy to
verify that
<A(a) , g(x(a))> = <A(b) , h(b - a , a
for
each
s,a,
we
let
where,
h (0, s) = (Exp (aY * (g (q - Exp sX . (Here, if F is a smooth map, we' use F* to
denote the differential of F.) Therefore, a necessary condition for
is that the vectors h (b - a, a) , g (b) be linearly dependent.
Clearly,
h (a , s)
is
the
tangent
vector
at
7" = 0
q - Exp (sX) - Exp (7"g) - Exp (oY) ,
which
is
the
same
7"- (q- Exp (sX) - Exp (oY - Exp (-aY) - Exp (7"g) - Exp (oY) .
to
the
curve
as
the
curve
In
particular,
7" -
to be optimal
336
H. J. SUSSMANN
h (b - a, a)
is
the
tangent
vector
at
l' = 0
to
the
curve
l' x(b)oExpAd((a - blY)(Exp(1'g)) where. for a smooth vector field Z. we use
ExpAd(Z) to denote the map that takes each smooth map 41 to the map
Exp (Z) 041 Exp (-Z) . From this it follows that
h(b - a,a) =g (x (b - (b- a) [Y,g] (x (b
+ o(b- a) ,
(25)
h (b - a, a) ... g (x (b
+ 0 (b - a)
(26)
i.e.
- (b - a) [f, g] (x (b
Therefore
h (b-a,a) "g (x (b
= - (b-a) [f, g] (x (b
"g (x (b) )
+ 0
(b-a) ,
(27)
where ",," denotes exterior product. If we assume that g and [f, g] are linearly
independent at p. and then shrink V further. so that g(x) and [f ,g] (x) are linearly
independent at x for every x in V, we can conclude that the left-hand side of (27)
is not equal to zero, provided that b - a is sufficiently small.
Moreover. by
shrinking V even further. if needed. we may also assume that the fact that r is
contained in V already forces b - a to be so small that the conclusion
h (b - a, a) "g (x (b
(28)
follows. (Indeed. if ~>O is arbitrary. then one can always find V 'such that.
whenever a trajectory r is entirely contained in V. then the duration of r is not
greater than ~. This follows from the linear independence of f and g at p.) So the
vectors h (b - a, a) ,g (x (b
are linearly independent. and therefore r cannot be
optimal.
Let us now consider the more degenerate case when [f, g] (p) "g (p) = O.
Let S be the set of all points x in M such that
[f ,g] (x) "g (x) - 0 .
(29)
towards V.
and Y (x)
point
vnS.
it
337
stays in U
that an optimal trajectory which is entirely contained in U or U must be bangbang with at most one switching, and therefore one obtains a complete description
of the optimal trajectories in U, which turn out to be bang-bang with at most two
switchings.
We now turn to the most interesting case, namely, when X (p) and yep)
point to opposite sides of unS. In this case, we can also shrink U and assume that
+
Y points towards U.
and
Again,
one
can
show
that
optimal
Therefore x (a) e U-
Since
g (b) = o.
is
(30)
Let us now allow the times t. to vary, while keeping q fixed. More precisely, we
J
vary a, and then try to select b as function of a so that (30) holds. We may regard
(30) as an equation that defines b implicitly as a function of a, provided that we
can apply the Implicit Function Theorem. Let us assume for the moment that this
can be done. Let a ~ 13 (a) be a function, defined for a near a, such that 13 (a) =b
and
h (13 (a) - a, a)
g (f3 (a) )
=0
(31)
for all a.
Then we can consider, for each a, the XV-trajectory
switches fron X to Y at time a, and terminates at time 13 (a).
r *a
that starts at q,
(In particular,
ra*
is the
piece of
r*
a-e
*A
r.*
a
Since
r*
a
is time-opimal,it
338
H. J. SUSSMANN
follows that
* *I:J. +
r a-e
t:. +
time-optimal trajectory in U
the
terminal
is time-optimal as well.
But
t:. +
=b
r*
a
will
actually
constitute a
I I
I I
that x <O<x , then the conjugacy condition becomes the condition that the vectors
1
(x 1) = S(x 2)
where
S is the function
If
and x
lie on
does.
In
339
time
3'
Tj
until
that all four pieces actually occur. Assume, as we may without loss of generality,
that the starting time of r is equal to zero. Let t. = TO+' + T .. Then x (0) = p ,
J
The times
,T
cannot be arbitrary.
is an
extremal requires that there be a nontrivial solution ). (.) of the adjoint equation
such that <).(t.l, g(x(t .))> vanishes for j = 0 , 1 , 2, so that the vectors
J
g(p) , g(q) , g(r) , after they are transported by the flow of the control
corresponding to r, must be linearly dependent. Let choose to transport these
vectors to the point r. Then we conclude that 9 (r) , Exp (T X) (g (q) ) and
2 *
Exp (T 2X) * Exp (T 1 Y) * (g (p are linearly dependent. Clearly:
(32)
and
Exp (T 2X) * Exp (T 1 Y)
* (g (p
D(rExp (-T 2 X) Exp (-T 1Y) Exp (sg) Exp (T 1Y) Exp (T 2 X
(33)
where D denotes differentiation with respect to s at s=O. The left sides of (32)
and (33) are equal, respectively, to
(34)
and
H. J. SUSSMANN
340
g(r)
[f,g] (r)
-(1"1 + 1"2)
1 2
+(-1"2+ 1"1'1"2)
1 2 2
[Y, [f , g] ](r) + 0 ( (1" 1 + 1" 2) )
+ -1" 1
[X,[f,g]] (r)
(35)
[f,g] (r)
A (1" 1
[y,[f,g]]
+1" 2
+0 (1" 1 +1" 2)
=0.
(36)
requirement that
1"
and
1"
brackets of f and g.
Precisely, let us assume that f, g, and [f,g] are linearly
independent at x, and that U has been chosen so small that these three brackets
remain independent throughout
Then we can write both [X, [f , g]] and
[Y, [f , g]] as linear combinations ~f f, g and [f, gJ. Let tP , 1/1 be the coefficients
of f in these expressions. Then (36) simply says that
u.
1"
'1/1 (r)
+ 1"
tP (r)
+ 0 (1"
+ 1" )
=0
(37)
Since tP and {do not vanish on U, they must have opposite signs. Hence we have
established that, if the three triples (f,g,[f,g]), (g, [f,g], [X,[f,gJ]) and
(g, [f, gJ , [Y , [f , g]]) are linearly independent at x, and the numbers tP (x) ,1/1 (x)
defined above have the same sign, it follows that x has a neighborhood where
there are no extremals with three switchings.
Now let us assume that tP (x) ,1/1 (x) have opposite signs. In this case, we can
construct a two-parameter family of extremals containing r by letting r vary along
a Y-trajectory, and letting 1" vary independently. That is, we consider the trajectory
2
r *S,U
gets to a point q
1" (s,u)
LU
S,U
(In particular,
r *0, 0
is the portion of
Moreover, each
r *LU
1"
1" (s,u)
1
+u , ending at
Then the
r *LU
LU
r *S,U
until it
such that,
341
5,0
We use:
p
where
5,0
by
Exp T 2 + 0) X) * (g (q (5,0) ) )
Exp T 2 + 0) X) * Exp (T 1 Y)
* (g (p (s,o) = 0
(39)
(38)
evaluated at 5 = 0 = O.
$,0
and
(41)
= o.
0,
We then get
AA8=
,r 2 and
particular,
8 = 0 (T 2)'
is the product of
derivative of
0,
T2
T1
vanishes when
co
times a C
(42)
(p) +0 (T 2) ,
implicitly as a function
1
T
2+
function.
does, so that, in
Therefore the
= 0, is divisible by
Hence
From this it follows, in particular, that the leading term of the right side
of (42) does not vanish, if U is small enough. Hence the map (s, 0) -- p
$,0
defines
a smooth two-dimensional surface S near p, and the vectors A,8 form a basis of
the tangent space of S at p. If we compute the exterior product of AA8 with X(p)
and YIp), we find
(43)
and
(44)
=T 2 (-1/11/1- 1 + 0 (1) ),
follows
from (37).
together with fact that tJ> and 1/1 have opposite signs.
We
can then conclude from (43) and (44) that the bases (X (p) ,A, 8), (Y (p) ,A, 8) of
R3 have opposite orientations.
342
H. J. SUSSMANN
for p e [0,
EJ.
E'
< T 3'
Let
r #p
By making
r#p
Fix a number
s(p),o(p)
l'
r *S (P),0(P)
E'
with
are well defined trajectories. Moreover, it follows from the construction that
rp ,
rp
*r E
r0
. Since
r0
*r E
takes
is a part of
r,
is optimal as well.
To complete the argument, all that is needed is to exclude the p'ossibility that
6
*r E
be optimal.
Clearly, 6
*r E
arc and a bang-bang arc with two switchings. We will show that no such
concatenation can be optimal. However, in order to do this, we have to distinguish
two cases, according to whether tP is positive or negative on U. (Recall that we are
considering the case when tP and 1/J have opposite signs on u.) It will turn out that
in both cases no concatenation of the above type can be optimal, but the reasons
will be different in each case.
Let T = (x ( ) ,u (
U and is a concatenation of a singular arc, an X-arc and a Y-arc, in that order. Let
A= (X, 1) be a normalized multiplier for T. Let S, Tj, X be the functions <X.f>, <1..,9>,
<X, [f, 9J >, evaluated along (T,A). (In particular, Tj is the switching function.) Then it
is easy to see that the derivative of Tj along (T,A) is equal to X. Also, the derivative
of X along the X -portion of T is equal to <X, [X, [f, 9 J J >. Since [X, [f, 9 J J is a
linear combination of f, 9 and [f ,gJ, and the coefficient of f is tP, it follows that
343
= --tP (x (t))
T/ (t')
2
If tP is positive on U,
then the minimization
contradicting the fact
possibility that tP>O on
(t' - t) 2 + 0 ( (t' - t) 2)
(45)
then (45) implies that T/ (t') <0 for t' >t but close to t. But
of the Hamiltonian requires that u (t') = 1 for such t',
that, for t'>t but close to t, T is an X-trajectory. So the
U is excluded.
= /J. 1 S +
and
x'
V3
are
defined by
(iii) ~l ,~
(so that v
/J. T/
2
1
,~
/J.
X,
the
where
identity
(i) /J.
= - (1 - u) v. + - (1 + u)
2
[X, [f , g]]
= v,t
+ v2 9
~.,
J
(i i) v
v2 ,
+ v 3 [f ,gl and
are defined similarly, using [Y, [f, g]] instead of [X, [f, g]]
== tP and
== '" ).
are bounded by constants that do not depend on the trajectory T, but only on the
neighborhood U, as long as the duration T of T remains bounded. It then follows
that these functions are Lipschitz with constants that depend only on U and T, but
not on T, If we now restrict ourselves to trajectories T of the particular type
considered before, then the second derivative of T/ along the X -portion of T is
equal to tPS + V 2 T/ + v3x. Since, at the switching time t, T/ and X vanish and S=-1,
we conclude that the second derivative of T/ on the X-piece is positive for all
possible T's that are contained in U and have a duration not greater than T,
provided that U and T are small enough Since T/ and its derivative vanish at the
switching time t, it follows that T/ is strictly increasing along the X-piece.
Therefore T/ can never return to the value zero, and so the second switching, from
X to Y, cannot occur. Since it is always possible to choose U small enough so
that all the trajectories in U will have a duration not greater than any preassignep T,
we conclude that. if U is small enough, then no extremal T of the above type can
exist in U. This completes our analysis. We have proved:
Theorem 1: Let f, 9 be smooth vector fields on an open subset
3
H. J. SUSSMANN
344
r .,. =
(..
123
(46)
where
1
J .,.
=Ja.,.
< )L(x .,. (s),u .,. (sds,
(47)
/3<.,.)
=JB L(x .,. (s),u .,. (sds,
(48)
J .,.
J .,.
(49)
and
(50)
Then
1
--
D J (1/) = - D a (1/) L (x (T ) , u (T
.,. .,.
.,.
1/
1/
(51)
and
2
+
+
D J = D /3 (1/) L (x (T ) , u (T
.,..,..,.
1/
1/
As
for
the
derivative
of
J3.,.
with
.
respect
(52)
to
.,.,
it
can
be
computed
by
We get
=JBA.,.
D (L (x .,. (s), u .,. (s) ds .
(53)
aL aL ax aL au
a.,. ax a.,. au a.,.
aL
ar
VO-= -A' - A
ax
ax
-=---+-.- ,
O -
and
(54)
(55)
345
aL aH
au au
af
au
v-=-- A-.
(56)
vD J (71)
1'1'
=J
Z (71, x
71
(57)
where
aH au
au aT
ax
aT
af au
au aT
afax
(58)
Z=A'-'--A"--A'-'--A'-'- .
ax aT
afax af au
(59)
(DTx)' = - . - + - ' - ,
ax aT au aT
aH au
au aT
ax
aT
ax'
Z=A---A'--A(-)'.
aT
aH
au
-l [A
AT
(60)
r ,
71
l'
1'1'1'=71
d5,
(61)
i.e.
vD J
l'
3
l'
(71)
<A (A), [D x ]
(A) > - <A (8), [D x ]
(8) > .
71
1'1'1'=71
71
1'1'1'=71
If we combine (62) with (5 1) and (52), we get, for l' = 71
(62)
vD J =
l'
l'
<A , [D x ]
(A) > - <A , [D x ]
(8) > +
1'1'1'1'=71
1'1'1'1'=71
Now
+
+
v [D 13(71)L (x (T ) , u (T
71
l'
71
71
-D
--
l'
a (71)L (x (T ) , u (T
71
71
(63)
(64)
and
H.J. SUSSMANN
346
d
aX1"
d(3
=-[x (3(1"]=-(3(1" +x '(3(1".-(,-)
(65)
T
d1" 1"
a1"
1"
d1"
so that. for 1" = 77, the right-hand side of (63) is equal to that of (6), and so (6)
follows.
+
I.A.K.
KUPKA
CHAPTER 1
1.1. Introduction
The purpose of this paper is to give a short and somewhat incomplete survey of the optimal control problems particularly of its geometric aspects.
The basic problem of optimal control is the following: let
S be
n{ . On M , the dynamics of S
= COO
CW
or
= real
F: M xU --
nuous,
1)
is measurable,
t E [a, i3 1 ,
md
being a
analytic).
md ,
(x, u) :
2)
la., Sl --
MxU,
is absolutely conti-
= 0,
i3 = Tu
Tr(M, M)
by
and
Tr(A, B)
x(O) E A ,
Tr.
(x; u) E Tr
r, a
we define its
347
M. Fliess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory, 347-369.
1986 by D. Reidel Publishing Company.
348
I. A. K. KUPKA
cost
C(X, u}
as follows :
C(x,u)
= r(x(s
1i3 c(x(t),u(tdt
Cl
J c+(x(t), u(tdt
c _ (x(t) , u (t dt
[0, +a>]
and
SSc(x(t),u(tdt
Cl
taking values in
Cl
tC(X(t),U(tdt = +co.
Cl
let A, B
such that
C(x, ii) = inf ~ C(x, u) \(x, u) E Tr(A, B)~. An important variation of this problem,
the fixed-time problem is the following : besides the data
Tr(A, B, ,.)
the subset of
of the
u)
= inf~C(X, u)l(x, u)
E Tr(A, B,
,.)~
= IRd
and
F(x, u)
=u
z: [0, T(z) -- IR
a, b
t E [0, T(z)] , -a s;
d 2z
(t) s; b
0, find one
z : [0, T) -- IR
conditions above] .
349
= IR2,
= [(O,O)},
= [-a, b) c 1R ,
= (L, O)}
F(x u)
,
and z
= xl
Il ddt2(t)\\
a)
= x2 l"'x + u ox'
l
c =I
,
u I
2
=0
l'
z:
t E [0, T(z)l .
0 ~ z2(t)
b)
for all
t E [0, T(z)l , z(t) = (zl (t), z2(t , L given constant. c) z(O) = (0,0) ,
dz
dz
ill (0) = (-v, 0) , z(T(z = (0, L) , dt
(T(z = (0,0), v given constant;
find
or
Z : [0, Tl - IR2 such that T = inf [T(Z) I z satisfies a), b), C)] ,
STill~f(t)11
o
dt
= inf[ST(Z)illi(t)11 2 dtjz
o ~ x2 ~ L
t,
satisfies a),b),C)]
M c 1R4 ,
0 in
IR2
A
and
c = I
3 oXI
4 oX2
(0,0, -v, 0)
,B
c
2 o~
I oX3
(0, L, 0, 0)
=i
(x 3
+~)
c =I
,1'
=0
in the second.
and
=0
In this section
0,
A and B
are two
1. A. K. KUPKA
350
pOints in m d .
To make sure that
Tr(A, B)
T and any
y Em, let
ip(T, y)
denote the
(x,u): (0,'1"1 -
m xU
of the system.
Notation. If K, L
distance
h(K, L)
m n , their Hausdorff
is defined as follows :
Ji: (mn ) .
Proposition O.
For all
body and
d _
-
1)
t>0
y Em
;p(t, y)
is a convex
U ip(t, y) .
t>-O
y E m d , the mapping
2) For any
is lipschitz , i. e. for any
that for all
and all
m + 3 t - ip(t, y) E Ji:(illd)
C (T) > 0
such
t 1 , t2 E (0, T) ,
t - ip(t, y)
jectory starting at y .
We shall not prove proposition 0 here (it is fairly easy,. We
shall use it to solve the minimal time problem.
Let us denote by
from state
to state
T> 0 .
T=
inf
351
[q B E q,(t, A)]
(p(t), B)
i.
such that
(tn )
such that
and
A) ,
B E ~(f, A)
~(t,
p(t)
n
for all
n:?; 1
where
(Xl'
If
(p,
ex> z') ) (p"" B) , a contradiction to <!B> .
Let us draw some implications from <!B>
that
z = e fX [ A+
S:
z, E if>(T, A) such
e-tXrV+Y(G.(tl
d~
for some
(PIN)
u. Hence
is equivalent to :
@)
u: [0, T] -
X.
If we introduce the function
H : lRdxlRdxU -
lR ,
352
1. A. K. KUPKA
A to
B.
H(x(t), p(t), u)
t E f 0, f
H(x(t), p(t), u)
:S:
and all u E U
If
rable subset
f 0, fl
in
> 0 , a measu-
such that
Define au: f 0,
w
:S:
TJ -
tEE. Then :
if
fro H(x(t),p(t),u(t
dt
A contradiction.
as follows:
tEE.
u(t) = u(t)
f(t)
continuous. In fact
lim t
is a
is continuous : let
n
t '- E
and
mes(E) .
f: f 0,
and
un E U
if
TJ -
1R
is lower
there
such that
Uo E
{j
lim inf
n-ex>
Hence for any
1
f(t )-f(t)
t n_ t
n
t E [0,
T[
nf f(t') - f(t) - 0
t' t
""
1m 1
t'-t
t'>t
t -t
n
0 .
353
-a.
f (13) > f (a.) .
E:
Let
f (t) - f (T)
lim inf t
t-T
-T
t- T
-T
t> T
A similar reasonning, in the case that
by
such that
replacing
is constant.
(x, p, u) : [0, T) -- IR x m. x U
dPk(t)
:>:
of a curve
dxk(t)
oH _
_
_
~= OPk(X(t),P(t),U(t
- """dt
Tl -- md xU
oH _
_
_
oX (x(t), p(t), u(t
k
foraH
k=1,2,._,d
H(x(t),p(t),u(t = SUP[H(X(t),P(t),V)\vEU]
tE[O,Tl.
, is constant.
CHAPTER 2
THE MAXIMUM PRINCIPLE "IDEENKREIS"
2.1. Introduction
The results stated at the end of chapter 1 are valid in general. This is essentially the content of the maximum principle. For simplicity let us assume that
A and
I. A. K. KUPKA
354
cost r
is zero.
If a trajectory
projection on
M xU
(i, ii)
[0, of] -
of a trajectory
E~,
M xU
is optimal, it is the
A=
or
1 : let
M x IRd X U of one
HA: M X rn.d X U - rn.
- AC (x, u) .
OHA _
_
= -;;- (x(t),p(t), u(t
dPk(t)
-~
oPk
aHA _
_
= oXk (x(t),p(t),u(t
all
for almost
of}
t E [0,
p(t)
Moreover
i(O) E A,
p(O)
at
normal to
at
x(O) ,
x(of) E B,
pef) normal to
x(T)
E~
that
and
such
'T
be the fixed-time.
Theorem 2.
An optimal trajectory
(i, p, ii)
of the fixed-time
E A , A=
or
1 :
355
dxk(t)
-dt
E
oHA _
_
-(x(t), p(t), u(t
oPk
all k = 1, 2, ... ,d
almostall tE(O,T)
dPk(t)
oHA _
_
_
dt
= OXk (x(t) , p(t), u(t
t E (O,T)
Notations.
Mx IRd
of a trajectory of
E~
or
EA
E~
or
E I , exceptional if it satisfies
E~
or Eo.
T
,(t)
if
0 s; t s;
fiii::.
ab+a2
~:
=
[0, T) -
IR ;
T =j2L(~t)
tl
btl
a
2
- T+btl .... 2(t-tl ) if
tl
S;
t s
the function
missible control
e: ,
O
u (t) = ).
e:
-e:
Ostse; 2
e: s; t S; L/ e:
2
L/e: 2 S; t s;(L/e; )+e:
= Te;
I. A. K. KUPKA
356
z(O)
dz
dz
ill (0) = 0 , to z(Te;) = L , dt
(Te;) = 0 ,
= 0,
is zero. But it is easy to see that there is no trajectory having this cost.
y)
~~(O)
= v,
v> 0
z(T) = L ,
~f(T)=
where
z(O) = 0 ,
where
-1 , T- p s;t s; T
X2 + X(T-v) - L2_ tv 2 = 0 ; if
as follows:
0, the time
+1 ,Os; t s; v- p
0, v- pst s; T- p
ii (t) =
~)
vT s; L +~ v
vT > L2 + tv 2, then
, then u(t) =
X2 - (T+V)X + L2 + tv2
=0
rn d xU
1)
x EW -- (*(x), cp(x) E
such that :
SUP = ~ (x, p, u) I
= sup[H\x, p,v) Iv E u1 = O~
The graph of
3)
t E 1a.,
t,
means that
1
s.
s,.
2)
d1jl~X(t
= - ~ (x(t), 1jI(x(t, cp(x(t)
t
oX
. Geometrically this
evaluated at
= (p. F)
-c
HI
of
4)
357
::p (x(t
Wand
(y, v):
y(O)
[0, T J --- Mx U of
v
x(O) , y(T) = x(T) .
v
F, such that
L Pkdxk ,to '. Since this last space is lagrangian and simply conk=1
nected (it is homeomorphic to W), UJ, has a potential V. V is a
UJ =
lipschitz function
W - IR
such that
'.
t E CO, T J
v
respectively
CO, TJ
J'Iy H1 (y(t),w(y(t),v(t)dt
V(x(T -V(x(O
+ C(Y,v)
+ C (x, ii)
_ cp(x(t
_) = 0 and H1( y(t), W(y(t, v(t) ) s;
But: H1(_x(t), W(x(t,
s; H1 (y(t), w(y(t , ::p(y(t) = 0 . Hence: C(x,u) s; C(y, v) .
Remark.
If,in theorem 3,
of x (C 0, i'J) contained in
W will be
that way.
2.5. Optimal control treatment of the classical case
We shall show, briefly, how to use the results stated above, in the classical
case. Let us state the problem: given a open subset
pOints A and B
in
in IRd , two
c: MxIRd --- IR
358
I. A. K. KUPKA
x:I 0, T) -
x(T)
x(O) = A.,
= B)
M joining A.
to
B (i. e.
= A.
= B.
x(T)
(x, p)
= u(t)
dXd-t(t)
f 0, i)
in
, - dp(t)
dt
is the projection
: f 0, T) = - A OC (x(t)
ax'
HA(X(t),p(t),ii(t) = SUP[HA(X(t),P(t),v)lvEmd]
A = 0 or 1, H A(x,p,u) = (p,u) -AC(X,U) .
2)
x rn d satis-
u(t
2)
For
conditions:
Since
i) ,
t E f 0,
p(t)
A(
ac (- (t) dx(t"
dt au x , dt ~
t E f 0,
T)
= Dc (- (t) dX(t
ox x
'dt
x will be optimal
in the
satisfied:
a)
(x, u) E W x1R
for all
d
.~
1,
b)
J=1
and all
of
x(f 0, i)
such that
S Ern, S f 0
a2c
t E ) 0, T)
(1.
e.
is injec-
tive). Then
b)
359
(x, p) can
satisfied:
1)
2)
3)
0)
~~ (x(t), d:?
p(t) =
Wand
at
is injective on
I-a., T+a.) ;
."
"
a)
x([-a., f+a.)
a) is satisfied,
x is d and
hence 1)
of
is true everywhere.
dX
dt (-a.) in the
hypersurface L =
_
y =
~x (-a.)
t
for
satisfie the
b)
n: S- -- W of the projection
WX ffid
z(t)= (x(t),p(t ,
of
n: n-\W) -- W
x(l 0, f])
contained in
~:
W,
S-
S-
is
Lagrangian and
S : W -- ffi
~~ (t, y),
we can
apply theorem 3 . To be able to extend the preceeding considerations to optimal control theory, we need to extend the notion of Mayer fields (seelMJ).
This has not been done up to now and it is called the problem of synthesis
of optimal control. A first step to do this appears to be the study of the
extremals. We take this up, in the next chapter.
360
I. A. K. KUPKA
CHAPTER 3
GEOMETRIC STUDY OF THE EXTREMALS
3. O.
Genericity
Given a topological
Y is gene-
(F,c)
c: M XU -
is
will be the
JRd
and a
MxU
mC
JR
361
Notations.
a)
Let denote by
H(x,p,u) = SUP[H(X,P,V)\VE U]
u
Finally
Sin
U:;'V -- H(x, p, v)
such
(resp.
au:;, v -- H (x, p, v) , au
c)
such that
d
b) Let Sini
that
(x, p, u) E M x IR x U
is contained in
Sin.
Elementary considerations
There exis ts a generic set !:to such that for any pair (F ,c)
in that set:
1)
Sinb
a submanifold of codimension
in
c in
in
Sinb
Let TT
d
Mx IR xU
Remark. Since
in
!:t1
!:t1 contained in
!:to
G of
such that
Sin
and CB
G (resp. <B)
362
I. A. K. KUPKA
CB
CB is an
by
u j : B - U, j E J , such that
For each
such that
The function
a nbd
Aj is the graph of
z 'E;: B , denote by
J(z)
such that
J(y) c J(z)
H.
of all j E J
for all
z. For each
:f
such that
= sup [HWt) , v)
Hj(C (t
Iv E U]
Simple points
Definition 1.
(F, c)
. This
of Hk .
(t)
Hk
A point
zoE M x JR
in
the subset of
uj
of z in B
V - ill
is
nal of J(z)
J
B E CB
CB. If
and is
of Zo and
smooth mappIngs
such that
363
H(zO'u) =sup[H(zO,v)\vEU}
b)
u EU
such that
H(z,u) == supfH(z,v)\VEU}
is
zo.
plicity is
card J(zO)
If m == 1 , the extremals
in the nbd Vo
_
UJ
Zo
two hamiltonians
-+-
H(z, u 1 (z
and
H+
and
. For simplici-
H_
respectively.
(resp. HJ
of
the extremals
H+
(resp. HJ.
zo' It is crucial
+
l:: == Vo n {H + = H_} of V 0
V~ in Vo .
If
Zo
is a point of multiplicity
2 such that
H+(ZO)
ar~
H_
at
Zo
Vo c Vo of
zo'
and HJZo)
of
H+
and
l:: is a regular
hypersurface, H+ and H
are transversal to l:: and point both toward
+
Vo (resp. yO) along l:: if {H+,H_}(zO) > 0 (resp. {H+,H_}(zO) < 0). It is
easy to see that the extremals in
V~
Zo
364
I. A. K. KUPKA
3.3.
H+
and
~.
i(
H+ (z)
0
Zo
of multi-
H- (z)
0
and
are tan-
H+
and
H_
with
the
will de-
E at
zO. This
H+
and
A point
Zo E El
O. Zo is hyperbolic if
~H~(H_ ,H+}!(ZO)'
co
L;l, its
L;1
called
the residual field. In the hyperbolic and parabolic cases nothing very exciting
happens.
Notation.
in
T*M, a subset
N c::; W is invariant
365
Theorem 1.
1
Zo E L:
a)
Let
of
meeting
L:
along
both
+
and having a contact of first order with L: there.
1
L:
~(H+,H_} >
(resp. ~(H+,H_}<O~). The five sets Sep+USep_, W+, W_ '
ot
extremals switch exactly once on . L: and are the trajectories of a piecewise smooth flow. In Sep+USep_ ' the extremals switch at most once.
b)
If
Zo
{H + s: H_). W - Sep
has two c onnec ted components W+, W_ with W c {H+ < H_} . The three
sets
i(.
Sepu W_
do not
If
i)
if
Z0
z: fa., 131 -
nbd W
of
Z0
such that :
where
Ii) if
z(o.)
tends to a point
Zoo
z : fa., 131 -
on
L:1
~H_{H+,H_}~H+ + ~H+{H_,H+HH_
~HJH+,H_l(+ JH+{H_,H+H
passing through
is
C 113-0.1 1
d(z (a.) , :E )
is the distance of the initial point z(o.) of z to :E 1 .
Zoo
at time a..
J!.
Z <!:
z tends, in the CO
of the field
topology,
366
I. A. K. KUPKA
H >H
+ -
H <H
Hyperbolic
Parabolic
Elliptic
3.4.
and
where
space we shall not do it here. In fact, noting essentially new shows up, at
least in the generic case.
Now, if we assume that at a simple point
all brackets of length 2, 3 and 4, between:
7:0 of multiplicity 2 ,
zO'
An extremal
[0..~):3 t -
z(t) E T*M
is called a Fuller
a.
nb
~ t
.~ .. ~ ~
property
I-t
n+ n
kn
as
n tends to
tl :s: t2
k> 1
...
with the
co.
H+(z 0)
Let
and
Zo
E T*M
if_(z 0)
367
b)
and H
are zero.
nal projective space P(5) , with a non empty interior such that if
4
3
2
cp(H+,HJ = Lad H+(HJ(O),adH_ad H+(HJ(zO)' {ad H+(HJ,adH+(HJ}(zo)'
ad 2H _ad 2H+(HJ(Zo)' {adH_ ad H+(H J, ad HJHJ }(zo) , ad4 HjH) (Zo)]
r.
belongs to
Corollary. If the dimension of the state space is at least 12, the "Fuller
phenomenon" is generic.
We cannot give a complete proof of theorem 2 here but we shall
sketch it, giving the main ideas. Let F
=i
(H++ HJ
' G
i(H+- H J
Xo
zo),
xl" .. , x d _ l '
w(Y) = w(x1 ) = 1 ,
w(X 2 ) = 2 ,
if j ~ 2
J
degree at least 5 and FO
= 4
w(p.) = 3
2)
1)
w(X j ) = 3 if
3)
G = q
. We
F = F +F'
3 , w(q)
where
=
F'
w(PI)
is of
is of degree four :
2
2
3
k 3-k
FO = Pl- i aX 2 +P2Y +bx~ + y L: cll Xl
k=0
The constants
brackets of lenght 5 of H+
a
= ~ ad F
and
(or
Zo
of the Poisson
F, G) . For example:
(H +' HJ
a discrete dynamical
[HPS})
telle us that if So
1. A. K. KUPKA
368
Finally, using the blow-up again and techniques from real algebraic geometry,we can show that if the point
A
curve.
Final remarks. In this paper we have barely skimmed some of the main
topics of a vast subject: optimal control theory. In his survey, in this volume, H. Sussmann discusses another very interesting and important theory,
the extension of the classical concept of conjugate point to the optimal con,..
trol case. In a contribution to this volume, B. Bonnard discusses the role
of singular extremals (introduced here) in the optimal time problem. Also,
Fliess and Lamnabhi-Lagarrigue, in their paper, study
extensions of the
[M}
(e)
. For the
last chapter which states some of my own work see [EJ, [KlJ, {Kul},
[Ku2J .
369
I.A.K. KUPKA
INSTITUT FOURIER
Laboratoire de Mathematiques
BP 74
38402 ST MARTIN D'HERES (France)
REFERENCES
[B}
B. BONNARD, 'On singular extemals in the time minimal problem'. Proceedings MNTS 1985.
[C}
C. CARATHEODORY, 'Calculus of variations and partial differential equations of the 1st order'. Holden Day 1965, vol. I and II.
[E}
[G, W, P, L} C. G. GIBSON - K. WIRTHMULLER - A. A. du PLESSIS E.J. N. LOOIJENGA, 'Topological stability of smooth mappings' .
Springer LNN 552 (1976).
[H,P,S}
[Kl}
[Kul}
I. A. K. KUPKA, 'Geometric theory of extremals In optimal control', to appear in Trans. Amer. Math. Soc.
[Ku2}
I. A. K. KUPKA, 'Generic properties of extremals in optimal control problems' in 'Differential geometric control theory'.
Birkha:user PM 27 (1983), pp. 310-315.
[M}
[S}
Fran~oise
LAMNABHI-LAGARRIGUE
371
M. Fliess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory,
1986 by D. Reidel Publishing Company.
371~387.
372
~ q(t)
~ y(t)
(q) +
u. (t) A. (q)
1.
i=1
1.
h(q)
(ul, ,um)
~I,
i=1
ki
N
u.(t) O.(q
, ,q ), k= l, ,N.
1.
1.
Recall that to each vector field there corresponds a derivation, called Lie derivative or directional derivative. It is given by the firstorder differential operator
A.(q) =
J
kiN
Cl
,q ) k
L O.(q ,
k=1
J
Clq
' j
= O,l, ,m
Expansion in noncommutative generating power series and its relationship with Volterra series, as discussed in [8, 10], is treated in a
chapter of a new book [14] by Isidori which is excellent. Therefore we
will be quite concise on this subject.
Given the initial state q(o) = qo' the input/output behaviour is characterized by the generating power series
g
= h(q o )
, .. ,j
A A. h(q ) x . x.
=0
JO
Jv
Jv
Jo
(1)
v
This is the (first) fundamental formula. It can be interpreted (numerically) by replacing the noncommutative words in x ,x 1 ,x by the
t
0
m
d s .... d s .. These are defined by induction
interated integrals
o
Jv
J0
on their length :
o
373
I; (T) = T
fo
dl;.
I;.(t)
1, . ,m
,I;.(T)
a,I, .. ,m
( JT dl;.
dl;. dl;.
Jv
Jo
J v- I
= h(q
)+
0
1.
j , ... j
\I~O
=0
+.
1.. _
~l'.'.'~k-I
It
0
f Tk- 1
I ft
i=!
fT I
0
Wk
WI
(i)
(t,ol;qo) ui(ol)do l +
(i1,,i k )
(t,ok,,ol;qo)u i (Ok)
k
. u i1 (1)
dokdo l
374
wo (t;q 0 )
tV
tA
= L A0 h(q 0 ) -= e h(qo)
v!
v~o
(4)
v
v
v
A A. A } A. A k
o 1} 0
1k 0
For the convergence of the Volterra series see Lesiak and Krener [21].
Remarks : (i) It is well known that. for t fixed. (3) can be considered
as a functional Taylor expansion (cf. Volterra and Peres [29]. Berger
[3]). This fact which is at the origin of Volterra's works [28] is not
true anymore when t is considered as variable. although this has sometimes been asserted in the literature. It is worth mentionning that the
generating power series (1) is a Taylor expansion with respect to a new
derivation called causal derivation [9].
(ii) Non-commutating indeterminates permit us to generalize to the
nonlinear domain numerous facts concerning the Heaviside symbolic calculus. i.e. the Fourier and Laplace transforms (cf. [10]).
II - Expression of Volterra kernels in terms of Lie-brackets
Let a and S be two noncommutative indeterminates. A by-product of the
Baker - Campbell - Hausdorff formula gives (cf. Bourbaki [4]. p. 59)
e
-a
375
where
s , ad
a.
S = ad I S = [a., S]
a.
[a., adov S]
a.
and
a.s - Sa.
(Lie bracket)
, v ~ I.
Applying this
wI
(i)
(t,uI;qo)
:Ie.
J
, j
= O,I, ,m.
376
aw
~, k
N aq
we obtain Je. = L o~ Pk
J
k=1
J
Recall that if Pk =
I, ,N and A.
L O.
k=1 J
k'
aq
(llll_ll~)
k~1 aPk aqk
aqk apk
For the first-order Volterra kernel, formula (5) and the Hamilton
equations lead to
Proposition BI : A first-order Volterra kernel of the system L can be
written in the form
Jej
I u=o
q=qo
One notes that the Taylor expansion of the first-order Volterra kernels
reproduces some of the usual calculations in optimal control (Gabasov
and Kirillova [12], Krener [17]).
For higher order Volterra kernels one needs to introduce the notion
of functional derivative due to Volterra (cf. [28, 29]) which is well
known from physics. There exists a precise mathematical definition of
this concept; see e.g. [6], where this derivative is called the
Frechet-Volterra derivative.
I
CO ([0,1], m) denotes, as usual, the set of continuous functions
[0,1] ~:R. Consider a functional U : CO ([0,1]. m) ~m which is-differential in the Frechet sence (cf. Berger [3]). Moreover assume that
this derivative can be expressed as an integral
fo!
<:r>(a)
where of is a variation. The functional derivative is then. by definition, the function ~ which in general will depend on f. Let us denote
this, as is usual in physics, by
377
.!Q.. =
Of a.
4>( a.)
This leads to :
Proposition B2 : A Volterra kernel of order k can be written in the
form :
vI
V2
dod
-------v2
92
vI
s:
dT
d9 2 uU.
~2
u=o
9 k = 9 2= T
q=qo
By analogy with (1.2) we will call formulas of Proposition Bl and B2,
second fundamental formulae.
378
I - Calculus of variations
Consider the system
~ q(t)
F (q(t), U(t))
? y(t)
h (q(t))
(6)
eu i
F(q,u) + F (q,u) eu + + F i (q,u) (~)
+
u
u
L
.
.
e~F
.
.
where F i denotes the partial der~vat~ve ---. wh~ch ~s also a vector
F(q,u+eu)
u
eu~
field. Putting A =
o
T,
T,
T,
u, q)/q=a
V(e;) is given by
379
with
fT+W(E)
3 V
T
+
r+W(E)
T
3 (0"1) (ou(cr l
f'2
T
r+W(E) f2
T
dcr l
dcr l dcr 2
etc
From order two on the previous expressions "ontain products of first
orfer differential operators. By means of integrations by parts one
can extract a particular terms which only involve first-order differential operators ; for instance,
and
380
We will make use of this technical Lemma by employing variations chosen such that the dominant term of the expansion in will be
rr
f+
T
()
for the third order. We will see for instance that the classical generalized Legendre Clebsch condition comes from the vector field "associated" with the second variation O2V.
381
hence p(t) is in fact a covector), which, written as a row vector, satisfies the equation :
p(t)
with
= - p(t) (IF
aq (q(t), u(t
, p(T)
ah
= -aq
(q(T
for all u c n.
- - (The barred variables u, q, p denote a solution of the control problem).
With the usual Hamiltonian notations
H
= < p,
.F(q,u
= ;:
p(t)
,q(o)
= qo
,p(T) = ;: (q(T
= min
aH -
dV ~
aH
and consequently, --dtV
aU
H(P(t), q(t), u)
uEn
n, then,
, t E [O,T]
= 0
0, for all v
>,
aH
ad
\)
~
AI>
382
will show that higher order necessary conditions for optimality which
are essential for singular optimal control problems (cf. Bell and
Jacobson [2]), can be interpreted via the higher order variations of
oy expressed in terms of Volterra expansions. More precisely this involves Lemma CI.
First of all we need a fundamental Lemma on the positivity of the
leading term of oy = Vee) wo (T",q) Iq=q ()
, developped in powers of e.
Let us denote by
that
~ ~
0 such
E.
<
pc,), D >
(8)
is verified along the trajectory. As before, every expression evaluated along this trajectory will have a bar on top. Thus, for instance,
~~
will be noted
~~
AO
AI' v ~ 0.
383
a) Jacobson-Gabasov test
Consider again the variation (8). There follows
02V = a 2 (A 2 E + adAo A2E
If A2 belongs to
X then
Proposition C3 : If
_0_ H
UT
Iq=q(T) >, .
This is the socalled Jacobson-Gabasov condition, [15]. An interpretation in Hamiltonian terms, apparently new, results from chapter B above.
b) Generalized Legendre-Clebsch condition
With the variation
E [T,T+E 2 [
E[T,2-E 2 , T+2E[
else where
if
C1
if
C1
we find
or
AO
384
~a
Qu(er)
(I +
(er-T
~O elsewhere
Thus,
Q2 V
Q3V = a 3I}
if
EX
then, as a is arbitrary,
Therefore
If
Proposition C5 :
[AI' [Ao' AI))
U=
or
<If(T),
or
Q
[-a 2
QU
(T'T,q)/
q=q(T)
HullIU.O
a2 = a! = T
q = -q T)
385
IV - An application
The motion of the center of mass of a moving object is described by the
equations
qO(t)
(ql(t2
ql(t)
2
q (t)
q2 (t)
3
3 + fl
I 2
(q ,q ,t)
a (q (t
q3(t)
We examine the problem of finding a control u(t), lu(t)1 ~ I, minimizing the functional
h(q(t
We assume that (ql (t2 + (q2(t2 f. 0 , for all t E [O,T].
Proposition C6 : The optimal control for this problem is necesseraly
bang-bang, i.e. u(t) = I
Proof: Let us assume the existence of an extremal control u(t), such
that lu(t)
< I.
However the expression <p(,) , [A I ,[A I ,[Ao ,A 1 ]]] > cannot vanish on a
subinterval. This contradicts Proposition C6.
Acknowledgement :
The authors wish to thank M. Hazewinkel for revising the English version
of this paper.
REFERENCES .
[1]
[2]
[3]
386
[4]
[5]
[6]
M.D. DONSKER and S.L. LIONS, Frechet-Volterra variational equations, boundary value problems, and function space integrals,
Acta Math., 108, 1962, 147-228.
[7]
[8]
[9]
387
[19] F. LAMNABHI-LAGARRIGUE, Sur les conditions n~cessaires d'optimalit~ du deuxieme et troisieme order dans les problemes de commande
optimale singuliere, in "Analysis and Optimization of Systems"
(A. Bensoussan and J.L.Lions eds), Lect. Notes Control. Informat.
Sci 63, 525-541, Springer, Berlin, 1984.
[20] F. LAMNABHI-LAGARRIGUE, S~rie de Volterra et commande optimale
singuliere, These d'Etat, Universit~ Paris XI, mars 1985.
[21] C. LESIAK and A.J. KRENER, The existence and uniqueness of Volterra series for nonlinear systems, IEEE Trans. Automat. Control, 23,
1978, 1090-1095.
[22] L. PONTRYAGIN, V. BOLTYANSKI, R. GAMKRELIDZE and E. MISCHTCHENKO.
The Mathematical Theory of Optimal Processes, John Wiley, New-York,
1962, French version: Mir, Moscou, 197
[23] W.J. RUG~, Nonlinear System Theory, The Johns Hopkins University,
Press, Baltimore, 1981.
[24] M. SCHETZEN. The Volterra and Wiener Theories of Nonlinear systems,
Wiley, New York, 1980.
[25] R. SIEGMUND-SCHULTZE, Die Anfinge derFuntionalanalysis und ihr
Platz in Umwilzungsprozessen der Mathematik um 1900, Arch. History
Exact Sci. 26. 1982. 13-71.
[26] H.J. SUSSMANN, A Lie-Volterra.expansion for nonlinear systems, in
"Mathematical Theory of Netwoeks and Systems" (P. Fuhrmann ed.),
Lect. Notes Control informat. Sci 58. 822-828, Springer. Berlin,
1984.
[27] C.A. UZES. Mechanical response and the initial value problem, J.
Math. Physics. 19.. 1978. 2232-2238.
[28] V. VOLTERRA, Le/ions Sur Les Fonctions de Lignes, Gauthier-Villars,
Paris, 1913.
[29]V. VOLTERRA and J. PERES, Th~orie Gen~rale Des Fonctionnelles,
1. Gauthier-Villars, Paris, 1936.
1. Introduction
Let us consider a smooth (Le. COO or Ck ) nonlinear control
system
(1)
f(x,u)
X ,
+ ..
and K : X
+ I.
( 2)
= K(x(T + f
o
J(xo'u(o
L(x(t),u(tdt
~(t)
= f(x(t),u(t
, x(O)
tells
H : X x ..n x U
us
to
introduce
the
pseudo-Hamiltonian
I defined as
389
M. Fliess and M. Hazewinkel (eds.), Algebraic and Geometric Meth.ods in Nonlinear Control Theory, 389-4fJ7.
1986 by D. Reidel Publishing Company:
390
(4)
oH
xi = F-<x,p,u) (=fi(x,u
Pi
(5a)
(5b)
1, , n
oH
- oX i (x,p,u)
Pi
x(O)
Xo
(6b)
p(T)
- - (x(T
ox
oK
be
E
(7)
optimal
(in
the
sense of (2
is
that
for every
[O,T]
H( x * (t), p* (t), u* (t
(.
oH (x,p,u* )
-0uj
1, ,m
391
Pi
oH
(x,p,u)
OPi
oH
- ox. (x,p,u)
Yj
oH
(x,p,u)
oU j
x.
1
(10a)
(lOb)
1, ,m
and a
the
l, ... ,n
outputs
Yj
of
an
system,
resulting
boundary conditions
Now for
this
arbitrary smooth
function H(x,p,u),
equations
(external forces,
torques etc.)
the
displacements
and
the
outputs
the
corresponding
notice
that
to
every
Hamiltonian
system
(10)
there
is also
oH
uU j
-~--
reasoning
in
(this
Apply a feedback u =
such that
i.e.
when does
there exist a
392
em
'
-~-(X,p,U)1
Pi
(11)
~H
~x
(X,p,U)1
oR
a(x,p,U)
= ~p.
~Ii
= a(x,p,U)
(x,p,U)
1.
= ~x
Ii
1, , n
(x,p,U)
1, ,m
where S : U
x Im + I
is
smooth
function.
= ~S
~y
Moreover
R is
(U , y) together
with
~R
OS
~H
~H
(13) ~Uj (x,p,U) = ~Uj (U 1 ,,Um' -~-,
u 1 ,-~-)
um
(In fact u =
ay
~S
formation on U
(U,y) , Y -
x I
~S
~U
l, .. ,m
).
The purpose of this paper is to show that it may be advantageous for the understanding of the optimal control problem
(2),(3)
properties
of
the
associated
Hamiltonian
system.
~ui
of
393
2. Symmetries
In this section we will mainly follow [13]. Let us first
introduce
function
some
then
vectorfield
Pi
we
aF
ap-
xi
(14)
notation.
denote
If F : X
the
In + It
corresponding
is
smooth
Hamiltonian
(x,p)
i
aF
- ax. (x,p)
1, ,n
by
XF
(15)
As
{H(x,p,u), F(x,p) }
= Fe (u,
aH
au (x,p,u
aH = (-",-,
aH ,-",-).
aH
with au
(For computing the Poisson bracket
u
vU 1
vUm
0, u
394
(17)
{Ho(x,p),F(x,p)} = 0
BH
*
*
~ (x,p,u (x,p{u.(x,p), F(x,p)}
uU j
J
+ m
I
j=1
Fe(u,
~
u.
since
*(
)
u=u x,p
~~
(x,p,ulu=u*(x,p) = Fe(u,O) = 0,
(x,p,u*(x,p
= 0 by (9).
Hence,
with
we
know,
(10)
lating the optimal Hamiltonian HO(x,P), that the optimal Hamiltonian equations
xi
BHo
Bp. (x,p)
Pi
BHo
- (x p)
Bx.
'
(18)
have
1, , n
first
integral.
Then by
standard
techniques
from
the
complexity of
solving (18),
it also
395
1, ,m
Hence
02H
ui
uj
(x,p,u)
*
{F(x,p), u.(x,p)}
( 20)
oF
= --~-UY
Therefore a
(o,u* (x,p
1, ,m
set of
g(x)
an
n-vector.
{pTf(x,U)-L(X,U),pTg(x)}
with [
(21 )
In
this
case
(16)
reduces
to
= pT[f(x,u),g(x)]+g(L(X,u = 0,
[g(x), f(x,u)]
g(L(x,u
The vectorfie1d g is
[5]. If g
{HO,p g(x)}
(18)
and
to
the
factor
{HO ,Pl}
o~
= o~
Then by (17)
= - oX I =
and so we may restrict
(2n-1)-dimensiona1 manifold F = PI = constant
out
dimensional set
by
the
Xl-coordinate
to obtain a
(2n-2)-
396
ou.*
( 22)
~ (x,p)
changing f(x,u)
into f(x,a(x,v))
and
associated
Hamiltonian
system and
its
symmetries
do
~x,p,u)
uj
F(x,p,u)
0,
=0
for
all
1, ,me
written as a function of y.
(x,p,u)
such
that
be
uj
This
to
the
search
for
(23)
o whenever
397
oH
u.
-0- (x,p,u)
o.
it
configuration
coordinates x = (xl"" ,x n ). We
prolong g to a vectorfield g on the space of configuration
and velocity coordinates (x3:) In coordinates
n
n
og.
0
1 (x) x. -0- Then g is
g(x,;')
+ 1:
1: gi(x)
.
-lox.
J oXi
1
i=l
1,JJ
called a symmetry of L i f the derivative of L along g is
ox:-
problem is
Minimize
straints ~i
the associated
Ham~ltonian
~: ). Restricting ourselves,
1:
i,j=1
og.(x)
PJ.
-,,-'0~"-1.-
ui
+ 1:
i=1
oL
-0-
Xi
g. (x)
1
A.J. VANDERSCHAFT
398
oL
with Yi = Pi - -.-' Now suppose g is a symmetry of L(x,x)
oX i
oL
I ox:-
g(L)
i=l
Since x.1
(x,x)gi(x) +
oL ogj'
-- - - x i
i, j= 1 ox. ox.
n
(26)
og.
p j ox J
i
j=l
oL
oi:
i,j=l
Hence
og.
(P.- - )
we
have
oX i
i,j=l
obtained
F(x,p,u) = 0 whenever Yj
ox.J ox.1
i,j=l
og j(x)
J u
oL og.J x.
____
n
Ui
ox 1.
YJ
pair
of
oR -- 0
au:-
If
F(x,p,u)
Ui
F can
be expressed as
3. Minimality
We will now indicate the relationship with minimality, by
showing that conservation laws (F,F e ) for (10) with Fe
identically zero can only occur if the Hamiltonian system
(10) is not minimal.
Consider
the
different u
U.
By
(w.r.t.
IG ~
'G}.
If (F,Fe) is
conser-
then
G
the
Jacobi-identity
'G. Hence
i f F is
implies
that
{G,F}
399
=0
for amy
cannot be accessible.
For most purposes [11,12] a slightly different Lie algebra
of functions is more useful than
G. Instead of looking at
x.
( 27)
Pi
1, . ,n
1, ,m
u.
v.
oH
au.-J
Yj =
(x,p,u)
with
~
u1
,"',
g~
on X x In x U together
with
all
(repeated)
Poisson brackets with H(x,p,u) and all (repeated) differentiations to ~ So an element of Ge is a linear combinaui
02
{H(x,p,u), {H(x,p,u),
oukou"t
~
u
i
{H(x,p,u) ,
~~.}}}
J
The
Poisson
an
function
along
the
vectorfield xi
= g~
(x,p,u),
i
400
2n,
< 2n+m,
dim Ge(x,p,u)
(x,p,u),
and
suppose
taking
all
Hamiltonian
vectorfie1ds
..,J, . ,
oU
{H(x,p,u), F~(X,P)}
V(x,p) E N,
1, ,k
laws
dim dGe(x,p,u)
tions
F(x,p)
everyw~ere
(F~,F~)
< 2n+m,
with
then
such
there
exist
Alternatively,
non-trivial
if
funcfor
401
every G
Ge
1, ,m
{{H,
{F l'
- {{~~.J
~~),FO}
J
~~)
J
- au.
J
{H, F1 }
1, ,m
{H(x,p,u), Fi(x,p)}
F i + 1 (x,P)
0,1,
If for a certain r,F r + 1 happens to be zero then we have obtained a conservation law (F ,Fe) with Fe == O. In the more
suggestive
notation
of
writing
%r instead
of
the
Poisson
bracket with H (see the remark above) we are therefore looking for functions FO(x,p) such that for a certain r
d r +1
(32) - - F
dtr+1 0
<
00
d FO
vU j
dt
o , while -,,- - - .l -
= 0, i " r, j
1 , ,m
==
0) then so is
G and
Ge
402
will
adopt
the
following
(classical)
definition
of
~
(x(t),p(t),u(t
U
o2H
(x( t) , p( t), u( t
ou.ou.
(33b) det
l.
l, ,m,
a ,
[O,T]
[O,T]
where (x(t),p(t
the
implicit
[4,8,9])
there
function
theorem.
In
the
literature
(cf.
such singular controls. They are mainly based on the observation that if u(t) satisfies (33a) then also all time-derivatives of (33a) have to be constant zero:
(34) d k oH (x(t),p(t),u(t
dtk oU j
(Here
%t is
differentation
a ,
along
(5)
[O,T], k
with
u(t)
0,1,2,
a
time-
function. )
The equations (34) give additional conditions on the optimal
controls ul'
* 'um* These additional conditions can be investigated by calculating the expressions
dk
oH
-o-~(~x,p,u),
.
i,]
dt
j
*
*
the conditions on u1, ,um
ui
l, ,m, for
k) 1,
just
like
o2H
(x,p,u).
(lui (lu j
This problem of evaluating singular controls
l.]
is very much
submanifold of
con-
403
tained in Yl
Definition 4.
= Ym =
(10)
Let
In. A submanifold N
iant (c.!.)
if
definition
([6,7]).
of
controlled
invariant
distribution
property that
the feedback
maxima1
c.!'
submanifolds
contained
in
the
(35)
=:-
So -l(Pi(m.
0, j=l, ... ,m}.
'!b.eerea
Suppose
{(x,p,u)I~:.(x,p,u)
J
( Pi < m,
i = 1J
,m. Then
p.
by
defi-
d 1
oH
nition of Pi the functions ou
oH ' dt
d oH
oU i , ... , ~oui do not
i
p.+l
dt
d 1
oH
depend on u, while
--- is affine in u for all i
p.+l oUi
dt 1
404
l, ,m.
I ~~i
N = {(x,p)
(36)
i = l, ...
while
the
(x,p)
p.
oH
(x,p)
= - - =:P. uU.
dt
,m}
necessary feedback u.
r, ... , m,
a i (x,p), i
0,
is
(37)
-~-uU.
1
0, i
1, ,m
p.+l
1
oH
- - are affine in u and
Pi+l ~ui
A(x,p) has rank m, (37) has dJ unique solution
oH
----0- (x,p,u),
dt r u i
formed vectorfield
r = O, ,Pi' along
oH (x,p,u)
-0Pi
(38)
Pi
yields
= -
dr+l
oH
xi
dt r
ui
the
u=a(x,p)
(x,p,a(x,p.
(39)
dr+l
oH
- - -,,- (x,p,a(x,p
r
1
dt +
uU i
while
d 1
~--77
dt
Pi+l
oH
-,,-
uU i
trans-
u=a(x,p)
Since
for r
d r . OH
- - - - - does not depend on u we have for r
dt r oUi
p.+l
feedback
(X,p,U)1
~-o--
Differentiation of
(x,p,a(x,p
dr+l
oH
-:-:r+r au:dt
O,l, ,Pi'
O,l, ,Pi- l
(x, p, u)
is by definition of a zero.
405
p.
d 1. oH
oH
finition of Pi the functions -0-- , , --p-. au. ' i=1, ,m,
ui
'
d 1.
1.
all have to be zero on an arbitrary Jontrolled invariant
the maximal c. i.
[]
I dp.hdx.
i=1 1.
1.
restricted to N is non-degene ra te, and hence def 1.nes a sym-
form wN on
N.
Moreover
the
resulting
controlled
is given by
Olif
oH
aU i
406
has
full
.
di str1but10n
contained in the k ernels of d (OH
~) ,
uU l
oH
,d(~)
uUm
In particular in the
oH
(x p u) is
ou
"
defined as p+l.
of
the
to
integers Pi and
the
the
full
feedback u
rank
=
condition
a(x,p) is
then
of
a
407
References
[1]
[2]
[3]
[4]
[5]
[6]
[7]
[8]
[9]
[10]
[ 11]
[12]
[13]
[14]
[15]
V.I. Arnold, Mathematical Methods of Classical Mechanics, Springer, New York, 1978
R.W. Brockett, Control theory and analytical mechanics,
pp 1-46 of Geometric control theory (Eds. C. Martin and
R. Hermann), Vol VII of Lie Groups: History, Frontiers
and Applications, Math Sci Press, Brookline, 1977.
P.A.M. Dirac, Lectures on Quantum Mechanics, Belfer
Graduate School of Science, Yeshiva University, New
York, 1964.
R. Gabasov, F.M. Kirillova, High order necessary conditions for optimality, SIAM J. Control~, 127-168,
1972.
J.W. Grizzle, S.I. Marcus, Optimal control of syste.s
possessing symmetries, IEEE Tr. Aut. Control, 29, 10371040, 1984.
R.W. Hirschorn, (A, B)-invariant distributions. and disturbance decoupling of nonlinear systems, SIAM J. Control & Opt, 19, 1-19, 1981.
A. Isidori, A.J. Krener, C. Gori-Giorgi,S. Monaco,
Nonlinear decoupling via feedback : A differential geometric approach, IEEE Tr. Aut. Control, 26, 331-345,
1981
A.J. Krener, The high order maximal principle and its
application to singular extremals, SIAM J. Control &
Opt, 15, 256-293, 1977.
F. Lamnabhi -Lagarrigue, Doctoral thesis, Paris, May
1985.
A.J. van der Schaft, Symmetries and conservation laws
for Hamiltonian system with inputs and outputs: A
generalization of Noether's theorem, Systems & Control
Letters, 1, 108-ll5, 1981.
, Observabillty and contro11ability
for smooth nonlinear systems, Siam J. Control & Opt.,
~, 338-354, 1982
, System theoretic descriptions of
physical systems, Doct. Dissertation, Groningen, 1983.
Also: CWI Tracts No.3, CWI, Amsterdam, 1984.
, Symmetries in optt.al. control, Memo
491, Twente Univ. of Technology, 1984.
, Conservation laws and symmetries for
Hamiltonian systems with inputs, pp 1583-1586 in Proc.
23rd CDC, Las Vegas, December 1984.
, On feedback control of Hamiltonian
systems, to appear in: Proceedings MTNS Conference,
Stockholm, 1985.
Discrete-Time Systems
Abstract. The paper deals with a survey presentation of recent results obtained in the area of nonlinear discrete time systems following an approach based on the functional
expansions of the state and output behaviours.
1. INTRODUCTION
412
fact that the difference structure, which involves composition of functions, does not imply any particular property but in the case of an
affine dependence on the state in the dynamics and in the output equations (state affine system). It must be also noted that the difference
structure does not preserve any "locality" around the initial state xo.
In effect a local analysis of the behaviour of the system can be performed around any point attained under the free evolution, or around
any fixed evolution, by considering variations of the controls. This
must be considered the basic idea of the approach here proposed. The
state and output evolutions at each time t are nonlinear functionals
depending on the initial state Xo and the input sequence u(O), .. ,u(t-1).
Taylor-type expansions can be performed to develop those functionals in
powers of the controls thus obtaining the discrete Volterra series associated to the given system. The Volterra kernels will represent, for
any fixed xo, the coefficients of the development around the point attained under the free evolution from Xo ; hence the structure and the
properties of the Volterra kernels characterize the local behaviour of
the system around the free evolution. The mentioned functional expansions will be presented in the next section where the structure and the
properties of the Volterra kernels are studied. The study of those properties is at the basis of the realization results pointed out in section III. In particular the realization of finite family of stationary
kernels and the application to an approximation result.
Particularly interesting properties arise by considering discretetime systems with invertible drift term (fo(x) = f(x;O)). In this case
a family of vector fields can be associated to the discrete-time system.
The main property of those vector fields, denoted in the sequel by
Gi's, is that each Volterra kernel can be expressed as iterated Lie derivatives of the free evolution. The imposition of suitable properties
to the Gi 's ref~ects into suitable structures of the difference equations and vice-versa. This is the case of a realization problem studied
in section III where the knowledge of that correspondence is crucial to
solve the posed problem.
The role played by the vector fields Gi's in the analysis of the
system is clarified in the sections IV and V where the existence of
invariant structures in the state space and the controllability property are studied.
Section VI deals with the discretization of a linear analytic
continuous time system driven by controls which are piecewice constant
on time intervals of prefixed small amplitude. Approximated discretizations and the implementation of sampled continuous control laws on continuous processes are also briefly discussed. The results there recalled are in our opinion basic in the study of control scheme which involve digital components. Finally in the last section, the solution to
some control problems is presented ; we note that the restriction to
the class of linear analytic control systems, there adopted, could be
overcame on the basis of the functional expansions presented in [13].
In the following sections the main results are recalled without
the proo~s which are contained in the referenced papers.
413
(4)
Qk')
il"" .i k =l
a (.)oX.
"ox
"
'1
'k f
xg '....
g .
l'k
k~1.
Qk
414
2 /
w2(t"I"2;xo ) =\
'
t-, -1
'I = '2 ~01 oQ 2oQo 1 (h)lxo
To give the general expression of the typical k-th kernel let us introduce the following notations. Let r 1 k the number of different
'i~S in wk ; let 01 , ,ar the number of coincident instants of time
.I ai = k) ; moreover, just for the compact/TIess of the expression,
(1=1
let to = 0, tl = 1, t; = ti-l + ai-I' i = 2, ... ,r and t = '0. One has
.
't
't -'t -1 1
.
.
_
r
[
1
(7)
wk(t"I'"k'xo) - Qo . ~asoQo s-1 s ] s=r oh Xo
415
where in the right hand sides, we denote by Pk(T) the following differential operators :
Pk(T)(.) = Q~oPkoQ~T (.) , Pk(O)(.) = Pk()
_
ak(.)of-T-1
L
. 1a
x. ."
.. aX.
1 1 x (g '1
... g.' k)1
.
'1'"k=
'1
'k fT+
fT
In general, with the previous positions for the indices, one has
( 9)
( 10)
Wk(t;Tl'.,Tk;x o ) =[Hcr (T,Q, )]1 oQ~hlx
s s s s=r
0
We note that the coefficient of u(T1) in y(t) is equal to the evaluation at Xo of the application to h~ft of the differential operator
P1(T1) and similarly for the high order kernels.
A relevant fact now happens ; namely the action of the differential operator P1(T) on a given function is equivalent to the Lie derivative with respect to Pl(T)(Id)lx (i.e. P1(T)(Id)lx is a vector field).
For :
ah f- 1
ah af- 1
P1 (O)(h)lx = a~ If x g = a~ x a~ If x g
ah
= a~ x x P1(O)(Id)lx
416
-[Jf(x)]
-1
-1
Jg(x)[Jf(x)] g(x)
(11 )
Gi(T)(x) = [Jf(X)]-T(Gi(O)(fT(x))), i ~ 1, T E Z
It can be shown that those functions are vector fields ; denoting by
Gi(T)(A) the Lie derivative LGi(T)A, of a given function A, one has:
Theorem 1 ([13])
Any
1, T E Z can be decomposed as :
. L. C(il,,i~)Gi (T)o .. oG i (T)
(12)
1 1"", 1 ~ =1
1
~
differentia~ opera~or
Pk(T) =
~=1
Pk(T), k
il++i~=k
417
418
where
<1
n
aa h
il
i
LG( T;X )(h)
\
a
a
x
G
(T;X)
...
G
a(T,X),
.
/... 1 x ... x.
.
'1'"a='I'a
with G'(T;X) the i-th component of the function G(T;X)
The application of theorem 2 to discrete-time linear and bilinear
systems restitutes the well known separability properties ([2]).
Let us assume for the sequel of the section that the kernels Ai
in definition 1 are finite and stationary (i.e. Ai depends on
t-Tl, ... ,t-Ti). It is well known from the continuous-time realization
that a finite family of stationary kernels can be realized by a bilinear system if and only if the kernels are differentially separable
([16]). The discrete-time counterpart of this result has been stated in
[17] involving an adequat generalization of the separability property
and polynomial-affine systems (i.e. systems which are polynomial in the
input and affine in the state). More precisely:
Theorem 3 ([17])
A finite family of stationary discrete-time Volterra kernels has a polynomial affine realization if and only if the kernels are separable
i . e. :
1
al
a2
a~
=
I
,Wk l(t-T 1)W k 2(T n -Tn ) ... Wk r(Tn -Tn)
aIa r ,
' N l N2
'
N r -l
Nr
a
where W'., i = 1, .. ,r, are matrices of appropriate dimensions, which
- - k"
can be factorized in the form :
ai
ai
ai
Wk,i(TR,. I-TR,.) = Pk~i(TR,. I-T)Qk,i(T-TR,.)' V T, T. ~ T < TR,. .
,-,
,-
,-1
This result is applied in [17] to the realization of the fi-rst k kernels in the input-output map of a system of the form (1) assumed to be
initialized at an equilibrium point Xo (f(xo;uo) = xo). A generalization of (7) to the case under study enables one to show that the kernels associated to the overmentioned input-output map satisfy the separability property of theorem 3 ; hence one has:
Theorem 4 ([17])
The first k kernels in the input-output map of (1) at an equilibrium
point can be realized by a polynomial affine system.
we note that no invertibility of the drift term f(-;uo) is assumed in theorems 3 and 4. Under such an hypothesis further properties
of the factorization arise enabling a lower dimensional realization
procedure ([17]).
We conclude this section by noting that general results on the
existence of realizations as in [18,5] could be expressed in terms of
the vector fields Gi'S.
419
420
An algebraic characterization of the notion of invariance can be obtained on the basis of the differential operators introduced in section II,
in particular if the drift tenn fo is invertible. Denoting by no an input subset such that for any -u E no, f(.;u)
- be invertible, one has:
Theorem 5 ([24])
Assume fo be invertible and u E no, a distribution 6 on Rn is globally
invariant with respect to the dYnamics (1) if and only if :
(i) (foh.6 = 6 (fo)
(18)
(ii) [G i (O),6] c 6, V i ~ 1
Remarks
(i) The previous result gives a characterization of invariance which
does not depend on the control u. It makes reference to the general
class of systems (1) and in particular (2) ; in this last case (18) must
be considered the equivalent of (15).
(ii) Recalling that in the characterization of the k-th kernel,
just Lie derivatives with respect to the first k Gi 's are involved, it
can be proved that the validity of (18) for 1':: i So k implies the invariance of the integral structure of 6 with respect to "approximated"
state evolution specified by the first k kernels.
A less restrictive concept of invariance can be,defined by means
of the local dynamics associated to (1).
It is the following dynamics:
~(t+l) = (f~i(t)-lof(,;y)of~(t))(t;(t))
i(t+l) = i(t)+I, t;o = Xo
(19)
in this sense
Definition 3 ([24])
6 on~o' a neighborhood of xo, is locally invariant until time t if it
is invariant with respect to the dynamics (19).
Theorem 6 ([24])
6 on~o is locally invariant until time
OriTYif :
[G j (i),6J c 6
for any j
1 and i
(1) if and
(20)
O.. ,t-I.
Remarks
(i) Let us note that, for a discrete time linear system with an inverti-
421
ble drift term : ~(t+1) = Ax(t) + B~(t). t~e flat distribution spanned
by the vectors A-1B = G1(i-1) for 1 ~ i ~ t defines a foliation into
linear varieties which evolve one into the other until time t.
(ii) On the basis of the results of section II. it can be shown
that if (20) hold true for 1 ~ j ~ k. then the invariance property is
verified by the "approximated" evolution which involve only the first
k kernels. until t.
It is well known that. following a geometric approach. the solution to several feedback control problems can be formulated in terms
of the existence of invariant structures with respect to the feedback
system dynamics ([12]. see also [25] for discrete-time systems). The
algebraic characterizations briefly recalled in this section, enable
to formulate synthesis problems until a fixed time t and/or a fixed
order k of the kernels in geometric terms and is moreover at the basis
of the construction of feedback solutions.
V. CONTROLLABILITY
It is well known that. for the class of linear analytic continuous-time
systems defined by the vector fields A(x) and B(x), the local weak controllability ([26]) can be studied in terms of the Lie algebra generated by A(x) and B(x). In particular, denoting by C that Lie algebra,
dim C(xo) = n is necessary and sufficient for local weak controllabillty at Xo and dim C(x) = n. V x. is necessary and sufficient for weak
controllability. Similar results can be stated for the discrete-time
systems of the form (1) with reference to a particular kind of controllability property.
Intuitive arguments can be used to give a first idea of the fact
that the vector fields Gi(T)'s and their brackets playa prime role
in the characterization of controllability. At this end let us consider, just for simplicity of notations, a system of the form (2) with
invertible drift f ; assume, moreover, to be interested in the characterization of the tangent structure at Xo to the reachable set from Xo
by means of "local trajectories". Local trajectories are assumed to be
of the form :
-1
-1
(f+uog) .. (f+u t _1g) (f+(u t - 1+t-1)g)(f+(u o+0)g)
i.e. trajectories of a local dynamics of the form (19) around a reference dynamics (f+uog) .. (f+ut-19) with (uo ... Ut-1) E
such that
(f+Ui1) be invertible. The reachable set by means of local trajectories can be thought the equivalent of the locally weakly reachable set
in zero unit of time characteristic of continuous time systems.
Let us now consider the curve, CUO(o), around Xo which gives
this reachable set at time t = 1 ; it is
Uo
-1
C (0) = (f+uog) (f+(uo+o)g)(x o )
u
The tangent vector to C 0(0) at Xo is. obviously
Q5
422
u
O(XO) = a;
(8)
o s o=0
sk u
(11)
u
L. k~PkO(f+Uog)-lIX = a~ I
soG 10 (O)(x o ) +
o s0
=0 k>l
0
0 s =0
- 0
a~ I
L
k>2
s~
Uo
~k k (f+uog)
.
-1
Ixo
Uo
= G1 (O)(xO)
Uo
Uo
where Pk and G1 denote the operators in (8) and (11) associated to
a drift of the forms (f+uog). A development of G~o w.r. to Uo gives
([131) :
k
.
Uo
Uo
G1 (0) = L j(T Gk 1.( 0)
k>O . +
this is enough to understand that the vectors Gk(O)(Xo). k ~ 1. characterize the tangent space at Xo of the reachable set under local trajectories at time t = 1. The same arguments can be used to show that
the tangent vectors Gk(i)(xo). i = 0 ... I. characterize the tangent
space at Xo of the reachable set at time I under local trajectories of
the form :
-I
(f+uo~ (f+uog+sog)(f+uof+sI_1g)(xo)
When local trajectories of more general form are considered Lie
brackets of the vector fields Gi 's appear.
u .u Let us consider. for example. the two dimensional surfaces.
S 0 l(So.St). around xo. parametrized by Uo and u1 which characterize
the reacnab e set at time t = 2 ; i.e.
SU O.u 1 (sO.sl) = (f+uog)-1(f+u1g)-1(f+u1g+s1g)(f+Uo9+sog)(xo)
two independent tangent vectors at Xo are given by :
uo u 1
a
uo u 1
i = 0,1.
vi
(x o ) = as.,
S
(so.sl)
, so=sl=O
By performing the computation for vI one has
uo ,u 1
1 u1
vI
(xo ) = (f+uog)- G1 (O)(f+uog)(x o )
which can be develop_ed in powers of Uo and u1 and Lie brackets between the Gi'S will appear ([131, theorem 3).
By means of similar arguments it has been obtained in [271 a result on the controllability which can be expressed in terms of the vector fields Gi 's as follows :
Let us consider the control set U such that for any u U,
f(. ;u) be invertible and denote by'T the pseudogroup of diffeomorphisms
423
F~(X) = ~I
eO(A+UB)(I)1
i > 1
(24)
1
au 1 u=O
x
An explicit expression for F~(~) has been given in [28], one has
1
0.,
(_I)J=
L k.+i
. 1 J
.0J=
i-l- ;
i
kl'''' ,ki::O IT k.!(k.+l)! IT (j+ L kQ,)
j=l J
1
j=2 Q,=1
kl
k.
> 1
adA B. ad A1B(I)1 oA
Fi(x) = 1.
L k.
. 1 J
(25)
(x)
Remark
In the linear case, A(x) = A.x and B(x) = B, one easily gets from (23)
424
o
and (25) : Fo(x)
0
eAo x, FI(x)
fO eArdr.B, Fi(x)
0
. 1.
0,1>
o
G2(o)
k +k +1
k2 0 1 2
ki k2_1
k ~O(-I)
kI !(k 2+I)1 adA (adA
w adB)B
1-
2,
CO =[.(:D,.(O]
C.co
k2~1
425
It is a matter of computation to verify that in case of an approximation of order two in o. a good choice for the sampled feedback is :
u(ko) = Fx(ko) +
%F(A+BF)x(ko)
wet) = a + SV + 2 Lf+ga+6V(a+sv)
(26)
If
426
427
Theorem 8 ([33])
oQ
p
n-1
.
J
CiA
if and 0.D.lY if :
i 1... i
Qv
d.
di
=.0 Q
.
J
CiA ,
vOQo'(h i ) = 0, vi
{l, ...
,q}, V v ~ 2
(31)
We note that it follows from the proof of theorem 8 that the feedback
system is immersed into an input-output decoupled linear system; i.e.
each input ui affects only the corresponding output Yi. Moreover the
dynamics of each input-output pair can be arbitrarily assigned.
To conclude we note that the use of linear analytic feedback of
the form (30) is restrictive for discrete-time systems also if the given system is linear analytic. This can be easily understood thinking
to the fact that the characterization of the input-output behaviours,
as mentioned in section II and developed in [13], involves an infinite
family of vector fields Gi'S as it happened in the general nonlinear
situation. More precisely, by assuming a linear analytic feedback,
only the kernels which do not involve coincidence of instants of time
can be modified, this motivates assumption (31). It has been shown in
[34] that by considering a general nonlinear feedback, assumption (31)
can be removed.
On these bases a result stated in [35] with respect to the invertibility property can be strengthenned. Assuming a p-inputs, q-outputs with P ~ q, linear analytic control system, a sufficient condition
for the left invertibility is that the previously introduced matrix
has full column rank equal to p.
* S. Monaco
Dipartimento di Informatica e
Sistemistica, Universita di Roma
"La Sapienza", via ElAdossiana 18,
00184 Roma, Italy.
** D. Normand-Cyrot
Laboratoire des Signaux et Systemes, CNRS-ESE, Plateau du Moulon
91190 Gif-sur-Yvette, France.
REFERENCES
[lJ LJ. Tarn, D.L. Elliot and T. Goka, "Controllability of bilinear
systems with bounded controls", IEEE Tfta.YL6. Au;(;. Cant., 18,298301, 1973.
=
[2] A. Isidori, "Direct constructions of minimal bilinear realization
from nonlinear input-output maps", IEEE Tfta.YL6. Au;(;. Can.t., 18,
626-631, 1973.
=
428
[3] M. Fliess, "Un codage non commutatif pour certains systemes echantillonnes non lineaires", In6o. Con.t., 38, 264-287, 1978.
[4] E.D. Sontag, Potynomial Re6pon6e Map~, Lect. Notes in Info. Sci.,
13, Springer-Verlag, Berlin, 1979.
[5] LD. Sontag, "Realization theory of discrete-time nonlinear systems: Part 1. The bounded case", IEEE'T!tan6. Cbl.c..ui:t6 andSy~t:.,
26, 342-355, 1979.
[6] D. Normand-Cyrot, Theo4ie et: p~atque de6 ~y~t:eme~ non linea-tJLe6
en t:emp~ ~c..~e~ , These de Doctorat d'Etat, Paris-Sud, Orsay,
1983.
[7] W.J. Rugh, NonL{ne~ Sy~t:e~ Theo~y, t:he vott:~/W~en~ app~oac..h,
The Johns Hopkins University Press, Baltimore, 1981.
[8] B. Jakubczyk, "Existence and uniqueness of realizations of nonlinear systems", SIAM J. Cont:. Opt:., 18, 455-471, 1980.
[9] C. Lesiak and A.J. Krener, "The existence and uniqueness of Volterra series for nonlinear systems", IEEE T~an6. on A.C., 23,
1090
1095, 1982.
=
[10] M. Fliess, "Fonctionnelles causales non lineaires et indeterminees
non commutatives", Butt. Soc... Mat:h. FMnc..e, 109, 3-40, 1981.
[11] P.E. Crouch, "Dynamical realization of finite Volterra series",
SIAM J. Cont. Opt:. , 19, 177-202, 1981.
[12] A. Isidori, NonL{ne~ Co~ot Sy~t:e~ : An I~odu~on, Lect.
Notes on Info. and Cont., Springer, Berlin-New York, 1985.
[13] S" Monaco and D. Normand-Cyrot, "Developpements fonctionnels pour
les systemes non lineaires en temps discret", Report R-89~ CNR
I-taly, 1984, submitted for publication.
[14] S. Monaco and D. Normand-Cyrot, "A Lie exponential formula for the
nonlinear discrete-time functional expansions", MTNS'85,
Stockholm, 1985.
[15] S. Monaco and D. Normand-Cyrot, "On the realization of nonlinear
discrete-time systems", Sy~t:. and Cont. LetieM, 1, 2, 145-152,
1984.
[16] R. W. Brockett, "Vol terra series and geometric control theory",
Aut:omatic..a, 12, 167-176, 1976.
[17] S. Monaco and D. Normand-Cyrot, "Input-output approximation of
nonlinear discrete-time systems from an equilibrium point", submitted for publication to the IEEE on Aut. Cont., preliminary version, 23rd IEEE Conf. on Dec. and Cont.,
90-95, Las Vegas,
1984.
[18] B. Jakubczyk, "Invertible realizations of nonlinear discrete-time
systems", Proc. Pri nceton Conf. Inf. Sc. and Syst., 235-239, 1980.
429
430
TI~m
NONLINEAR SYSTEMS
J. W. Grizzle
Department of Electrical and Computer Engineering
University of Illinois
1406 W. Green Street
Urbana, Illinois 61801
ABSTRACT
A local treatment of the (restricted) block input-output decoupling problem is given. The major tools employed are the invariant and
locally controlled invariant distributions which have recently been
extended to the discrete time domain.
1.
INTRODUCTION
= f(x k )
E uig(xk )
i=l
1.
to be feedback equivalent to a parallel cascade of single-input/singleoutput linear systems plus an unobservable nonlinear part. It should
431
M. Fliess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory, 431-439.
1986 by D. Reidel Publishing Company.
432
J. W. GRIZZLE
This section fixes the notation and setting employed in this study
of discrete time nonlinear systems and summarizes some results of [8,9]
on controlled invariant distributions.
Definition 2.1:
A nonlinear
and h : M-+ N
the system,
spaces, and
defined by
7T :
-1
(~)
Definition 2.2:
A feedback function y is a bundle isomorphism from B to B; i.e., y is a
diffeomorphism such that the following diagram commutes:
(2.1)
433
for each local coordinate chart (of the given open cover). ~ is locally
controlled invariant if for each boEB there locally exists a feedback
y (i.e., y is defined on some open set about b o ) such that ~ is an invariant distribution of the closed loop system E(B,M,foY).
It tu+ns out that locally controlled invariant distributions can be
quite easily characterized. In the following, V(B) = {XE TBI7T*x= O}
denotes the vertical distribution on B.
Theorem 2.1:
If ~ is an analytic involutive locally controlled invariant distribution
on M, then for each vector field XE 7T~1(~) and bE B,
(2.3)
Moreover, i f ~, f~l(~) nV(B) and f restricted to the fibers of B all
have constant rank, then (2.3) is also sufficient for ~ to be locally
controlled invariant.
Remark: Due to the analyticity assumption, there will always exist open
dense subsets M' C].I and B' C B, 7T (B') :::l M', on which the aforementioned
constant rank hypotheses are satisfied.
Though it is not true that a maximal locally controlled invariant
distribution contained in a given distribution always exists, something
very close to this does in fact hold.
Definition 2.4:
An analytic distribution
f*~
C Mf(b
+ f*(b)V(B)
J. W. GRIZZLE
434
3.
x,
-)
h i (x) = h i (x
(3.1a)
1
HI i - -1
-i-l i -HI
-HI
f(x,u , .. . ;u
)R f(x,u , . ,u
, u,u
, .. ,u
)
(3.lb)
and
i,j=l, ... ,L
Proof:
Assume that such equivalence relations exist; consider one of them, say
Ri. Then (3.1) gives that ~ projects, in a set theoretic sense, to a
system on M/Ri:
435
from which it is clear that yi does not depend on u j for j ". i. On the
other hand, for fixed x and ~, define xRix if, for arbitrary input
1 .. ,u.HI) an d
sequences ( Uj ) 00'=1 and (hUj )00'=1 such that . u. = ( u.,
hI
hi,2l i hi+l J hHl
J
J
J
h
,u.,u.
, ... ,ll.
),
3.2:
436
J. W. GRIZZLE
=a ,
= 1, ... ,
a)
h* (~ )
b)
i
i
f oy ( , u) ,~~ C ~
c)
foy*(span{~
dU J
= , .. ,
Remark 3.2:
1) In continuous time, the noninteracting control problem is usually formulated in terms of controllability subspaces [1] or
controllability distributions [5-7]. To the author's best knowledge,
such distributions have not yet been introduced for discrete time nonlinear systems. Let
(A+ BF)8iC 8 i
c' )
1
i-I i+l
+1
i
BG span {u , ... , u
,u
, ... , u
}C 8
Now b') and c') are the essential ingredients in the definition of a
controllability subspace [1] (one need only add that 8 i is the smallest
such subspace). One therefore sees that the formulation of the decoupling problem arrived at in Definition 3.1 is in fact analogous to those
posed in [1,2,5-7] for continuous time systems. More importantly, b)
and c) will perhaps lead to a notion of a controllability distribution
for discrete time nonlinear systems.
2) Unless one adds a simultaneous integrability [13] condition on
~l, ... ,~, E will not necessarily decompose into a parallel cascade of
subsystems as in [5,10].
The main result characterizing the local solvability of the above
problem is the following. One should note that due to analyticity, the
constant rank hypotheses that will be made hold on open dense subsets
of M and B.
437
b)
i
-1 i
-1 i
for the family of distributions E := f* (~ ) n Tf * (~ )
i)
ii)
Moreover, if f restricted to the fibers of Band Ei n V(B) all have constant rank, then these conditions are also sufficient.
Proof:
Necessity: Suppose the RBDP is locally regularly solvable. Fix boEB
and let (x,u) be a sufficiently small coordinate chart pair about b o and
let u=y(x,u), u= (ul, . ,u t + l ) and ~l, ... ,~t be as in Lemma 3.2. Since
y i~ al~ays of the form y(x,u) = (x,yx(u, b) of Lemma 3.2 gives that
Tf*E l = ~l. To establish ii) of b), first note that V(B) =
a , .. ,y* t+l}'
a
a !
span{y*---l
Now c) of Lemma 3.2 gives the span{y*---.
au
au
a
~uJ
j.;,nCEi so that EinV(B):Jspan{y*---. !j';'i}. Therefore, n ElnV(B)
a
auJ
iEI
:Jspan{y*---. ! j~ I} which establishes ii) of b) once one uses the disauJ
jointness of I and J.
Sufficiency: The key point is that. by the proof of [Thm. 5.1,[5]],
t
condition ii) of b) implies that the family of distributions {E i n V(B)} .-1
is simultaneously integrable [13]. Hence one can choose coordinates
lu= (ul, . ,u t + l ) for the fibers of B, each u i possibly being a vector,
i
a
a
a
such that E nV(B) = span{l"'"
i+l""'-WL Now condition i) of
au
au
au
b,Tf*E i = ~i, implies that ~i is locally controlled invariant [8]. Moreover, as ~E Ei n V(B) for j .;, i, a local feedback iy rendering ~ i invarauJ
iant can always be chosen to be of the form iy!x,u) = (x,u l , ... ,ui - l ,
yi(x u i ) ,u i + l , . u t + l ). Now define y(x,u): = Lyo oty(x,u) =
(x,yi(x,u l ), .. ,yi(x,uY.) ,u t + l ). It is claimed that y is a decoupling
feedback. To show b) of Lemma 3.2, let XE ~ and consider
438
J. W. GRIZZLE
foy(o,u)*X=
(~
+~
ax
au1
(~fl
X y(X,U)
~ ~I
ayj(x,u))X
aX
ayj
a
.
af
-a- XE span{--.} which is mapped into lI 1 by dU from the construction of
x
dUJ
Ei. This establishes Lemma 3.2b. c) is similarly shown.
0
One of the drawbacks of the above result is that it gives no hint
as to how the lIi,s are to be determined. However, in conjunction with
Theorem 2.2 one has the following result.
Corollary 3.1: Let L be as in Theo~em 3.1 and let lIi~*~e the ~aximal
LCI distribution contained in ker h~. Suppose that lI 1 , f;1(lI1*) nV(B)
and f restricted to the fibers of B all have constant rank. Then the
RBDP is locallY regularly solvable i f and only i f n (f;l(lI i *) n V(B))
iE I
+ n (f~l(lIj*) nV(B) = V(B) for all nonempty disjoint subsets I and J
jEJ
of {l,2, .. ,}.
4.
[2]
[3]
[4]
D. Claude, 'DEkouplage des syst~mes non lineai.-res, series generatrice non commutative et algebres de Lie,' SIA}l J. of Control, to
appear.
439
[5]
[6]
[7]
[8]
[9]
[10]
[11]
[12]
[13]
[14]
Eduardo D. Sontag'
Department of Mathematics
Rutgers University
New Brunswick, NJ 08909, U.S.A.
ABSTRACT
This paper proposes a notion of smooth action on a manifold, and establishes a
general integrability result for certain associated distributions.
As corollaries,
various classical and new results on manifold structures of orbits are established,
and the main theorem on preservation of transitivity under sampling is shown to be
a simple consequence.
1. Introduction
One of the basic results in control theory, due independently to [SUI] and [ST],
states that, for continuous time systems, each orbit (set accessible with positiveand negative-time motions from a given starting state) has a natural structure of
immersed submanifold of the state space. This structure is obtained, roughly, as
follows. Given any piecewise constant control steering a state into the state
this
control having switches at times tl' ... , t k , tangent vectors to the orbit at
are
obtained by taking perturbations of the t i . (More precisely, positive- and negativetime controlled motions are used.) When phrased in terms of the integrability of
an associated distribution, this generalizes classical theorems of Frobenius and
Chow.
e,
e
Discrete-time control systems have been studied much less than their continuous
counterparts, and their properties diverge considerably from those of the latter, due
mainly to the possibility of singularities; see for instance [SO]. The paper [JA]
introduced the idea of studying invertible discrete nonlinear systems, and developed
a realization theory which parallels much of the continuous time situation; further
work along these lines was carried out in [FN], [NC], ISS], and related papers.
Invertible systems are those for which transition maps, (one for each fixed control,)
are all (local) diffeomorphisms.
Invertibility is of course a priori an extremely
strong assumption in the context of general discrete time systems. However, for
systems that result from the sampling of continuous time systems, this assumption
is always satisfied. For invertible discrete-time systems, it is possible to give a
Reoearch
441
M. Fliess and M. ifazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory, 441-483.
1986 by D. Reidel Publishing Company.
442
E.D. SONTAG
close analogue of the continuous time orbit theorem. Since times are discrete, it is
of course not possible to take time derivatives as above. Instead, one substitutes
derivations with respect to the values of the controls in each interval.
(The
underlying assumption being that there is some sort of manifold structure on the
control value set. Precise details are given later.)
The first half of this paper introduces a framework that allows to prove. an
abstract orbit theorem, for "smooth actions on manifolds".
This yields as
consequences the above mentioned discrete-time and continuous time results. More
interestingly, the theorem will also imply a number of other results, including
characterizations of "zero-time" orbits of various different types, and an alternative
submanifold structure in the continuous time case (the "input-topology" structure).
The latter will be compared with the more classical "time-topology" structure.
Certain facts that would appear to be obvious, for instance the second countability
of zero-time orbits (but not of arbitrary orbits), turn out to require careful proofs.
It should be pointed out that there have been many other proofs of "orbit
theorems" in the literature, at various levels of generality (see e.g. [KL), [SJ), [KS]).
All proofs are in principle based on the same ideas. In fact, the present approach
is based on the proof in the conference paper [SS), which was in turn motivated by
a general (unpublished) abstract result due to H. Sussmann, which was in turn a
generalization of the proof in [SUI]. We believe that the present result strikes' the
right balance between generality (it appears to imply all others) and level of
abstraction (it can be applied immediately to particular classes of actions), and our
main contribution here in that respect is in exposition.
The second part of the paper concentrates on sampling.
When a continuous
time system is regulated by a digital computer, control decisions are often restricted
to be taken at fixed times 0,6,26, ... ; one calls 6>0 the sampling time. The resulting
situation can be modeled through the constraint that the inputs applied be constant
on intervals of length 6. It is thus of interest to characterize the preservation of
basic system properties when the controls are so restricted. For controllability, this
problem motivated the results in [KHN), which studied the case of linear systems;
more recent references are [BL], [GH). For nonlinear systems, it appears that the
problem had not been studied systematically until the paper [SS] and later
conference papers by the author. As usual for nonlinear systems, it is easier to
study transitivity (controllability with positive- and negative- time motions) than
controllability (but, see [SOl) for various controllability results). The main result
is that, for fast enough sampling, transitivity is preserved provided that the original
system be "strongly" transitive in a sense to be made precise later. The proof in
[SS) is based on a fixed point theorem. The same result is proved here using more
elementary tools, as an almost trivial consequence of the interplay of the time- and
input- topologies (see above discussion) on continuous time systems..
This is
probably the most natural way to understand the sampling results. For expository
purposes, we have also included here a few topics that had already been covered in
the above mentioned conference papers, including a more or less careful treatment
of one-dimensional systems, which provide a good source of examples and
counterexamples.
443
2. Preliminaries
We first give some differential-geometric terminology which generalizes that in
the standard literature. Most is as in [SUI], [KR] and [IS]. All manifolds will be
smooth (COO) and paracompact (hence, Hausdorff and each component is second
countable). Let M be an arbitrary such manifold. A submanifold N of M is an
immersed (not necessarily regular) su bmanifold.
By a vector field X on M we shall mean a smooth vector field (smooth section
of the tangent bundle) defined on an open subset Vx of M; we denote by B(M)
the set of all such X. Given X and Y in B(M), let Vxn Vy = V. We then define
the Lie bracket [X, Y] as the Lie bracket of X restricted to V and Y restricted to
V. If V is empty, the bracket is undefined. Similarly for the sum of X and Y,
and products by constants. This makes B(M) into a "pseudo-" Lie algebra (or, a
"sheaf of Lie algebras"); for simplicity we shall take in this paper the term "Lie
algebra" to imply only partially defined operations.
Similarly, we let Diff(M)
denote the set of local diffeomorphisms on M, with compositions only partially
defined.
As a general rule, if the Lie bracket of two vector fields (or the
composition of two local diffeomorphisms) appears in a statement, that statement
should be taken to mean "if this composition is defined, then ... ".
We denote by X(~), instead of X~, the value of XEB(M) at ~EM. A (possibly
singular) distribution D on M is a subset of the tangent bundle TM with the
property that
D(e)
:= {vET~M
I U,v)ED}
E.D.SONTAG
444
foliation, i.e., there is a partition of M into maximal integral manifolds, the leaves
of D.
The subset 4> of B(M) is involutive if [X,Y] is in 4> whenever (the product is
defined and) X, Yare in 4>. We shall say that 4> is a subspace of B(M) is rXE4>
whenever XE4> and rE3l, and X+ Y is in 4> whenever (the sum is defined and) X, Y
are in 4>.
(" Presheaf of linear spaces" is probably a better terminology.)
The
smallest subspace containing 4> is denoted by {4>} sp (the "linear space generated by
4>"). The smallest involutive subspace containing cP is denoted by {cph (the "Lie
algebra generated by cp"). Finally, the distribution ({ 4> h)D is the Lie distribution
generated by 4>, and is denoted by {cp }LD'
A (smooth) distribution D is involutive if {D} vf is involutive.
Integrable
distributions are involutive (because the vector fields belonging to D are tangent to
the leaves of D, which are submanifolds).
We shall say that the subset C of the manifold V has nzce boundary if the
following property holds: for each UEC there is a smooth curve
, : [0,1]
->
such that ,(0) = u and ,(t) is in intC (interior of C with respect to V) for all
t>O.
(Smooth on a closed interval means smooth in a neighborhood of this
interval.)
2.1. Actions
An action r;
{g",aEA} ), where:
is
an
8-tuple
(M,A,-,{ta,aEA},{Va,aEA},{Ca,aEA},{D",aEA},
A is a set,
- : A -> A is a map of order 2: -(-a)=a,
For each aEA, Va is a manifold,
For each aEA, D" is an open subset of MxV a,
For each aEA, C a is a subset of Va with nice boundary,
For each xEM, there is some aEA and some UEC a such that (x,u)ED a,
For each aEA, ta is a smooth map V" -> V.a such that the composition
a t.a is the identity (the subscript is dropped, and ta(u) is written
simply as ii, when a is clear from the context) and such that ta maps
C a into C. a and the interior int(C,,) into int(C.,,), and
9. For each aEA, g" is a smooth map D" -> M, such that:
2.
3.
4.
5.
6.
7.
8.
445
x .
This proves that the data M, A *, Vb' etc., defines a new action 2;* (taking a
suitable product path, one can prove that C b has nice boundary, if each C a does).
The main objects of study are the orbits of the action 2;:
O(x) := {z
I gb(x,w)
= z, some b,w}.
Note that the orbits of 2;* coincide with the orbits of~. Later we shall introduce
various other actions derived by restricting the set A *; these actions will be very
446
E.D. SONTAG
TC" := {(u,v)
I UEC"
and vETuV,,} .
For each (a,u,v) with aEA and (u,v)ETC" we define a vector field X",u,v as
follows. For any { such that ({,ii.) is in D -a'
8g a(g_a({'u),v)
8v
I .:=,,(v).
(2.1)
Note that g,,(K,,(Ui),v) is the same as what can be denoted, using the word
(-a,a)EA', by g"s,a ({,ii.v). Equation (2.1) provides a vector at {. It is clear how
to compute it in local coordinates (product of a Jacobian matrix by a vector). A
coordinate-free interpretation is as follows. Let 0: be the map that sends VEV into
g,,(g_a(~,ii),v); this is defined in some neighborhood of u in Va' For any real-valued
smooth map f defined in a neighborhood of x, consider the composition f3:= fO 0:.
Then X 8,U, v(~)(f) is the evaluation v(f3), where we are interpreting v as a
differential operator on germs of functions at u. With this definition, it is clear
that Xa,u, v(~) is again a differential operator, and so defines a vector at~. From
the coordinate description it follows that this is not only smooth on ( but in fact
smooth as a function of ((,u,v).
If 1[: M-+M is a local diffeomorphism, we denote by Ad1[ the (partial) linear
operator B(M)-+B(M) corresponding to conjugation by 1[, more precisely:
for X in B(M) and { in M. (In this equation, (1r- 1 ). denotes the differential of
at the point It({).) Note that the domain of the vector field Ad 1r X is
{( I 1[W
1r- 1
is defined and is in Vx } .
gb("w), we denote
Ad1[ = Adb,w
447
(2.2)
In particular, let r(1;) be the group consisting of all the gb("w), for bEA* and
WEC b , for the given action 1;, and let cI>(1;) be the set of all vector fields Xa,U,II'
for (a,u,lI) with aEA and (u,II)ETC a. We introduce the distribution associated to
1;,
D = D(1;) := Ad r (1;)cI>(1;) .
This is an everywhere defined distribution: for each (EM there is by property (7)
in the definition of action a pair (a,u) such that g.a(x,ii) is defined, so Xa,U,1I is
We consider also another distribution, the Lie
defined for all II tangent at u.
distribution associated to 1;,
DL = D L(1;) := {cI>(1;)}LD
The rank of 1; at ( is by definition the rank of D at (; the Lie rank of E at ( is
the rank of D L.
2.3. Example: continuous time systems with time-topology
Given an arbitrary everywhere defined set of vector fields q" we may consider
the (pseudo-) group generated by cI>,
exp(q,) := {exp(tX), tE!R, XEq,} .
Here exp(tX)((), if defined, is the solution at time t of the differential equation
x{t) = X(x(t)), x(O) = (.
The well-posedness theorem for ode's insures that exp( tX)( e) is defined for an open
set of pairs (t,e) (which depends on X).
We may thus introduce the following
action 1; (q, ), the action associated to q,:
One can then consider the distribution associated to the action 1; (q,). We call
this distribution Ad(q"q,). Let a = (f,X), and pick any u in !R and any II in Tu!R,
the latter identified again to!R. Then,
X &,U,JI = fX
It follows that {q,}D
{cI>(1;)}D' so:
448
E. D. SONTAG
Actions obtained in this way will be also refered to as continuous time actions with
time topology, for reasons that will become clear later. They will be one of the
two main types of actions to be associated to continuous time systems.
The classical orbit theorem for continuous time sytems, due independently to
Stefan and Sussmann, says that Ad(cI>,cI is integrable, and that the orbits of I:(cI
are the leaves of this distribution.
These statements will be proved later as
particular consequences of the general orbit theorem (whose proof is in itself
essentially that in [SUI]).
Finally, if D is a distribution, we let exp(D) be by definition exp(cI, where cI> =
exp({D}vr), and define Ad(D,D) as Ad(cI>,cI for this set cI>. The results in [SUI]
also prove that, if cI> is an everywhere defined set of vector fields and if D:= {cI> }D'
then D is integrable if and only if D is invariant, meaning that
Adexp(cI (D)
D .
and that a smooth distribution is integrable iff it has the "integral manifolds
property": for each ~ in M there is an integral manifold of D which contains ~.
is the identity
4,4,
_b(x,wuvw)
I
ov
(2.3)
In" general, if cEA' and t/J is in C c' we shall be interested in partial derivatives of
gc(-,t/J) with respect to components of t/J.
Assume that c = b'bb" is a
factorization of c into subwords, and let t/J = x' wx ' , be the corresponding
factorization of t/J. Then, (assuming that x = gc(y,t/J) is defined,)
(2.4)
is by definition the differential of gc(y,x'(')x") at the point w of Vb' When b'
and b" are empty (c=b), we often omit the subscript and write just dgb(y,w) or
449
even dg(y,w). Differentials with respect to x will be written d x or using the (.)*
notation; that is, dxgc(x,w) is the same as (gc(-,w))* evaluated at x.
Assume that the above c factors as (b',a,-b), with aEA, and consider the
corresponding factorization t/J = (w',u,w).
If v is in TuCa' we can evaluate the
differential in equation (2.4) at v; there results the following formula:
[dagc(y,t/J)) (v) = Adb,w(Xa,u) (x)
(2.5)
Given any xEM and any vector field of the type Ad b w(X &,U,v,,) defined at x, there
is a y such that gc(y,t/J) = x: just let c:= (a,-b) and t/J:= uw. We conclude that
D(x) equals the span of all the images of the maps as in equation (2.4). (In fact,
it will follow from later discussion that it equals in fact the image of just one such
map.)
I
Consider the action E *, introduced earlier, derived from the original action E by
considering the free monoid A * on A. This gives rise to a distribution D(E *).
The following result is trivial, but will be useful later:
Lemma 2.1: D(E) = D(E*)
Proof: By definition, D(E) is included in D(E*), so need only prove the
reverse inclusion. Since f(E) = f(E*), it is enough to show that cJ>(E*) ~ D(E).
Pick any cEA * and any X in Dc' The tangent space to Dc at X is the direct sum
of the tangent spaces to all D a' for the aEA that appear in the factorization of c.
instead of v.
(2.6)
These vector fields will appear later.
Computing in local coordinates, two
applications of the chain rule show that in fact the same distribution wou1d be
obtained from these, because Ya,n,....,,= -X a,u, ....,,'
It is also easy to establish the
following formula, for any (a,u,v) with aEA and (u,v)ETC .
X-a,u,Jl
_ = -Ad a,u (X a,u,V ),
(2.7)
where p = [oii./ou)(v).
Thus, in generating D it is redundant to include the
vector field X. ,il,V when X.,u,v has been already included.
a
450
E.D.SONTAG
III
U c of the type
(w,v,w',v,w")
Definition 2.6: The action E is an action with zero if all the sets U a are equal
(say to U), all the sets e a are equal (to e), and there exists an element
in
int(e}, 0:;::0, with the property that ga(e,O}=e for all
e and
all aEA.
Finally, we give
Definition 2.7: The distribution D had full rank at eEM iff its rank at
e equals
451
e iff D(I:)
does.
Similarly, I:
Definition 2.8: A (real-) analytic action is one for which all the data
(replace "smooth" by "analytic" in the definition of action).
IS
analytic
OW
Let ( be
is
(3.1)
where the sup is taken over all band w such that ((,w) is in Db and
gb(('w) = f
The lemma will be proved in a latter section.
We now show that theorem
(3.1) follows from it, for the manifold case. Claim: the orbits of E are integral
By formula (2.5) and part (2) of the lemma (using r-(), it
manifolds of D.
follows that rank of D at ( ::::: dimO((). Further, the generators of D are, by part
(1) of the lemma and by equation (2.5), included in the tangent space to O(() at
(. Thus T(O(() = D((), as claimed. We conclude that the connected components
of the possible orbits O( () give rise to the leaves of the integrable manifold D, as
established as a consequence of the following lemma. This lemma can be used to
prove the fact (see section 2.3) that a distribution D is integrable iff it has the
integral manifolds property.
Lemma 3.2: Assume that the smooth curve -y:(O,l)-->M is such that (a) 'i'(t) is in
D(-y(t)) for all t, and (b) the rank of D is constant along -y(t). Then the image
E.D. SONTAG
452
of , is contained in an orbit of E.
Proof: We claim that for each tE(O,l) there is a neighborhood V of t such
that ,(V) is included in O(,(t)).
Let XI"",Xr be vector fields such that
{XI(x), .. ,Xr(x)} is a basis of D(x), x=,(t). Because the rank of D is constant
along this curve, it follows that {Xl (y ),.,Xr(y)} is a basis of D(y), y=,( f), for f
close to t.
Thus there are r smooth real functions Pl"",P r, (defined in a
neighborhood of t,) such that
for all such f. This can be seen as a controlled differential equation evolving in
; the manifold O(x). As such, there is a solution " of this equation, for T near t,
contained in O(x), and with ,,(t)=x.
But,' would also be a solution of this
equation as an equation evolving in the manifold M. By uniqueness of solutions of
(controlled) ode's in M, it follows that , = " is indeed contained in O(x) for f
near t.
This establishes the claim.
For each possible orbit 0, this argument
shows that {tl,(t)EO} is open, Since the orbits are disjoint, connectedness of (0,1)
implies that the range of , can intersect at most one orbit .
If N is any connected sub manifold of M which is an integral manifold of D,
and if , is in N, then N must be contained in 0(,), and hence in the connected
component of 0(,) at ,. This is because any y in N can be connected to , by a
smooth curve, integral for D, and the above lemma concludes that O(y) = O(~).
This concludes the proof of the theorem in the manifold case, assuming lemma 3.1
IS known to be true.
In the rest of this section, we show how to prove the theorem in general,
assuming it has been proved in the manifold case. We need the following lemmas.
Given the action E, we may introduce another action E int , obtained when replacing
C a by int(C,J for each aEA. All axioms are again satisfied. Let 0int(x) be the
orbit of x under this action, and let Dint be the corresponding distribution. Note
that the theorem then applies to Dint (manifold case).
OW
for all ~.
453
all t>O and ,)0) = u i. We now consider the following action E'. Its index set
A' has just two elements {I,-I}. The manifolds VI' = V. I ' are both equal to
(-1,1), and C I ' = C. I ' is the union (-I,O)u(O,I). The maps L are the identity, and
(with domains induced from the original action),
gl'(X,'\) := gb(x,'l'I(.\2) ... ".(A2))
g.l '(x,.\) := g.b(X,7 r P2),,7 1 (.\2
Let 0' denote orbits with respect to E'. Note that, from the choice of the curves
'l'p it follows that O'(x) ~ 0int(x) for all x.
Finally, consider the action E"
obtained by using instead C I = C' l = (-1,1) in the above description. We know
that ~EO' ,(~); the result will follow if we can prove that ~ is also in 0'(0. The
theorem can be assumed true for both E' and E" (manifold case). Furthermore,
a density argument as used in the previous lemma shows that the corresponding
distributions D' and D" are equal. Thus the connected component of 0' (I) at ~
is the same submanifold of M as the connected component of 0" (~) at ~ (since it
is the leaf of D' through ~).
Thus there is a subset N of M which is a
neighborhood of ~ both in the topologies of 0'(1) and 0' '(~).
For.\ near 0,
gl ,(~,.\) is in N, by continuity of gl' in .\ for the topology of 0' 'U). (Property
(2) in lemma (3.1).) Thus there exists a .\ i=
such that y = gl '(~,.\) is in
454
E.D.SONTAG
In particular, if E
Then D = Ad(~,~).
Proof: One inclusion is proved above; for the reverse, consider the action
E (~ ) associated to ~.
Pick ~ in M.
Let N be' the leaf (maximal integral
manifold) of Ad(~,~) through {. We claim that then OW is contained in N. This
will imply that T ~OW = DW is included in T ~N, giving the desired inclusion.
455
(3.2)
Partition [0,1] into finitely many intervals Ii in each of which , is constant. If y,z
are endpoints of one such interval, then this argument shows that y,z are connected
by an integral curve of Ad (<l> ,<l, and are therefore in the same leaf of this
distribution. By induction on the intervals,
and ~ are in the same leaf, so ~EN.
Thus, O(e)C;;;N, as desired .
4. Proof of
leInInIl
3.1
We shall need some more notation. For b in A *, mb (or just m) will be the
map gb(x,.), with domain Lb:= {w I (x,w)ED b}. We also make the convention that
a statement like "gb(x,w) = y" will mean "(x,w) is in Db and g(x,w) = y". Fix an
x in M, and let 0 = O(x). We establish first that
r(x,y) = r(x,z)
for any
y,z
in
O.
Pick b,c in A* and w,w' such that gb(x,w) = y, gc(x,w) = z, and rank [dg(x,w)] =
r(x,y). Introduce e:= (b,-b,c) and x:= www'. Since g(x,ww) = x, it follows that
g(x,x) = z. So rank[dg(x,x)] :0:; r(x,z). Let F:= g(.b,c)(-'ww') --with domain the
open set {x I (x,ww,) E D(.b,c)}' Since dxF(p) is a linear isomorphism for all p in
the domain of F, it follows that r(x,y) = rank[dg(x,w)] = rank[dxF(y) dg(x,w)] =
rank[dbg(x,x)] :0:; rank [dg(x,x)] :0:; r(x,z).
A symmetric argument concludes the
equality. Let r be the common value of the r(x,y).
Consider now the set S of all triples s:= (b,Q,h), where b is in A and:
Q
mbl Q : Q
-+
L b,
(4.1)
r,
(4.2)
456
E.D. SONTAG
h: Q
-t
~r
h(Q).
(4.3)
Fix one such s, and consider the set m(Q); this is a subset of o. The bijection
mlQ induces a canonical manifold structure on this set, for which both mlQ and
:= h (mIQ)-1 are diffeomorphisms (and such that is a chart). We now prove
that, for this structure,
0
We now prove that the family of all such charts (mIQ,) defines a smooth (rdimensional) manifold structure on 0, and that property (1) holds. It will then
follow from (a) above that this structure makes 0 into a submanifold of M, and
the uniqueness statement follows from (b).
The sets m(Q) cover 0: Pick any y in 0 and let b,w be such that gb(x,w) = y
and dm(w) = dg(x,w) has rank r. Thus dm has maximal rank at w, so there is an
r-dimensional embedded submanifold Q of Lb, containing w, such that equation
(4.1), equation (4.2) are satisfied; ,replacing Q if necessary by an open subset of Q,
a suitable h can be .found for equation (4.3).
Pick an arbitrary y in V; thus there are w,w m Q,P with y = m(w) = m'{w').
Let e:= (b,-c,c) in A*, and take x:= ww'w'.
Note that rank[dm(x)] ?:
rank [dcg(x,x)] = rank[dg(x,w')J = r. Since dm(x) always has rank at most r, it
has maximal rank at this x. So there is an open subset Z of L. which contains X
and such that m.(Z) is an r-dimensional embedded submanifold of M_ Introduce
the open set W [resp., W'] consisting of those v in Lb [resp., LcJ such that vw' w'
457
III
W'.
Let
which is then open in Q, because EnF is open III E. So rj>(EnF) is open in h(Q)
= rj>(m(Q)). Thus rj>(z) has a neighborhood included in rj>(m(Q)), and (a) follows.
To prove (b), note that rj> maps EnF (embedded submanifold of me(Z))
diffeomorphically onto rj>(EnF) , which is open in h(Q) and contains rj>(y). A similar
statement holds for 13.
So 13 rj>.1 gives a diffeomorphism between rj>(EnF) and
f3( EnF), and (b) follows.
0
mb are smooth.
a w' in Lc with
It will suffice to
is (in a suitable
(4.4)
so gc is indeed smooth.
5. S-8ctions
In this section and the next we introduce, for a given action E, two new types
of associated actions Es and E B These will be useful in the study of discrete and
continuous time systems. Let E be fixed for the rest of this section.
We introduced earlier the action E' whose index set is A'.
458
E.D. SONTAG
is any subset of A * with the property that -SES whenever SES. Assume further
that S contains all pairs of the form (-b,b), for bEA *. There is a well-defined
action Es obtained by restricting the index set A * to S. We call an action of this
type an S-action associated to E. Let Ds be the associated distribution D(Es).
Lemma 5.1: For any S as above, D = Ds. For each ~EM, O(~) = u{O.(x)"
(where 0. indicates orbit with respect to Es).
XEO(~)}
IS
connected.
Corollary 5.4: Assume that E is a complete action. Then the leaf of D through
~ is O.W (orbit of canonical Es). In particular, E has full rank at ~ if and only
if
O.W
is a neighborhood of ~ .
459
6. Balanced actions.
We extend
00
to A*
.-
{(u( .. u r )
E Cr
v~al ._ ((U("u r ) E
uj
= u1r )
uj =
1r )
Do .
This is a balanced
460
E. D. SONTAG
Let Lo := {iP
oh
I tr(a)=tr(a'),
Let
(u,v)ETC}.
Note the analogy with,. with proposition 3.8. The algebra Lo corresponds to the
usual "O-time Lie algebra" for continuous time (time-topology) systems ([SJ]).
Even if :1:: is connected, :1::B may be nonconnected; however completeness and the
existence of "zero" is inherited:
Proposition 6.3: If :1:: is a complete action [resp., an action with zero,] then :1::B is
complete [resp., an action with zero].
Proof: Let ~ and b balanced be given, b = aIar" If:1:: has a zero 0, the
sequence X of r O's is such that gb(~'X) = ~ for all~. So:1::B also has a zero.
(Note that X is indeed in int(C~al).) Take now :1:: to be complete. The domain
condition is certainly satisfied by :1:: B.
Assume given now any w = u(u r in
int(C~ ..I) and another such w'.
By connectedness of int(C) there are curves 'i
joining u i with u i '. We may assume that, if j=1ri, then '/\) is :rJX). Thus,
X to w in int(C~al) .
Finally, note that by an argument as in the previous section, orbits under :1::B
are submanifolds of orbits under :1::.
(7.1 )
in
where r is an integer, the u i are in C, and the ti are arbitrary real numbers. The
zero-time orbit 0 o(~) is obtained by considering all points as above but with the
461
constraint that
I: ti=O.
The previous theory can be applied to the above systems in two very different
ways. The first was described in section 2.3, and consists in associating the action
III which
(7.2)
The alternative which motivated much of the previous discussion
instead
IS
to consider
More precisely, we consider the action with A'- !R, the obvious "-", U a
[respectively, C a ,] equal to U [resp., C,] for all a, the identity La' and D t the
domain of the above map.
We refer to an action of this type as a continuous
time action with input-topology.
The first (time-topology) model IS the one implicitely used in the literature
when dealing with continuous time systems, and is the basis of the known orbit
theorems.
The second model (which cannot be even considered in the usual
treatments, where smoothness of (7.1) in u is not necessarily assumed,) will provide
the right framework for understanding sampling results.
Note that the orbit O( 0 is the same, as a set, as the orbit of the
corresponding continuous time action with time-topology or that of the
corresponding continuous time action with input-topology. Thus the notations are
consistent. We let DT and Du be the distributions obtained with the respective
actions L: T , L: u ' These are in general different, and so the manifold structures that
result on O(~) from the two possible applications of theorem 3.1 will be in general
different. For instance, in the trivial case in which M = !R and P(~,u)=l (U is
then irrelevant), clearly O(~)=M for each~. Here DT has full rank at every point,
and O(~) has the topology of M. On the other hand, Du has dimension zero at
each point. The connected component of O(~) at ~ with the input-topology is just
{ni the orbit O(~) = !R has uncountably many components.
The set cP is in the time-topology case just the linear span of {Xu' UEC}.
Since the corresponding action is connected, we have that O(~) contains a
neighborhood of ~, relative to the topology of M, iff the DT has full rank at ~.
We also know that in the analytic case the latter is equivalent to full Lie rank.
In the input-topology case, full rank of Du is sufficient but not necessary (above
counterexample) for O(~) to be a neighborhood of~. The input-topology action is
not countable nor, in general, connected.
When the vector fields defining the
system are complete in the usual sense, this action is complete and hence
S-connected, so the leaves of the corresponding distributions are the orbits O.(~)
(again see the above example).
In the time-topology case we can apply the material m section 6. Here
IS
462
E.D. SONTAG
(7.4)
This action EB may fail to be connected.
= u,
y = 1-u .
For any control, x(t)+y(t) = x(O)+y(O)+t, and the orbit of (1/2,1/2) under EB is
the set of points of the form (x,-x) with x not an integer. This is not connected
even in the topology of M. We shall prove below that these orbits are exactly the
sets OoW.
Remark 7.1: The pathology is due to the fact that the system (action) is not
complete. If the system is complete, 0 o( 0 is connected in the time-topology (and
hence in the topology of M).
This is very easy to establish directly, but also
follqws from the previous work. Indeed, consider the balanced action EB associated
to the time-topology action. If E is complete, EB is also complete. Further, E
has a zero, so EB is a complete action with zero, and hence connected. Thus, by
proposition 3.6 the orbits under EB are connected. These are submanifolds of M
and of the orbits under E T , so they are connected in these topologies too .
Note that the canonical action Es (recall section 5) gives nothing new for the
time-topology case: the obtained orbits are the same as for E. This is in general
true for connected actions, as discussed previously. Or directly, because given any
(f,X), {, and any t we may write exp[ftXJW = exp[f(t+O)XJ exp[-eoXJW for small
enough 0, which exhibits gex(x,t) as an element of the form g_b,b(x,t 1t 2).
One
could also consider different balanced actions associated to the input-topology
model, but we don't need to do so here.
0
In this section, unless otherwise stated, EB will mean the balanced action
associated to the time-topology model of the given continuous time system, and Es
will be used exclusively to mean the S-action associated to the continuous time
input-topology model, with the following set s:
s:=
{bEA* I b = t1 .. t r , 2:tj = o} .
Note that S is invariant under "-" and that (-b,b) is in S whenever bEA*.
corresponding E s is not the canonical S-action associated to E.
Proposition 7.2: The following four sets are equal for each { in M:
The zero-time orbit 0o({)
The orbit of { under Es
The
463
Proof: From the definitions, it is clear that orbits under Es equal the sets
00' and that orbits under EB and under the canonical-S action Es are included in
zero-time orbits.
Consider now the proof that in the complete case the orbits
under the canonical S-action are precisely the sets o( ~). It will be sufficient to
prove that each element I of the form
(7.5)
can be obtained by applying to ~ a composition g of diffeomorphisms of the type
exp(tX)exp(-tX), for various X,t. More generally, assume that the times ti in (7.5)
add up to r. We claim that I can be obtained as the result of the application of
such a g followed by an element of the type exp(rX), X=X u' We prove this claim
by induction on r.
For r=l, the result is trivial.
If true for r-l, write the
expression in equation (7.5) as
exp[trXu ]exp[r'X]g(O ,
r
where tr+r' =
r.
exp[-r'X u ]exp[r'X]g,
r
2:: tj
with the gj balanced and the Sj all of the same sign and adding up to r. (And the
Xi of the form Xu,)
So, if r=O, all Sj must be zero, and we have a balanced
expression. When r=l, the claim is ,true with k=1. Assume inductively that we
are given the expression
exp[toX]exp[sl X1]gl exp[s2X2]g2"gk.1 eXP[skXk] (e).
(7.6)
Assume without loss that to1=O. If to has the same sign as the common sign of
the sp then the induction step is proved. Otherwise, suppose that to <0 and the Sj
are all positive (the opposite case is analogous). Let u.J be the sum of the s.1 for
i=l,"J, with 0"0:=0. If -to>O"k' we may rewrite the above as exp[rX]g(e), where g
is the balanced (and well defined at e) element
We
E.D. SONTAG
464
-r' <Sj+I'
Renmrk 1.3: In the above equalities, one may as well take the orbits under Es
and EB obtained when restricting controls to intC.
This is because, as proved
earlier, orbits do not change if such a restriction is made, and the proposition
applies both to the original case and the case where C is replaced by intC.
Definition 1.4: The action E satisfies the strong Lie rank condition at ( iff EB
has full Lie rank at ~.
The following definition will be more relevant to our main results.
iff 0 o( 0 IS a neighborhood
The definition refers to the topology of the ambient manifold M. It turns out,
however, that strong transitivity is equivalent to ( being interior with respect to
Or, in terms of the associated
the topologies induced by Es or by E B .
distributions,
Definition 1.6: The time topology of 0o(() is the topology induced from the
action E B . The input topology is 'that induced from the action Es'
The above fundamental result is a consequence of the fact, whose proof is given
in the next section:
0o(~)
are second
465
Ds = {Ad?r(Xt,u)
(7.7)
x t ,u,V
8exp[tXv]exp [-tX,J( 0
8v
I v:=u(II).
(7.8)
If P(x,u) in (7.1) is affine in u and the system is analytic, then for small t these
generators can be expanded in terms of the Lie distribution appearing in lemma 6.2
(these expansions will appear later), and that fact may help to understand
intuitively why theorem 7.1 is true.
Remark 7.8: It is important to note that proposition 7.7, and theorem 7.1,
depend essentially, in the time-topology case, on the assumption made in defining
continuous time systems that the interior of C is connected. Otherwise we could
consider a system on M = lR having, for instance, U = C = (O,1)u(2,3), and the
dynamics given by f(x,u) =
if uE(O,l) and 1 if uE(2,3).
For this system,
0o(e)==lR for each e, but Ds(e)={O} for each e, since the partial derivatives in
equation 7.8 are clearly all zero .
(7.9)
where t=(tl' ... ,t r) and w=(u1, ... ,u r ). Its domain Er?r is that open subset of !R?rxur
where the expression is defined, and lR?r:= {t=( tl ,. .. ,t r) [ti=t?rJ
Note that the
image of each a is included in 0o(e), so we view a as a map into the zero-time
orbit. It is essential to note, in order to understand the necessity of the arguments
to follow, that this mapping is not continuous for either of the topologies of 0o({)
which we are considering. However, a is continuous with respect to t when 0o(e)
is given the time topology, and is continuous with respect to w if the input
topology is considered instead (hence the names for the topologies).
This is
because a(t,w) equals gt1 ... tr(u!'"u r) when g is as in (7.3) and also equals
( )(lt11"",lt r I) when g is as in (7.2) and .1 = sign(t.).
g(f 1u)
1 ... rUr
1
Every element of 0o(e) can be written as a(t,w) for suitable r, ?r, w, and t (and
all ti=F0, if desired). This is because of the equality of 00 and orbit under E B.
Fix now rand ?r.
Lemma 7.9: For each (to,WO) in Er?r there exists a neighborhood N of (to,WO) in
E. D. f.ONTAG
466
21rl-l, 1rij(2i)=2j,
III
Z. Moreover,
(The
Finally, let
This
is the only place where the (essential) assumption that inte is connected is ever used.
Let
467
t( ..tr,w)
0: 2
O:(S("sr'w)
0: 2
0:(
(7.10)
(7.11)
r, ij
r, ij
_
r," kj
(7.12)
(>k(t,r)'Xk(w,uk,u.))
J
J
J
if i::;k<j-l and in the same component as o:(t,w) if k=j-l. We prove the claim by
induction on k. For k=i, this is trivial by equation (7.11). Assume now the claim
proved for k. Since w is in Al x x Ar' it follows that Uk and u k+ I are both in
intC.
Thus there is a path , connecting Uk and u k+ P ,with the image of ,
contained in intC, and hence in B. Consider the path
,'(.\) :=
0: 2
r'''kj
(7.13)
(>k.(t,r)'Xk'(w,,(.\),u.))
J
into 0oU) with the time topology. This is well defined, because of the choice of
the neighborhood V, and it connects the element in equation (7.12) with the the
corresponding element having u k+ 1 instead of Uk' If k+ 1 <j, this equals
0:
,
2r, 1r k+l,j (>k+1 ,J.(t,r)'Xk+1 ,j.(w,uk+pu.))
J
]. and exp[rX
]
uk +1
u k +1
commute.
This establishes the inductive step, and l proves the first part of the
claim. Applying now the same argument with u. I a'nd u., the expression obtained
J'
J
at the end of the path is simply o:(t,w), by the equality exp[-rX u ]exprtX
]exp[rXu ]
l
u
j
= exp[rX].
u
j
Lemma 7.10: For each (to,WO) in Er1r there exists a neighborhood N. of (to,WO) in
,Er1r such that o:(N) is connected III the time topology.
Proof: The proof is very analogous to the previous one.
Consider now the functions
Fix
III
M.
>(t,.\) := (.\tp(I-.\)tl,.\t2,(I-.\)t2,oo.,tr,(I-.\)tr)
x(w,w') := (upuI',u2,u2',,ur,ur') .
Let 1r be the permutation of {1,,2r} obtained from 1r by the formulas 1r'(2i)=21ri,
1r'(2i+l)=21ri+1. Now let
Z := {(t,.\,w,w')E!R1rx!Rxurxurl (>(t,'\),X(w,w'))EE2r ,m}
This is again an open set.
468
B.D. SONTAG
where b,>O and u~ is in intA; for each 1. (Again, the first product of intervals
should be interpreted as a subset of ~1T') Let
N := TI(t?-b,t?H) x A}x ... xAr .
Pick (t,w) and (s,w'). We want a path connecting u(t,w) and (s,w') in the time
topology. If w = w', this is again easy: by induction, assume that t,s differ only
at tp ti' where 1Ti=j. A path from t; to s; maps into the appropiate paths in
0o(()'
The interesting case is that in which we try to connect (t,w)EN and
(t,w')EN. In that case, consider the path
')' '(A) :=
u 2r
,m(4)(t,A),x(w,w ')),
O~A~l,
o( () with the
We can now complete the proof of proposition 7.7. Consider the case of the
input topology. The set o (0 is a countable union of sets of the form u r,,.~(Erll~),
so it is enough to show that each of these latter sets intersects at most countably
many components of 0o(() with the input topology.
(Note that this is not
immediate, since for each component of 0o(()' the pre images under u are not
necessarily open, because u is not continuous.) But there is a covering of Er1T by
open sets N each of which maps into a connected component, by lemma 7.9. Since
Er1T is second countable, bec,ause U is, it follows that there is a countable subcover
by these sets N ("Lindeloff" property), and the result follows. The time topology
case is entirely analogous, using lemma 7.10 instead .
8. Invertible discrete-time systems.
(Invertible) discrete time systems are a natural class of discrete time control
systems, and where studied explicitely first by [JA].
They are described by
controlled difference equations
(8.1 )
469
2:: bi=O.
Thus the leaves correspond in the complete case to "zero time orbits" in
discrete time. In general, all we can say is that discrete-time systems give rise to
countable actions, and hence proposition 3.7 applies.
In the analytic complete case, there is a Lie algebraic criterion for transitivity
that may be easier to apply than checking the rank of D. It is to some extent
related to the result in proposition 3.10.
(Note however that in applying this
criterion to sampled sytems -see next section,- it is still necessary to integrate the
original continuous time system; the criterion is not a "direct" condition based on
the vector fields defining the system, as one using Lie distributions would be.)
This criterion, which we prove below, was first established by [JNC], based on
computations in differential algebra.
Assume a discrete-time complete action E is given, and fix a control value Uo in
intC, to be denoted simply by 0.
Let 11":= g+("O) and let r* be the group
generated by 11". More generally, for each integer i>O we associate the mappings
gi(X,U) := 1I".j(g+(1I"i.l(x),u))
g.j(x,u) := 1I".i+l(g.(1I"i(x),u))
There is then an action E1I" defined by using these g's together with the original
sets U and C. Let 11"( e) be the orbit of e under this action. Again applying an
argument analogous to that in the two paragraphs after equation (7.5), we have:
Lemma 8.2: For all
e,
01l"W
O.W.
Consider the distribution D 11" associated to the new action. The vector fields in
4>(E1I") are precisely those of the form Ad1l"i(X), with i an integer and X in 4>(E).
The action E1I" is connected, since gi(e,O)=e for all
a~d any other gj(e,u) can be
deformed to this by completeness. Thus by lemma 3.10 its distribution has full Lie
rank iff it has full rank, in the analytic case. By lemma 5.1, D and D1I" coincide.
We can summarize the discussion as follows (this is, with different terminology, the
result in [JNC]):
e,
Corollary 8.3: Assume that the E is a complete discrete time action. Then, E is
transitive at
if (and, in the analytic case, only if) E1I" has full Lie rank at
e.
There is yet another sufficient condition for transitivity, not necessary even in
470
B.D. SONTAG
the analytic case, which will he of interest in the context of sampling. The rest of
this section studies that condition. We still assume that I: is complete and that
an element "0" has been fixed in intC. We let 'If be g+("O), as before, and ')':=
'If'l = g,hO).
Let el''''Jem be a basis of To U.
the vector field
(8.2)
generates a distribution of full rank at ~, then I: is transitive at~. A (rather
surprising) result in sampling will be that in a certain sense this condition will be
also sufficient for analytic systems. The Lie algebra L can be also generated in a
different way, which will be needed later. Let a.I,V be the function g- (.,v)i, for each
v. For each i::::l and each j consider
(8.3)
so that b 1j = bj for each j.
b H1J = Ad')'(b i)
for all j.
We conclude:
L .
9. Sampling
Consider a continuous time system as in section 7. In digital control it is of
interest to restrict attention to controls in equation (7.1) which are constant on
intervals of length 6, where 6 is a positive "sampling interval". It is of interest to
determine when properties of controllability are preserved under sampling
(restriction to sampled controls).
There is a large literature on such issues for
linear systems; see for instance [KHN], [BL], or [GH]. These results establish in
particular that controllability is preserved if 6 is small enough. We prove in this
section a result along these lines, for transitivity, as well as a more algebraic
The result is very easy to prove based on the above machinery, and
criterion.
generalizes that in [SS]. It is also possible to give (positive-time) controllability
results, as done in [SOl], but the techniques required are different, and we do not
do so here.
Assume a continuous time system (7.1) is given.
We consider the associated
actions I:B and I:s discussed in section 7. For each 0>0 we introduce also the
discrete time action I:6 defined by taking g+(~,u):= exp[6X u ] and g,((,u):=
exp[-6X u ]'
471
Lemma 9.3: Assume that A(t) is as above, and consider the following statements:
Proof: We first prove that (1) is implied by (3). Assume that the space S
in (1) is proper. Then there is a nonzero row vector q such that qA(t)=O for all
t. It follows that qAa(t)=O for all t, in particular for t=tO, so (3) follows. In the
analytic case, the argument can be reversed.
We are only left to prove that (1) implies (2), the converse being trivial. If
(1) holds, there is a finite set of matrices A(Tl' ... ,Tk ) whose columns span IRk, with
various choices of vectors (Tl' ... ,Tk ). Appending these matrices to each other, we
might as well consider the case in which there is only one such matrix (with
maybe a larger u and k). So assume that A(T) has rank n at T. Let A>O be
such that A(t) is defined and has rank n whenever it(Tii<A for all i. Consider
now any positive 6 with 6<A. For each i, let si be an integer with is(Tj/6i<1.
Then, A(s16, ... ,sk6) has rank n, as desired. Note that we have proved somewhat
472
E.D. SONTAG
more: if q matrices are used to generate in (I), the same number is enough in
(2).1
e,
e,
J ..
where lI"(t):= exp[t il Y exp[t iK Ykl. This is defined for t near r. The matrix A(t),
t = {tij' i=I,.,n, j=I,. .. ,k(i)}, is the matrix having the b(t i) as columns. Thus
condition (I) in lemma 9.3 is satisfied (use r's as particular values). The columns
of A(t), tE5Z, are elements of D 5. The result then follows from the lemma.1
Note that it is possible to refine the proof given here to conclude, as in [SSl,
that .6. can be chosen uniformly on neighborhoods of
and hence uniformly on
compacts.
e,
X : !R
->
E(M)
suc that Xr := X(r) is complete for each rand Xr(e) depends smoothly on
(r,e)E!RxM. If X and X are like this, ~:= [X,Xl is by definition the. parametrized
vector field with Zr := [Xr' Y rl for each r. This defines a Lie algebra structure on
the set of all parametrized vector fields. If X is a p.v.f., we may consider the new
p.v.f. X' with
alae
and
alar,
LK,Kl(N)=. L
.+;;=N
473
(J IJ:(i),rU)j.
(9.1)
F:
:= { XiN ), N::::O,
XEF }
<;:;
E(M)
(Note that both sets represent Lie algebras of vector fields, not of p. v J.'s.)
{F}LA'
the form
N
rlJ:,Kl~N).
k=l
474
E. D. SONTAG
N
N
LriJ:,rJ(N)=L
k=l
l=l
(0 Pl'
Pl= L
rkkllk(J:(l)J,rN-l)j.
k=l
By a VanDermonde argument, we may choose the sequence of reals {rk } so that all
E rkk l =
('0 ).1
This gives
Note
{F 6}L'
e.
475
.-
(9.3)
Let F be the family consisting of all such p.v.f.'s; F is shift-invariant because lX[jJI
= X[ilJI.
Further, for each fixed c>O, F c is nothing else than the set of all
elements b iJ in equation (8.3) (with respect to the action Ec)'
is c-transitive if {FC}L has rank n at ~, or, from corollary 9.7,
It follows that E
Theorem 9.2: If {F:'h has rank n at ~ then there is a t.>0 such that E IS
c-accessible at ~ whenever O<C<t..
Now consider a control-linear system, I.e. a continuous time system for which
the right hand side P(x,u) of (7.1) has the form f(x)-l: uigj(x), and U
is !Rm , C
(r,j)., ~
ikad~-l(g)l'/k!,
(9.4)
k=1
where ada(8) := [a,,8]. (This formula was used before in the context of sampling
in [NC], where a weaker version of the corollary given below was conjectured.) It
follows that
(9.5)
so that {F:'}L is precisely the algebra Lo that appears in lemma 6.2, because the
lie algebra generated by the ad~(gj) equals Lo' For any fixed c>O, introduce the
Lie algebra of vector fields Lc which is given as in equation (8.2) for the discrete
time action E C. We conclude from the above:
Corollary 9.8: Consider a continuous time complete control-linear system.
If
{Lc}o has full rank at ~ then E is c-transitive at~. If E satisfies the strong Lie
rank condition at ~ then there is a t.>o such that for each O<c<t., {Lc}o has full
rank at ~.
Note that the last conclusion is a very particular case of a fact that we had
already proved above (c.f. theorem 9.1). The first conclusion is rather interesting,
however. Based on the discrete-time theory, there is no reason to expect the given
distribution to have full rank, even if E is c-transitive and analytic.
But the
corollary says that when dealing with sampling problems, at least for small enough
C this will indeed be true.
Further, in particular examples it may be easier to
check Lc' Note that formally, if we let JC be the following (partial) linear operator
476
E.D.SONTAG
on SCM):
00
;6:= L
ad;-15k/k!,
k=1
then bj
J5gj' and
L5 = {Ad:xpl.5f]J5gj
I k~O,
These formulas are used to provide a more explicit result for bilinear systems in
[S03].
yza/ax - xyB/az,
g =
(9.6)
N
X r= ""'
~ ad N':'I
(g)r)Nl
f
N=I
Let h be the linear vector field (z,O,-x), and consider its Jacobian
H :=
I ~ ~ ~ I
-1 0
ad}(g) = yi(_H)ig
for i~2
477
where H(r) is the matrix L~l rkHk.1/kL Using linear algebra to evaluate (-H)(ry),
we conclude that we must study the following p.v.f.:
Xr = [-r2 z/2+{1/y){sin( ry )+cos( ry )-1 },r,r2 x/2+(1/y ){sin( ry)+1-cos( ry)}] '.
Let Y be the p.v.f. 2~ (substitute t:=2t in above equation), and ~ the p.v.f. 3~
(substitute t:=4t).
Consider now the matrix whose columns are Y r-4Xr' Zr-9Xr'
and the Lie bracket of these. The determinant of this matrix can be computed
easily with a symbolic manipulation system and one obtains
Thus the determinant is nonzero whenever ry IS not a multiple of 21[. Pick now
any 6>0. For any point (x,y,z)' such that 6y is not a multiple of 21[, this shows
that the system is 5-transitive at (x,y,z)'. But if (x,y,z) is now any point in !R3 ,
we can always 6-reach from this state one with 'good' y, since the y-coordinate
satisfies dy /dt = u.
It follows that the system is transitive at every { in !R3
(Instead of the argument in terms of reaching points with 'good' y, one could
search directly for more generators in order to establish the conclusion at every
point.).
9.S. The one dimensional case.
As an easy example, we consider complete continuous time analytic systems with
M =!R. Although elementary, this case provides some feeling for the kinds of
pathologies that may occur. We let
N:= {({,6)
I {EM
(x,5)EB, some
x}.
(9.7)
These are the sampling periods for which (7.1) is not globally transitive, in the
sense that D6 has full rank at all {. We shall prove:
If B is nontrivial, then C is a
In particular, the system is globally transitive for all small enough sampling
times (if nontrivial). Theorem 9.3 will follow from a more detailed study of the
following sets. For any two (complete) vector fields X, Y, write
478
E.D. SONTAG
k}.
(9.8)
Take two vector fields of the form X = Xu and Y = Xv' u,v in C. Assume that
(x,o) is not in B(X,Y), so that, for the system Eo ' gb(x,w) "/=- gb(x,t/I) for some
k>O, where w=u k , t/I=vk, and bEAt. Since C is connected, the image of gb(x .. )
contains a nontrivial interval.
Thus E is transitive at x, and x is not in B.
Conversely, assume that (x,o) belongs to all the B(X,Y) of the above form. Then
the orbit 0o(x) of x under Eo is included in the discrete set {exp[koX](x),
k=integer}, for any fixed X, and so (x,o) is in B. We conclude that
in
(9.9)
C}.
It follows that it is sufficient to prove theorem 9.3 for the sets of type B(X,Y).
We identify vector fields with their coordinates with respect to the natural global
chart in !R.
Lemma 9.9: Assume
X(x)Y(xO for all x.
that
B is nontrivial.
We are thus led to the study of the sets B(X,Y) with, say, X(xO and Y(xO
for all x.
Call such vector fields " positive" . Conversely, any such pair {X,Y}
gives rise to a system (7.1) with B
B(X,Y)j this is a consequence of the
following characterization, which is easy to obtain but very useful:
Lemma 9.10: Let X,Y be positive (analytic, complete) vector fields.
There is
then an analytic function g: !R -+ !R, with derivative (dg/dt)(t-I for all t and
such that, for some diffeomorphism b(.),
g(t+ko)=g(t)
iff
(b(t),6)EB(X,Y),
(9.10)
Conversely, given
479
any analytic g with derivative bounded below, and any (strictly increasing)
diffeomorphism b, there exists a continuous time system, and in particular there are
positive X,Y, such that B = B(X,Y) and 9.10 holds .
Proof: Let a(t):= exp[tX](O), b(t):= exp[tY](O), both analytic and strictly
increasing. Let c:= a-I, d(t):= c(b(t)). Define
g(t):= d(t)-t.
Since c(.) and d(.) are increasing, g has derivative > -1. Let x be any state,
and to:= b- 1 (x).
Note that exp[tX](x) = a(c(x)+t), exp[tY](x) = b(to +t).
So
these two trajectories are equal at tiff g(to +t) = g(t o)' Further, since g(O)=O, g
is constant iff g=O, which happens iff a(t) = b(t) for all t. This proves the first
part of the lemma.
Conversely, assume given g and a diffeomorphism b. Multiplying g by a
constant, we may assume that (dg/dt)(t) > -1/2 for all t. Let U = C = lR, and
introduce for each u the function du(t) = (sin 2u)g(t)+t; note that the derivative of
d u is >1/2, for all u.
may then introduce
We
f(x,u):= (dau/dt)(a~l(x)).
Let X:= f(.,O), Xu:= f(.,u) for u>O, and Y=f(.,l). Reversing the previous argument
shows that, for any u>O, exp[tXu](b(x)) = exp[tX] (b(x)) iff g(x+t) = g(x:)
(independent of u). For this system, then, B(X,Xul = B(X,Y) for all u>O. Thus
B = B(X,Y), and 9.10 holds.
Fix now a function g satisfying the properties in lemma 9.10, and denote' by
B(g) the set of pairs (t,6) with 6>0 such that g(t+k6) = g(t) for all integers k.
Also, let C(g) be the projection of B(g) in the 6-coordinate.
Lemma 9.11: Let (t,6), (t' ,6') be in B(g).
Then,
(9.11)
480
E.D. SONTAG
Corollary 9.12: If 6 and 6' are rationally independent, and if (t,6), (t',6') are in
B(g), then g(t) = g(t').
Corollary 9.13: Assume that C(g) has a limit point in Ii.
(t",6 ") in B(g). Then g(t ') = g(t ").
Proof: We shall use the following observation twice: Assume that {aj} is a
converging sequence of distinct real numbers, and let f be any nonzero real number.
There are then (i) a subsequence {aj } of {aJ, and (ii) sequences {bj}, {cj } of
integers, such that the numbers ej:= bjaj + c/ are all nonzero and {ej } converges
to zero. [Proof: assume that a j - a. Let bi' cj be integers such that bjf:O and
Ibja+cll < I/i (if a=O use just cj=O, otherwise consider the group generated by a
and f). Now pick any ap j=ji' such that the inequality is still satisfied and ejf:O.]
Assume that {(tn,6n)} ~ B(g), with all 6n distinct and converging to 6 (which may
be zero).
Applying the above observation with f:= 6', we conclude --for a
subsequence of the (t n,6n)-- that the b j6j+c j6' are are all nonzero and converge to O.
By lemma 9.11, Ig(tl)-g(t ')1 also converges to O. Taking in turn a subsequence of
the {61}, and f:= 6", we can also conclude that 1g( tj}-g( t "} 1 converges to zero. so
g(t '}=g(t' '}, as desired .
Proposition 9.14: If g is nonconstant then C(g} is discrete as a subset of Ii.
Proof: Assume that there are infinitely many distinct 6j ~ K, with (t j,6j) in
B(g}. By corollary 9.13, there is a constant c such that g(t l+k6j} = c for all i and
all integers k. Let t j ' = tl mod(6j) such that tj'e[O,KJ. Thus g(t l '} = c and {t j '}
is bounded. Since g is nonconstant and analytic, there are only finitely many t i '.
But then there are infinitely many t j ' ':= t j '+6j --since there are infinitely many 6(and these are also bounded, with g(t j ' ')=c. This again contradicts nonconstancy of
g .
Theorem 9.3 now follows from proposition 9.14 and lemma 9.10. Actually, we
can prove somewhat more. Since B is analytic, each subset with constant 6 also is,
so B is the union of a discrete set and a union of lines Lj:= {(x,61), x in M}. So
Since periods form a subgroup, g
g is periodic with period 6j, for all i.
nonconstant implies that the 61 are integer multiples of some fixed 6>0. So the
nondiscrete part of B is of the form
{(x,k6), x in M, k = integer}.
The set C(g} may be rather complicated.
sequence of numbers {an} such that
(an)'! <
l/ff,
and
Take a
(9.12)
(9.13)
481
Now let gn(x):= cos(1rx/an) and g:= (infinite) product of the gn' This product is
well defined because there is by (9.13) normal convergence on compacts, and g is
indeed analytic. Further, consider its derivative
g' =
l:
(g/gn)gn'
Remark 9.15: One of the most useful tools in the continuous time theory is (the
positive form of) Chow's theorem, which implies for analytic systems that the
positive-time reachable set has nonempty interior whenever O(x) does. In fact, the
term "accessibility" is used interchangeably with transitivity in that context, with
the first term refering to the property of positive time reachable sets.
Here,
but that the set of points
however, it may happen that D6 is full rank at
AcW
:= {exp[6Xu ]' ... 'exp[6XUk ]W I U.EC}
u
1
1
iff it satisfies
x=2r1r, 6=2s1r, r,sEZ
and
r.
As above, this gives rise to a system for which D 6(x) = {O} iff (x,6) is of this
form. Further, assume that this g is such that, with xo=21r, g(xo+2h) = g(x) for
all positive integers k. In that case we can conclude both that 021r(X O) has interior
and that A 21r (x o) = {2h, k~l}. An example of a g like this is
g(x):=(sin x)/x .
This example can be modified to obtain one where even the orbit under E6 equals
M for all (x,6) but such that still A6(x o) has empty interior for some xO' For
482
E.D. SONTAG
l:: 2'ng2(x+21rn),
10. References
[BA] Baillieul,J., "Controllability and observability of polynomial dynamical
systems," Nonl.Anal.,TMA 5(1981):543-552.
[BL] Bar-Ness,Y. and G.Langholz, "Preservation of controllability under sampling,"
Int.J.Control22(1975):39-47.
[BO] Bonnard,B., "Controle de I'attitude d'un satellite," Report #8019,
Univ . Bordeaux, . Oct.1980.
[BC] Brickell,F. and R.S.Clark, Differentiable Manifolds, Van Nostrand, New York,
1970.
[FN] Fliess,M. and D.Normand-Cyrot, "A group-theoretic approach to discrete-time
nonlinear controllability," Proc.IEEE Conf.Dec.Control, 1981.
[GH] Gibson,J.A. and T.T.Ha, "Further to the preservation of controllability under
sampling," Int.J.Control 31(1980):1013-1026.
[GO] Goodman,R., "Lifting vector fields to nilpotent Lie groups," J.Math. Pures -et
Appl. 57(1978): 77-86.
[HK] Hermann,R. and A.J.Krener, "Nonlinear controllability and observability,"
IEEE Trans.Autom.Ctr. 22:728-740.
[IS] Isidori, A., Nonlinear Control Systems: An Introduction, Springer, Berlin, 1985.
[JA] Jackubczyk,B., "Invertible realizations of nonlinear discrete time systems,"
Proc.Princeton Conf.Inf.Sc.and Syts. (1980):235-239.
[IN] Jackubczyk,B., and D.Normand-Cyrot, "Orbites de pseudo-groupes de
diffeomorphismes et commandabilite des systemes non linearires en temps discret,"
C.R.Acad. Sc. Paris, 298-1(1984): 257-260.
[JC] Jacobson,N., Lie Algebras, Dover, N.Y., 1979.
[KHN] Kalman,R.E., Y.C.Ho, and K.S.Narendra, "Controllability of linear dynamical
systems," Contr.Diff.Eqs. 1(1963):189-213.
[KL] Kalouptsidis,N., Accessibility and Stability Theory of Nonlinear Control
Systems, Dr.Sci. Thesis, Washington University, 1977.
[KS] Kupka,I., and G.Sallet, "A sufficient condition for the. transitivity of pseudogroups: Application to system theory," J. Diff. Eqs. 47(1973):462-470.
[KR] Krener,A., "(Adf,g), (adf,g) and locally (adf,g) Invariant and Controllability
Distributions," preprint, UC-Davis, 1984.
[LO] Lobry,C., "Bases mathematiques de la theorie de systemes asservis non
lineaires," Report #7505, Univ.Bordeaux, 1976.
[MO] Mohler,R.M., Bilinear Control Processes, Academic Press, NY, 1973
[NC] Normand-Cyrot, Dorothee, Theorie et Pratique des Systemes Non Lineaires en
Temps Discret, These de Docteur d'Etat, Univ. Paris-Sud, March 1983.
[ST] Stefan, P., "Attainable sets are manifolds," preprint, Univ. of Wales, 1973(?).
[SO] Sontag,E.D., Polynomial Response Maps, Springer, Berlin-NY, 1979.
[SOl] Sontag,E.D., "Remarks on the preservation of various' controllability
properties under sampling," in Developpement et Utilisation d'Outils et Mode/es
Mathematiques en Automatique, Analyse de Systemes et Traitement de Signal, ColI.
CNRS, RCP567, Belle-Ile, 1983, pp.623-637.
[802] 8ontag,E.D., "A concept of local observability," Systems and Control Letters
483
5(1984): 41-47.
[S03] Sontag,E.D., "An eigenvalue condition for sampled weak controllability of
bilinear systems," to appear.
ISS] Sontag,E.D. and H.J.Sussmann, "Accessibility under sampling," Proc. IEEE
Con/. Dec. and Control, Orlando, Dec. 1982.
[SUI] Sussmann,H.J., "Orbits of families of vector fields and integrability of
distributions," Trans.AMS 180(1973):171-188.
[SU2] Sussmann,H.J., "Lie brackets, real analiticity, and geometric control," in
Differential
Geometric
Control
theory
(R. W.Brockett,
R.S.Millman,
and
H.J.Sussmann, eds.), Birkhauser, Boston, 1983.
[SJ] Sussmann,H.J. and V.Jurdjevic, " Controllability of nonlinear systems,"
J. Diff. Eqs. 12(1972):95-116.
Various other
Theoretical Aspects
Anthony M.
Department
University
Ann Arbor,
Bloch
of Mathematics
of Michigan
Michigan 48109
'Christopher I. Byrnes
Departments of Mathematics and of
Electrical and Computer Engineering
Arizona State University
Tempe, Arizona 85287
Abstract
In this paper we derive the existence of and a parameterization for
the local and global minima for the (total) least squares estimation of
linear models describing an infinite sequence X of data in a (separable)
Hilbert space.
nonlinear variational problem; e.g. for line fitting this is the problem
of finding the minima of the (least squares) distance function
(*)
fX : ([
]pOO -+
lR
problems of estimation and control as nonlinear filtering and the attitude control of flexible spacecraft of an infinite-dimensional realization theory, replete with controllability and observability criteria, is
widely appreciated.
mensions and the linear theory in infinite dimensions are highly developed.
e.g.
488
Returning to (*), we note that the solution obtained in finite dimensions by Byrnes and Willems [7] used a blend of the two techniques
mentioned above; viz. by viewing [ pn (or Grass (d,n)) as a homogeneous
space, they studied the Morse theory of
coadjoint orbits.
fx
fx
in this paper, we show that certain workable analogues of these finitedimensional situations exist by analyzing (*) in terms of a generalization of Smale's Morse Theory, ideally suited to the critical point
analysis of linear functionals restricted to "finite rank orbits" of the
infinite unitary group.
1.
INTRODUCTION
In this paper we want to describe how two basic tools in the theory
of nonlinear systems, the Morse theory of critical points and values and
the Lie theory of Homogeneous Spaces, can be extended in a manner applicable to infinite-dimensional control and estimation.
goal is to develop basic control theory (e.g. controllability or observability criteria) for left-invariant or for Morse-Smale systems as completely as in the finite-dimensional setting, we illustrate and apply
our developments in the situation in which they originally attracted our
interest:
H as follows:
= Ej
(x.)
1
Aij e j
H is given by
= TrCP = TrC
- TrCQ
where
489
I - Q,
and
A*A where
CA .. ).
1J
Lemma
fCP)
TrCP
C is of Trace class.
Proof
If
A*A
H,
TrCCP)
is Trace Class.
A is Hilbert-Schmidt.
In par-
lR
(l.la)
H via
~(n) = Tr(CQn)
as a function on Grass (d,H).
We remark that the estimation problem on Grass(d,H) corresponds to
a functional approximation problem for a set of
of
basis vectors in
H.
L2
functions by a set
From
any number of points of view (see especi.ally [9] or Section 3), Grass
(d,H) can be endowed with the structure of a complete Riemannian manifold in such a way that fx is smooth. In this setting, the well known
condition for the existence of a Morse theory on an infinite-dimensional
manifold is Condition CC) of Palais and Smale ([13] and [15]).
Let
490
Then we say
J: M -)- lR satisfies
If
II J' (x) II
of
is a subset of
M on which
IJI
Unfortunately,
fX
S.
does not satisfy Condition (C), as can be seen
fx
Ifxl
d)TrC
<
TrC(I - Q)
fx
TrC
to be
equal to
lim inf
(x )
n
Then:
J (x ),
n
and suppose
{J(x )}.
n
or
Then either
(i)
(ii)
J,
holds.
and also
Roughly
JP
Here,
491
(A), each
n
A i 0, being ison
shown that
fx '
fX
More
By the
is perfect.
X,
This applies in
X.
In the present
]p'" -+
lR
(1. 2)
a: 1''''
Mf
(t)
is given by
(t)
(1. 3)
fx
U(H),
Viewing
ran~'
as a coadjoint
a:]P'"
fX
as the
orbits, one can show (see [4J) that this system is com-
pletely integrable.
*
11+ Cl",
which may be defined as
P: a: I''"
-+
= diag (~)
Just as in the finite-dimensional case, the image of
P(~)
is convex; i. e.
492
Corollary
+
X on 11 such that
Therefore, the minimization problem (1. 2) is equivalent
x 0 P = f X '
to the linear programming problem
min x(y) : y e K.
2.
g: M -+ :R is a
tive to
is a field,
C(g) = YN.,
J
C(g)
= T (N.)
V x b N.
x
X
1
1
Of course, provided (i) holds one always has
T (N.)
x 1
c:
x e N.
N.
at
xI
x.
at
is full rank in
Xl
(iii)
f\
=L
~.ti
.
1
N with respect to
If
N is
H.(N;F), recall
1
is the polynomial
(2.2)
(2.1)
Neg (Hess(g)) -+ N.
any compact manifold and
e N
F will
be made explicitly.
Next one forms the Morse series of
M (t) =
g
I
j
. (N.)
t1 J
PN. (t)
J
g
(2.3)
493
where
x eN ..
J
In case (i) - (iii) are satisfied, the Morse-Bott inequalities take the
form
a. e JNV{O}
(2.4)
Bott function.
fX :
a:
WOO ~ lR
Then, if
is a Morse-Bott function,
is perfect if
M (t) = M (-t).
g
g
Here we fix F = ~.
Now, as argued in Section 1, f(P) = TrCP does not satisfy Condition (C) of Palais-Smale but does satisfy condition (*). Moreover,
while this can be shown from explicit knowledge of the critical values
of f as will be seen hereunder, this is clear ~ priori from the
spectral theorem.
We can see this firstly by an analogy with Quantum Mechanics. We
regard C as a quantum mechanical Hamiltonian (a self-adjoint operator)
on the Hilbert space H. The main problem in quantum mechanics is then
to find the eigenvalues and eigenvectors of the Hamiltonian, corre~
sponding to its energy levels and stationary states respectively.
Now it is easily seen that this corresponds to the problem of
finding the critical values of the function
h(p) =
((~,~))
z e H,
p = AZ,
Ae
a:*
P(H)
a:w
00.
But
C.
Since
~hus
494
Regard
iQ.)
Then a critical
TrCQ is given by
TrC(Q + [~,tAJ) - TrCQ = Tr[C,Q]A = 0
lim
t~O
Since this must hold for arbitrary A, by the Hahn-Banach theorem this
implies [C,QJ
Since
O.
C.
Hence it is
C is enough to determine
On the other hand, we can see that f takes a form here which
enables us to argue that the Morse-Bott theory developed for functions
satisfying Condition (C) may be applied directly to f. Since, as will
be seen below, 1m f is a half-open interval (bounded above by TrC
which we may normalize to be equal to 1), we may consider the function
f = tan (If) say. Then f does satisfy Condition (C), while taking
tangents does not alter the critical point structure of f.
Proof of Theorem 1
To prove that
fX:
a: ]pOO
have
a:]POO
f(P)
f(P).
= TrCP.
C may be "diagonalized"
as
z.
Let
:l
495
g =
L c.1 (x.12
~(
2
+ y.) + \(1
1
L (x.12
H.
2 )
+ y.)
1
i
i
The solutions to the first order conditions are given as follows (for
c.
\ some i.
that
c.
Suppose
ci
has multiplicity
Then we require
2
Xi + Yi +
2
2
+xi+m+Yi+m=l,
H rather than
00
analysis in
a: lP .
Now, from the second order equations we see the Hessian for \ =
C.
is given by
Thus we have shown that all critical manifolds are of even index
with even Poincare polynomial.
Hence
Mf(t)
is even and f
is perfect,
c.
(with
TrC = 1),
a: lP 00
fare given by
1 - cl '
1
i 1m f.
a:
WOO -+ 1R
a: WOO
496
M, Xf .
{f,g}
on
and
is given
a)
w(w (Cn) = 0
x
n b T M
x
Lemma
The finite rank orbits of
U(H)
are symplectic.
Proof
Let x be a point on the adjoint orbit, and let
vectors to the orbit at x. Then we can write
s,n
be tangent
s=
wx(s,n)
= Tr
x[a l , a 2].
Now
4.97
For suppose
= [x, a l ] =
wx(~,n) is non-
[x, ai].
[x, a l ] = 0
We now define a version of the moment map for finite rank orbits
M as follows:
Suppose we have a Hamiltonian action of the infinite dimensional torus
Too
on
M: Too
tions
f.
M -+- M.
The map
... , which
.
to 1* , but in the
(f l ,f 2 ,
M
00
Q in
11
100* .
M to
f.
to be the
I[ lP
00
U(H). Taking
i~
the
Then set
Q= z
q.
Ill.=l}
.
1
1
Q by
REFERENCES
1.
2.
3.
4.
A.M. Bloch, Total Least Squares Estimation and Completely Integrable Hamiltonian Systems, Ph.D. Thesis, Harvard (in preparation).
498
5.
A.M. Bloch and C.I. Byrnes, 'Morse Theory on Trace Class Orbits',
to appear.
6.
7.
8.
G.H. Golub and C.F. van Loan, 'An Analysis of the Total Least
Squares Problem', SIAM J. Num. Analy. 17 No.6, (1980) 883-893.
9.
10.
11.
12.
13.
14.
15.
16.
S. Watanabe, Karhunen-Loeve Expansion and Factor Analysis, Theoretical Remarks and Applications, Transactions of the 4th Prague
Conference on Information Theory.
Parameterr~ume',
R. SCHOTT
ABSTRACT
In non linear control theory many systems are governed by an
equation of the type
q E IRN
and
q(t)
(Ao + L u. (t)A.)q(t)
1
Ao ,A 1 , . ,An
are
NxN
(I), where
square matrices,
fot
a (t)
n
q(t)
(A 0 +
L u. (t) A. ) q (t)
i-I ~
~
(1 )
499
M. Fliess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory, 499-510.
1986 byD. Reidel Publishing Company.
500
R. SCHOrr
The state q is
matrices, ul, .. ,un
Suppose that the
(l) is known and has
q(t)
~.
= [1+ E .
v~o
Jo, ,J
=0
Vfot
(2)
is defined in the
following way :
Jos u. (T)dT
s, B. (s)
~
B. (t)
J
This means that the equation (1) can be solved for each
path.
The Malliavin calculus permits to prove some regularity
properties of the solution and to prove in particular that
the solution has a density.
In order to be able to control the effect of the random
perturbation at each point, we establish some probabilistic
properties of the iterated stochastic integrals which appear
in (2). More precisely we prove that if Bl ,B 2 , ,B n , n E IN
are independent linear Brownian motions then
a (t) = Jt dB dB 1" .dB l , n E IN, has the same law as
n O n nt n/2 a n (l) and that there exist constants A ,B ,C ,D ,E ,F >0
.
.
f an~1)
~
n nwe nhave:
n n
such that ~f
gn (
u ) .~s the d
ens~ty 0
A e
-B lul 2/n
n
luln/n-1
-D lul 2/n - 1
n
lui
~ g (u)
501
a (t)
n
(B')'_I
2
1.1-"
,n
are inde-
Our method is based on an iteration for obtaining this approximation. However the iteration is not possible directly on
an(t) and we begin with some remarks.
2.1. Lemma: a (t)
n
n.
If n = I, we have al(t) = Bl(t) and we know that the property is true (this is a classical property of the one dimensional Brownian motion).
We suppose that the property is true up to order (n-I)
and we prove that it is also true on order n.
an(t) is defined as
t
[p t]
k [: k+ 1
k ]
a l(s)dB (s) = lim E a 1(-) B (---)-B (-)
a (t) =
n
0 nn
~ k=o
n- p . n p
n p
(t
--2--
(an-I (k),
k \
a n- 1 (t)' It Bn(t)'
Therefore,
pE a 1 (-)
k' [k'+l
k']
.
(t)
iut n/2 lim
B (--)-B (-)
{
1uan
k'=l n- p
n
p
n p
}
E(e
)=E e
~
. n/2 f l
E{e 1ut
0 a n_ 1 (S)dBn(S)}
E{e iut n/2 an
(I)}
and we conclude that a (t) has the same law as tn/2a (1).
Let (A )~I be a ~equence of independent centered n
Gaussian raRdom variables with variance 1 which is independent of (Bl,B z , Bn ).
502
R. SCHOrr
: a (I)
Proof. Setting
An(I~
vn:
(a n_ 1 (S2dS)1/2
we have
iUA v
- ~ v
n n) = E (e 2 n) where An is a centered Gaussian
E (e
variable with variance I.
Now,
2 1
iUA v
[ - u2 fo[a _1(S)]2 dS ]
E(e
n n) = E e n .
On the other hand,
1
iua (I)
[ iufo a _1(s)dB (S)]
E(e
n
) = E e
n
n
E{ e
because the
(B (k+I)_B (~
n
independent of the
E(e
iua (I)
n
p-*oo
n p
k
a n- 1(-)'
p
and therefore
E(e
iUA v
n n).
p[ sE[sup0,1]lan(s)L~ x] ~ 2P[lan
(l)I
x].
The proof of this result is easy, by recurrence. We begin with the classical case of a one dimensional Brownian
motion. For n = 2 the proof was given in [I] and the lemma
2.3. can also be presented as an extension of this case. Now
we will give the main result of this section, namely the
approximation theorem of the density.
503
2.4. Theorem
Let
gn(u)
be the density of
a (I). Then,
n
::;; gn(u)::;;
where
(n_I)2 n
I2TT lui
{luI 2 / n }
exp - - - 2
(n-l) !
p( (I6
(an-I
(S2dX)I/2~x].
Vx E IR , Vn
Proof of Lemma 2.6 Let us introduce two definitions to simplify the notation.
Let
vn
= (fo1
(an _ 1 (s
ds)
1/2
Let V and W are two random variables and a ~ I. The notation V W means that P{V ~ x} ~ aP{w ~ x} for all
a
x E IR.
Now we begin the two steps of the proof.
504
R. SCHOrr
a) Lower bounds
We define on [0,1] the following sequence of functions
f 0 = 1
fs
fk(s) =
fk=I(t)dt
for
n-I
wk = If fk_l(s)an_k(s)dsl.
We note that
ak
we obtain
If~fk_l(s)(f:an_k_l(t)dBn_k(t)dSI
wk =
'f:an-k-l (s)fk(s)dBn_k(s)I.
As in the lemma 2.2 we shall show that
law
(f~fn_l(t)dt)2ds)I/2
b) Upper bounds
The maximal Lemma 2.3 gives
v
sup Ia
[0, 1 ]
n-
1 (s) I
~<
la n_ 1 (1) I
(n-l)!
505
n-l
n-l
v
n I Ak I I A I
n 2(n-l) k=l
n-l 1
This gives the right hand side of the inequality in Lemma 2.6.
Completion
of the proof of Theorem 2.4.
it is easy to see
Since an(l) has the same law as A v
n n
that the density gn(u) satisfies
_u 2 /2v 2
en}.
g (u)
n
I2IT
vn
E{___
l
If
ljiu
-u 2 /2v 2
e
(2n) 1/2v
for
o<
v < u
for
uv'21T"e
we deduce that
E(Iji (v )]- _1_ P(v
u/2ne
~ u) ~ g (u) ~
E[1ji (v)]
nun
(1)
and we are now ready to prove that Lemma 2.6 gives the approximation of Theorem 2.4. In fact we have
gn(u) ~ (n-l)2
and
E{ljiu(IA11
n-l
n-l
)} ~
E{ljiu(IA11
f
IAI>u
n-l
l/n-l
)}
exp(-A 2 /2)
2n
2
ure
exp(-u 121AI
dA +
2(n-l)
)exp(- 2 )dA
2n IAl n- 1
Upper bounds can easily be found for the two terms on the
right hand side of the above inequality and we obtain,
2/n
2 exp(- lul2
)
E{ljiu(IA1I n - I )} ~ ------~
v'2iT lui
506
R. SCHorr
and
g (u) ~
n
2 n
lul /
...:...;:;..,=--_ _ _ _ _ _ _ _2_ __
(n-I)2 n exp(-
I2TT lui
l} -
(n_l)2 n- 1
u
ane
p JAIl
n-l
> u .
The second term of the right hand side has the upper bound
(n-l)2 (n-I)
2TT
Ie lul n / n- 1
exp(-
luI 2 / n - 1
2
).
For the first term, a lower bound can be obtained for example
by integration over the interval
This gives
2(n-l)
9
}
2/
1
- exp-{2
+ 8 (n-I) lui n
2r2
r
and finally we have
1
2 n-I 1
-{ 2 2 (n-I )
9
}
2/ n
gn(u) ~ (2TT)n/2(3)
exp
2r2
+ 8 (n-I) lui
n
luI 2 / n - 1
(n-I)2 exp2
2TT
re lul n / n - I
Now we shall prove that the method used above can also be
applied to some stochastic integrals of the type a (t) if
the Brownian motions (B')'
.I 2
are not all rndepen1 1= , , . . . ,n
d ent.
507
f~ B~dB2
(where
Brownian motions).
1
In this case
aa(l)
= JOB~dB2
and
va
We also have
Va
sup B21 2
[0,1]
B2(1) = a (I) = A V
1
p{(J~ B~(S)dsy/2
>x}
2\{IAl12
~x}
A that
and the calculus is of the same kind as in the proof of Lemma 2.6.
Therefore Theorem 2.4 is true for this stochastic integral
which proves that the asymptotic behaviour of the density
is that of exp(-A u 2 / 3) where A is a strictly positive
constant.
fb (f~ B dB )dB
2
508
R. SCHOrr
2.7. Lemma
We have
f~[f:Bl(U)dBl(U) ]dB1(S)
=
forf
1
(2)
Proof.
Let G be the left hand term of (2). An integration by parts
gives
G
f~
Bl(t) =
rt
J
Bl(u)dB1(u).
509
A[J~(J:Bl(V)dBl(V)ydSr/2
the stochastic integral
>(
(O:B1(V)dB (v)}s
= J~(J:Bl(V)dBl(V))
E Ic.
INP
11""
. 12 <
+00
,1p
R. SCHOTT
510
R. SCHOrr
REFERENCES
[1] R. BERTHUET. Une loi du logarithme itere pour certaines
integrales stochastiques. Note aux C.R.A.S.,
PARIS, t. 289, 17 dec. 1979.
[2] P. CREPEL et B. ROYNETTE. Une loi du logarithme itere
pour Ie groupe d'Heisenberg, Z.W. 39 (1977)
217-229.
[3] K.T. CHEN. Iterated path integrals. Bull. Amer. Math.
Soc. 83 (1977) 831-879.
[4] M. FLIESS. Stabilite d'un type elementaire d'equations
differentielles stochastiques a bruits vectoriels. Stochastics (1981) vol. 4, p. 205213.
[5] P. LEVY. Wiener's random function and other Laplacian
functions. Proc. 2nd symposium, Beckeley,
Probability and statistics (1950), 171-187.
[6] R. SCHOTT. Tail probability of some random series. Lecture
Notes in mathematics nO 1064, p. 418-421.
[7] R. SCHOTT. A law of the iterated logarithm for some sto-
C. HESPEL*, G. JACOB**
* I.N.S.A 20 avenue des Buttes de Eoesmes
35043 RENNES CEDE X
**Laboratoire de Recherche en Informatique Fondamentale
(LA 369) - Universite de LILLE I
59655 VILLENEUVE D'ASCQ CEDEX
ABSTRACT
Given an analytic system, we compute a bilinear system of minimal dimension which approximates it up to order k (i.e. the outputs of these two
systems have the same Taylor expansion up to order k).
For that purpose, we use a noncommutative formal power series called the generating series of the system. Let s be the series, and y the
output of the analytic system : we notice that the Taylor expansion of
y up to order k in t=O can be obtained from the coefficients of the
words of length not greater than k in the series s. As rational series
are characteristic of finite dimensional bilinear systems, the problem
is reduced to the following : build a rational series g, which is an approximation of s up to order k (i.e. the coefficients of the words of
length not greater than k in g and s are identical), and which is of minimal rank. Then with g we associate a bilinear system, which is a solution to our problem.
INTRODUCTION
Several methods may be used for computing the input-output behaviour of
a control system : transfer functions, functional expansions, especially
Volterra series, and generating power series.
All these descriptions allow to approach the following question :
Is it possible to give an approximation of the input-output behaviour of
any system by that of a more elementary system, like for instance a
linear system ?
The transfer function of a single input system can be used to build
a linear approximation of this system, via the Pade approximants [1,2J.
In the same way, the use of Pade approximants in two commutative
variables allows to build approximants of a system when its output is an
analytic function of several inputs [1]. This method is connected with
the continued fraction expansions, and the orthogonal polynomials [1,13J
However, in general, it is impossible to approximate nonlinear systems by linear ones.
511
M. Fliess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory, 511-520.
1986 by D. Reidel Publishing Company.
512
1.1.
=L
(s,w) w
where
\II=X
x.
JI
Jk
for
k ~ 0 and x, ... X
Ji
is
equal
to
k).
(w is called word and its length Iwl
The output, y of system (S) may be expressed in the following way
w
513
yet)
where
ow
=E
(s,w) It ow
0
for w=x x .
Jl
Jk
The generating series may be rewritten in the form
s =
hiq (0 )
E
\1:)0
J\I hi q (0 )
E A . A .
.
.
Jo,,J\I=o
Jo
(8)
bilinear system
approximating (5)
s
______'"~
g
k ______~>
generating series
rational series equal to s,
of (5)
for the words of length ~ k
diagram 1
the method
514
... (s)
~p,~q
H~k(s),
... -l.o''''l.o
)
the family of these column vectors, indexed
vj j=l r
by words Vj (diagram 2).
m
q=(E u.(t)M.)q(t)
i=.o l.
l._
by computing the matrices Mi given by
[10] :
515
/// .. '/
V /'
~
i~
?:~,/A
"
lul=k
I ,//1
1/
diagram 2
[F
I
lul=k-l
~
~,
, 1//1
I
f
I; I
H~k (s)
2. PRACTICAL COMPUTATION
2.1. Direct application of the method
Consider the nonlinear differential equation relatin~ the current excitation i(t) and the voltage v(t) across a capacitor L6] :
(E )
i.e.
lIIith
{q(t)
v(t) = q(t)
= -k l q(o) - k2 q(o)
= -k l - 2k2 q(o)
= -2k 2
2
and
d
A0 = adq
d
Al = dq
v~
E
Aj A. q/ox . x.
jo, ,jv=o,l
0
Jv
Jv
Jo
"In
EIN
516
\In
E IN
n
xoxl
e
e n+l + nb e n + n(n-l)
2
c xn-l
xl
xl
I
=a
0 0 0 0
abc
o a 2b
o 0 a
Mo =
(p-l)(p-2) c
2
(p-l) b
MI =
I)
o I
p(p-l)
basis
expressed in
pb
0 a
= (ex xn)
= (e n+l)expressed in basis(e n)
xl
xl
= A q(t)
where
q(o)
= (q(o) I 0 0)
~W
for J/,
j=l
ei
= O,l, ,m
517
il i2
iN
a IR-automaton whose states are the vector fields Dl D2 DN In
order to build this automaton, we compute the product of these vector
fields and AR, :
N dj,N,R, dj,N_I,R,
= E E
E
j=l p=o
q=o
(v)
ej(p)(q)
R,
q~ q~-l q'l
irv
iN_I-q iN-p
DI
DN_I
DN
Dj
is the degree of e~ in qk
j,k,R,
'"
Every term of the sum on the right-hand side of this relation can be
connected with one transition of the automaton for letter xR, (diagram J)
where d
il-v
iN_I-q iN-p
DI
DN_I
DN
Dj
diagram J
The initial state is vector field I
iN
il
Eyery st?te DI DN is a final state whose weight is equal to
~l
~N
DI DN
h/q(o)
il
iN
We shall call minimal order index of state DI DN ,the length of the
shortest successful path labelled by a word
. that gets through
this state. Then, the number of states whose minimal order index is ~ k ,
gives us a dimension (not necessarily minimal) of the state-space.
Consider the Duffing equation
(E 2 )
{r
Ao =
aq2 + q2 aql =
D2 + q2 DI
Al = aq2 = D2
We obtain the actions of Ao and Al on states
Di
D~
518
oj Ok A - oj Ok+l
1 21- 1 2
or
o~ Ao
= F oi O~+l +
00'=7
@q=6
(1'=
(diagram 5)
6 =5
(1=1
For k =7,9 states are enough to compute the approximant. (the minimal
order index sr is written in the diagram, for every state). We shall see
that the dimension, computed by the method of the truncated automaton,
is not necessarily minimal.
2.3. A quotient IR-automaton of the truncated automaton (cf. [10]) .
We use section 2.2., and compute the minimal rank.
Let us go back to the Ouffing equation (E 2 ).
The method developed in 2.1 leads us to compute the rank of the
matrix M defined as follows: (for k = 7)
its rows are indexed by the 9 states selected in 2.2.
its columns.are indexed by the words of length ~ 4
for state 01 o~ and word w, the corresponding term is given by
j
0102w
__
0102A. A. A. oh
J1 J2
Jn
with
w=x. x . x .
J1 J2
In
519
Xo
-a
-aF+ql'
F'
q2 F"-aF'
F"
a 2+F'
-a
F(a +F')+q2(q2F"-aF')
F"
Fill
F"
o
o
F' (a +F')+FF"+q2(ql""-aF"
_a(a 2+2F' )+2q l"
F"(a 2+3F' )+F'" (F-aq )
2
-2aF"+2q 2F'"
2F"
3F,,2+F," (a 2+4F')
-aFIII
Fill
Fill
-2aF'"
2FIII
2
2
The rows indexed by states DID2 and DID2 are linear combinations of the
Table 1
Order of approximation k
6.
15600
30000
5700(
2500 7300
31900
dimension (according to
method 2.2)
10
14
22
30
+ MULTICS-MACSYMA
520
REFERENCES
C. BREZINSKI,'Pade-type approximation and general orthogonal
polynomials', INSM 50, Birkhailser.
[2J
[3J
~,
1974, p. 197-222.
M. FLIESS, 'Un outil algebrique : les series formelles non commutatives', in Mathematical Systems Theory (G. Marchesini and S.K.
Mitter, Eds.), Lect. Notes Econom. Math. Syst., vol. 131, Springer-Verlag, 1976, p. 122-148.
-
[5]
[8]
nO
~,
springer-Verlag.
C. HESPEL, 'Approximation de ser~es formelles par des series rationnelles', RAIRO, Informatique Theorique, vol. 18, nO 3, 1984,
P. 241-258.
-
[10]
Ell]
[12]
[13]
Int.
Applications
Riccardo Marino
Seconda Universita di Roma, Dipartimento di Ingegneria
Elettronica
Via O. Raimondo - 00173 Roma - Italy
1.
INTRODUCTION
d .
~ = f(x) +
dt
Xo
i=l
n
CR
which
are
m
(1)
+ m
is the state, u(t): R +R is the control, say
m
Xo
R. MARINO
524
DEFINITION 1.
di
dt
AX +
i=1
b i vi(t) ~ AX + Bv(t)
(2)
= ~(x)
(3)
- a state feedback and linear state dependent change of coordinates in the control space
u
= a(x)
+ S(x) v
(4)
f(x,u)
(5 )
can be found in [35] for the single input case and in [38] for the
multi-input case. Results presented in [15] have been stimulated by
pioneering design techniques developed in aeronautics ( [30], [31])
Recently the more general concept of partial feedback linearization
has been introduced. Conditions which characterize the largest feedback linearizable subsystem were obtained in [18] for the single
input case and in [26] and [34] for the multi-input case.
Feedback linearizable systems are appealing since, once the
transformations (3) and (4) are determined, linear control techniques
apply and are employed in the design of v(t). On the other hand, the
application of the overall nonlinear control technique has raised the
following problems.
a. Necessary and sufficient Lie algebraic conditions, which are
rather restrictive, have to be checked:
this can hardly be done by
hand computations. Computers equipped with symbolic languages can do
it at the actual state of art provided that, roughly speaking, n is
not larger than ten (the reader is referred to [9] and [101 for a
detailed discussion of this point). Approximations of those conditions
have been proposed in [19].
b. The domain of transformations (3) and (4) needs to be
MO
Mi
Mn-m(x)
525
R. MARINO
526
k., , = n.
More generally, as
j=1 J
m
shown in [22] and [23], a set of controllability indices can be uniquely associated to any nonlinear system (I). If their sum equal n,
the state space dimension, the system is locally feedback linearizable.
In any case, their sum gives the dimension of the largest locally
feedback linearizable subsystem.
The construction of the feedback transformation which takes (I)
into (2), where (2) is in controllable canonical form, involves the
iteration of the following two steps for i = 0, , m-l [28]:
a. determine a function '0 +1 such that d~ +1 vanishes on
i
i
Mki+I-2
and is independent of d~j for 1 ~ j < 0i+1 in Ux ;
00 = 0,
b.
1 = k 1 , , 0i =
Let
compute,
Li- 1 ,
i+j
i+l
for j = 2, , k i + l
The new coordinates are given by";c = ~(x), where, =('1' " ) ' is a
local diffeomorphism in U : they are not unique and should He carexo
fully constructed since their choice will affect the final control
algorithm. Transformation (4) is given by
-k
vI
1
Lf '1
v2
Lf
k
m
Lf
'0 +1
1
'0
m-l
k -I
1
L
'1
L
gl f
k -1
2
L
'1+ 1
gl Lf
k -1
+1
a(x) + Sex) u
L m
f
gm
k -1
1
Lf
'I
k -I
gm L/
~01+1
k -1
L m
f
u1
'0
m-I +1
u2
(6)
527
Note that step a. involves the solution of systems of linear partial differential equations.
Considerable amounts of symbolic computations are required in step b. and in order to check conditions (i)
and (ii) in Theorem A.
In [9] and [10] an algorithm is presented
which. among other things. allows the automatic verification of conditions (i) and (ii) on computers equipped with symbolic languages
such as MACSYMA or REDUCE. The construction of the feedback transformation can also be performed automatically excepting the solution of
systems of linear partial differential equations (step a.). which
remains the most serious problem in the construction of the control.
The functions ~1' ~ +1' ~
+1 can be interpreted as m
a1
am-I
output function:
the equation (6) allows then to compute a state
feedback decoup1ing control in terms of u which makes the input output
behaviour from the new inputs (VI' v) linear. contro~lab1e and
m
decoup1ed (see [22]).
The overall control scheme can be interpreted in terms of the
block diagram in Fig. 1.
Reference
Model
e
v
Plant
Adaptive
Outer Loop
Control
Fig. 1.
The plant is given by system (1). the inner loop control law is
given by (4) and the reference model is
[iLJ
d~
or equivalently
-1
A~(x)
R
[iLJ
dXR
-1
Bv( t)
(7)
528
R. MARINO
(8)
where
xR =
~(xR)' ~
being the
local
diffeomorphism
defined by (4).
~fi(x.t)
x~
0 and every i = 1 n.
The
(iii)
M(x.t) G(x)
t ~ to'
B G(x)
x~
f(x) - Ax G(x)
x~
x~
(10)
0
0
529
(2) are also allowed, necessary and sufficient conditions are available (Theorem A) for the existence of an inner loop control (5) such
that (see [121 for a proof of these two statements):
e(t) = 0 for every t>t o ' if e(t O) = 0;
if e(t )
o
* 0,
then
lim e(t)
t+~
=0
under
Xo
= -6 -1 (x)(l(x)
a-1 (x)
(11 )
-1
[X]
-1
A
,(x) =
f(x)
= a-lex)
~f(x),
R.MARINO
530
In fact it cannot be e(t) = 0 for every t>t o when e(t O) = 0 for two
reasons:
uncertainties are present in the plant; the inner loop
control (6), which may be too complex, has been simplified.
It
is clear that more sophisticated simplification techniques can be
used.
However, in general, simpler control laws can be obtained at
the expenses of higher control signals.
The design techniques discussed in this section will be applied
in Section 3 to robot manipulator models and in Section 4 to electric
power system models.
3.
Consider an open mechanic chain of N+I rigid bodies (links) interconnected by N actuated joints. Let qi' i = 1, " ' , N, denote the relative displacement between links i and i-I. Let u i be the generalized
force delivered at joint i; u = (u 1 ' , uN)T represents the control
vector. We refer to non redundant mechanic chains. The configuration
T
vector is q = (qI' " ' , qN) and the state space vector is x
T
qN' qI' " qN)' The kinetic energy is given by
T(q,q)
1 .T
=2
= (ql"'"
B(q) q
where B(q) is the NxN symmetric and positive definite inertia matrix
whereas the potential energy U(q) is due to gravity.
If u is considered a vector of external forces the Lagrangian equations of motion
are
j
1, , N
(12)
q+
(13)
a(q,q) = u
= a(q,q)
+ B(q) (K .q + K
p
q+
vet)
(14)
(K
K oq + K oq + vet)
p
531
+ K oq. + v(t)
v R
The inner loop control is (the subscript I stands for "inner")
q
K oq
P
= K oq + K o~ + v(t)
I
P
v
The closed-inner-loop system becomes
"outer")
= -a(q,~) +
B(q)q
K .q + K
e=
Ke + K
p
e+
= qR
(the
q+
subscript 0 stands
v(t) +
for
- q)
oq
+
(15)
> 0)
if
(16)
if
are designed to force and maintain the system very close to the hyper-
surface
+ Ce = 0 (see [2], [37]~ [39] for specific techniques) the
averaged control action is evaluated as follows (C = diag[c , ,c ])
1
e+ c ~
Kp e + Kv
e+
(I
-1
532
R, MARINO
or
u
av
Hence
u
+ u
av
= a(q,q)
e = Kp e
+ K
e+o.
The
(17)
= -C
(18)
The strategy (16) demands for control signals which are less
+
complex with
inequali-
ties) but in general higher in amp1itudeo Note that only bounds for the
non1inearities a(q,q) are required. In the particular case of the
robot manipulator this feature greatly simplifies the construction of
the dynamical model.
A similar control scheme which employs both loops in Fig. 1 is
proposed in [3] and applied to robot manipulators in [18] via hyperstability theory (see also [32])
u
It(w,x) x + W(w,v) vi - H x + H v
x
(19)
where
x = (q ,q0) T
w
De
w
T
t = r IwO (sgn x)
w
1jI = s Iwn (sng v)T
r, s suitable scalars, D, H ,H
p
constant matriceso
x + H v,
Example 2
Consider now a single link robot arm actuated by an electric motor
through a flexible joint, for instance a high-ratio, high-torque gear
mechanism. Flexibility is modeled by a linear torsion spring between
the rotor and the link. The Euler-Lagrange equations of motion are in
this case (friction and viscous phenomena are neglected)
where J 1 and J 2 are the moments of inertia for the link and the rotor,
6 1 and (6 1 + 62 ) the corresponding angular displacements, M is the
mass of the link, k the elastic constant, u is the control, i.e., the
torque delivered by the motor. In state space form the system is
MgI
- sin 61 + ~ 6
111 = - J1 2
J1
62
= 112
1
1
1
sin 6 - (-+ - ) k6 2 + T
u(t)
112 =~
J1
J2
1
J1
2
= (6 1 ,
it is easy to check that conditions (i) and (ii) are satisfied and
therefore the system is locally feedback linearizable. By using the
change of coordinates
533
R. MARINO
534
= - Mgl n
J1
cos 9
+ k n
~ 2
k
Mgl
n ) - v(t)}
+ a 4 ( - - n cos 9 1 + J1 2
J1
1
the closed loop system becomes in new coordinates
4
i=1
4.
a y
i
+ v(t)
2
d 6i
1
-2- = -M (p i - P i)
dt
i
m,
e,
where k ij
P -
Ei E.
Mi Z.~ ; let x = (6 1 , ""
Xo
~]
j=l
ij
cos (6 i -6].+aij )
(20)
be
[20].
535
Proposition 1.
Assume that only one independent control u i is available and acting at node 1. Then (20) is locally feedback linearizable
in any U
C 9 n = {xe:Rn : I <Si_1-<Si+<X i _ 1 I '" t
i=2, ,N, i odd te:Z},
T'
Xo
if, and only if, k ij = 0 for j '" i-I and j '" i+1, i = 2, , N-l, and
k i ,i-l '" 0, k i ,i+l '"
o.
Proposition 2.
U
Xo
C Sl =
{URn
k32
kS2
k N- 1 ,2
k14
k34
kS4
k N- 1 ,4
kIN
k3N
kSN
.......
N-l,N
If independent
536
R. MARINO
n
- Pi +
ui
k ij cos (~
j=l
~.
<l
ij
)-
- ali(~i - ~O)
- aZi (Wi - Ws ) + vi(t)
i
where (ali' a 2i ) are design parameters and
i=l, ... ,N
... ,
~N'
00,
(21)
,(0)
s
d2~R
i
_ ~O) - a (WiR - w ) + vi(t)
-2-= -ali (~R
s
2i
i
i
dt
where
vi (t)
sets
the
reference
equilibrium point.
i-1, ,N
The
simplified
-a1i(~i
~~)
- a 2i (wi - Ws ) + vi(t)
satisfies the differential equation
j=l
- ali
(~i
~~)
- a 2i (w i -
W~)
+ uO,i
We choose
loop control
the
discontinuous
(variable
structure system)
outer
if
if
(u;,
537
This implies
+
ui
<-
Pi
j=l
k iJ cos (5 i - 5j + Qij) -
00:)
One can also treat the left hand side of model (20) as a bounded
uncertainity: in fact condition (lOi) is satisfied. In this case no
full state knowledge is needed at the expense of higher control power.
The following inequalities can be used (see also [40])
n
+
ui
<-
P.
ui
>-
P.
l.
j=l
n
l.
j=l
R
kij - aU (5 i - 5R) - a 2i (oo i - wi)
i
k ij - a
(5
R
- 5R) - a 2i (wi - wi)
i
aV,i
- Pi +
j=l
k ij cos (5
- 5j + Qij)
R
) - a21 (w i - wi)
- aU (5 i - 5R
i
- A (5 - 5R)
i
i
i
The overall averaged control action is asymptotically given
lim
t~
e=
0) by
(lim e=O,
t+ao
538
R. MARINO
n
u 1 ,i
+ u av,l..
- Pi +
j=1
kiJ cos (6 i - 6j + a ij )
,,0.)
- ali ( u" i _ ul.
a 2i ( wi - Ws ) + vi (t)
i = I , ,N
~ =
W -
W
S
(22)
Here 6 represent the angle of the rotor with respect to a frame rotating
at synchronous speed ws; w is the rotor speed, JP and J(k 1 e~ sin 6 +
k2 sin 26) are the mechanical power received by the turbine and the
electric power delivered to the electric network, respectively (J' is
the moment of inertia). The first two equations represent the rotor
angle dynamics, which are affected bye' whose dynamics is controlled
q
The state
e~
is proportional to
where x
= f(x)
(6,w,e')
q
+ g vf(t)
1
(w - ws )
~6 +
(p - kl
e~
~w
539
By computing
- 1<,i kS
(1Il -
lIl S )
cos )
Olll -
k3 kS oe'
P - kl
e~
(1Il -
1Il )
(-k
k2 sin 215
(23)
sin
(24)
By using the state feedback control
k3
k4
(1Il - 1Il )
v(x,t) = + -kS e'q - -kS cosl5 - v fo + kl kS sinl5
s
(k
- 1 e q' cos 15 - 2k 2 cos 15)
+
1
(a (15 - 15 ) + a
kl kS sinl5
1
0
2
(1Il -
1Il )
where v(t) is the reference signal and aI' a 2 , a 3 are design parameters, system (23) becomes
R. MARINO
540
v.
CONCLUSION
[4]
[5]
[6]
[7]
[8]
[9]
[10]
[11]
[12]
[13]
[14]
[15]
[16]
[17]
[18]
[ 19:1
[20]
[21]
541
542
[22]
[23]
[24]
[25]
[26]
[27]
[28]
[29]
[30]
[31]
[32]
[33]
[34]
[35]
[36]
[37]
[38]
[39]
R. MARINO
Isidori, A., Krener, A. J., Monaco, S., Gori-Giorgi, C., 'Nonlinear decoupling via feedback:
a differential geometric
approach,' IEEE Trans. Autom. Control, vol. 26, 331-345, 1981.
Marino R., 'Feedback equivalence of nonlinear systems with
applications to power system equations,' Doctoral Dissertation,
Washington University, St. Louis, MO, 1982.
Marino R., Nicosia S., 'Linear model following control and feedback equivalence to linear controllable systems,' Int. J. of
Control, vol. 39, 3, 473-485, 1984.
Marino R., 'On the stabilization of power systems with a reduced
number of controls,' INRIA Conf. Nice '84; Lecture Notes in
Control and Inf. Sci., vol. 62, 259-274, Springer Verlag, 1984.
Marino, R., 'On the largest feedback linearizable subsystem," to
appear in System and Control Letters.
Marino, R., 'An example of nonlinear regulator,' IEEE Trans.
Autom. Control, vol. 29, 3, 276-279, 1984.
Marino, R., Boothby, W. M., Elliott, D. L., 'Geometric properties
of linearizable control systems,' Mathematical Systems Theory,
vol. 18, 97-123, 1985.
Marino, R., Uie-Spong, M., 'Stabilization and voltage regulation
of electric power systems with (p,Q) load' buses preserved,' Proc.
Conference on Decision and Control, Fort Lauderdale, Florida,
1985.
Meyer G., Cicolani L., 'Applications of nonlinear system inverses
to automatic flight control design-system concepts and flight
evaluations,' Agardograph 251 on Theory and Applications of Opt.
Control"Aero. Systems, P. Kant ed., 1980.
Meyer, G., Su, R., Hunt, L. R., "Application of nonlinear transformations to automatic flight control," Automatica, vol. 20, 1,
103-107, 1984.
Nicosia S., Tomei P., 'Model reference adaptive control algorithms for industrial robots,' Automatica, 20, 5, 635-644, 1984.
Respondek, W., 'Global aspects of linearization equivalence to
polynomial forms and decomposition of nonlinear control systems,'
this volume.
Respondek W., to appear in Proc. MTNS '85 Conference, Stockholm,
June 85, C. Byrnes and A. Lindquist eds., Elsevier, Amsterdam.
Su, R., 'On the linear equivalence of nonlinear systems,' System
and Control Letters, vol. 2, 48-52, 1982.
Su, R., Meyer, G., Hunt, L. R., 'Robustness in nonlinear
control,' Differential Geometric Control Theory Conference,
Birkhauser, Boston, R. W. Brockett, R. S. Milemann, H. J.
Sussmann, Eds., vol. 27, 316-337, 1983.
Utkin U.I., Sliding Modes and' their Applications in Variable
Structure Systems. MIR Moscow (English translation), 1978.
Van der Schaft, A. J., 'Linearization and input-output decoupling
for general nonlinear systems,' System & Control Letters, vol.
5, 27-33, 1984.
Young K.K.D., 'Design ~f variable structure model following
control systems,' I~EE Trans. Autom. Control, vol. AC-23 ,
1079-1085, 1978.
[40]
543
C.A. O.
FEEDBACK
LINEARIZATION
WITH
OECOtJPLING,
PERTURBATIONS RE.JECTION AND
APPLICATIONS TO THE DYNAMIC CONTROL OF A
ROBOT ARM
Saint-Hono~,
ABSTRACT: We
and simulate
M. Fliess andM. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory, 545-572.
1986 by D. Reidel Publishing Company.
546
developments
techniques
[8).
In the first section, the main results on D, PR, FL, and on the
system's graph algorithm are recalled, and the integration scheme to be
used for simulations is displayed. Then the main package units are
sketched.
The second section is devoted to two robotics applications: motion
of translation for the end body of a robot arm, and stabilization of an
inverted pendulum on the end body. These applications are currently
being implemented on the robot arm for teaching and research PAMIR,
designed by P. Durand and F. Viette of ESIEE.
Note that these examples have been chosen since each one displays a
degree of difficulty that cannot be completely taken into account by
linear or adaptive techniques (singularities, less controls than angular
variables ).
The control law computation and the simulations are presented for both
examples.
Finally the robustness of the proposed control laws is studied by simulations where random noises are introduced.
1. Theory
1, ... ,p
given in local coordinates in a connected n-dimensional analYjic manifold X, where x E X is the state of the SJstem, Jf:lere u - (u , . , um) ,
is the control,('for transposition), w- (w , .. ,w)' is the perturbation, and y - (Yl'" .,y )' is the output.
The vector fields fO,fl.~ ,fm,gl.""'9w are supposed to be analytic
from X to T( X) the tangent bundle of >C and the functions h 1 , ... ,hp are
analytic from X to ~
For obvious notational reasons , we shall denote D for decoupling,
PR for perturbations rejection, and FL for feedback linearization.
The problem D
when m - p and M - 0 , consists in finding a feedback control law of the
form
u (x, v) -
a (x)
(1)
547
with a,{3 analytic and (3(x) invertible for all x,. v is the auxiliary control, such that the output y. only depends on v~ , in a neighborhood of
X to be precised.
J.
The problem PR
when M ~ I , consists in finding a feedback control law of the form ( 1 )
such that the output y becomes independent of the perturbations w in a
suitable neighborhood.
The problem FL
when M == 0, consists in finding a feedback control law of the form ( 1 ),
such that the input output: v ~ y becomes linear, in a neighborhood of
X.
The three problems or two of them
can be solved simultaneously
(with obvious adaptations of the dimensions m, M, or p ):
The necessary and sufficient conditions for the existence of a
feedback law (1) are now well known in any case ([3], [5], [15], [16],
[17], [23] ); the best formulation for our calculation is the following
( [3], [ 15 J ):
shall denote L f h .(x) the Lie derivative of the function h. with
i J
J
respect to the vector fJ.eld fi evaluated at x :
We
n
..k
Bh.
L f h.(x) =
fi(x) ~(X)
i J
k-l
r:
Definition 1:
The characteristic number p. of order i, i
integer satisfying:
~
3j
Vj
{1, ..
,m}, Vk
Pi
L f L f h.
j
0
J.
1, .. ,p,
(2)
is
the
unique.
< Pi
L f L f h. -= O.
j
0 ~
(3 )
(4)
'13,
Yi
CT.
the
548
(3)
and
( 4) with
Denote:
i
t..(x)
OK
Pi
Lf Lf
h.(x)
j
0
1
i = l, ... ,p
j = l, ..
,m.
( ii) a and
IJD. -
(j
decoupling
solve
the
l, ... ,p
(5)
=B
(6)
with
i
i
Pl
Pp
pi+l.
A (x) - . , (hl(X), ,Lfo hl(x), ,hp(X), ,Lfo hp(X - LfO
h i (x)(7)
where the .,i are arbitrary analytic functions, i " 1, . ,p,
i
i
Pl
Pp
Bj(X) -= IIIj (hl (X), ,Lfo hl(x), ... ,hp(X), ... ,Lfo hp(x
(8)
, ... ,
,i -
1, ... ,p
(9)
(10.a)
549
i-l, .. ,p.
with
(=
(~, ...
( 10.b)
,cPPp )'
i
Pi
(x) ... ., (h.(X),L f h.(x), ... ,Lf h.(x
~
O~
p i +l
- Lf
h.(x)
0
(11)
B.(x) - 0
J
V j
(12)
;I;;
1, ... ,~-:
Ai(x)
Pj
r.
j - l k-o
.,~,k(j
J
(13)
Where the
and the
Bj(X) - IIIj
the
III~
J
FL
( 14)
1, .. ,m.
and
III~
in place of
respectively, i - 1, . ,p , j - l, . ,m
.,i( () and
\II~( ()
550
l. .2. The
sys~em'
s graph
algori~hm
p. and a.
dllriva~il>ns.
we
of
X,
l.
m l.
M
.
u , ... ,u ,w , .. ,w called 1npu~-nodes,
X1 " " , Xn called
s~ate-nodes,
~ iffJe: ~ ;If 0
(resp. gi ) ,
8~
ar
to xr iff ~
;If
( resp.
l, .. ,n;
ab.
a;l
r
Defini~ion 4:
call d( u , y.) the minimal number of oriented arcs of r joining u j
Yi ' and:
1
we
d. 1
Min
j-1,
d(u j ,y.)
,m
d~
1
Min
to
d(~ ,y.).
j-1, ,M
Also, we call r. . the subgraph made of all the minimal paths joining u.
to y. , of lengfhJ di ' for i - 1, .. ,p, j - 1, . ,m.
J
'!'he lMight of a patH is the product of every weights of the arcs of the
path, ~e weight of r 1 j , noted W1 j , is the sum of the weights of all
the paths of r 1 .
'
,
,J
551
Theorem
di - 2
41
Pi
VI-I, ... ,p ,
>
1.
graph
and
1.
charac-
The advantge of Theorem 4 is then to replace, at least generically, an iterative computation of Lie derivatives by the determination
of minimal paths. We want to stress on the fact that those minimal
paths and the decoupling matrix ~ are generically obtained by performing
only first order derivatives to evaluate the weights of the associated
arcs (see definitions 3 and 4).
For example, for a system with 2 inputs, 3 states and 2 outputs, and if
the 2 characteristic numbers are equal to 1, we shall compute at most 24
partial derivatives if we apply directly the definition 1, whereas only
12 partial derivatives are needed with the graph's method.
Furthermore, in non generic cases, the graph still simplifies the computations of the characteristic numbers and of the decoupling matrix,
since we can only start computing Lie derivatives from the order
obtained in the graph.
The computation of a feedback law is thus completed in 3 steps:
computation of minimal paths,
computation of the characteristic numbers and the matrix
552
inversion of /).
Details can be found in [14].
Remark :
(15 )
8p'
_ 8H),
8q
f.
1.
8C.
= (- _2:
8p
8Ci
--)'
( 16)
8q
truncating
the
553
Lf
... L f h. (x( O)}
O J
. 1
Jk
1
k~1
JdC.
t
y.(t} = h.(x(O}}+ I:
Jk
.. dC. (17)
J1
with:
Co(t}
=t
Ju.(s}ds
t
, C.(t} =
J
0 J
,Vj
1, ... ,m
(18 )
and, by induction :
t
t
s
. .. dC . .
J dC . . . . dC. = J dC. (S}J dC.
o Jk
J1
0 Jk - 1
J1
0 Jk
{19}
then
at
the
truncating
1:.
J 1 ,,J k = 0
L f . . . L f . x i (t1 }
J1
Jk
,N=<>,1,
This scheme can thus be implemented from exact fo~l computation of the
iterated Lie derivatives of (20), computation which is done off-line
once for all , with piecewise constant inputs between NT et (N+1 }T
8=1,2, ...
Remark that for piecewise constant inputs'i one obtains in fact .an
approximation of order r of : exp T(fo + ~ u fi)(X(NT}}
In the case of fast variables, the integrns of the inputs can be computed with a suitably finer sampling, or simply replaced by their average.
This method has been tested at the orders 2 and 3 for the robot
Simulations presented below and, for instance, at the order 2, the
resulting precision is sharper than the one that would result from the
Runge-Kutta method of order 4 with adaptive steps.
554
Before to go into details, let us remark that the problem of controlling a robot arm with the same number of motors as the number of
links by decoupling or linearizing is almost trivial as it is often
noted ( see [6] and others ), at least when the output functions are a
globally diffeomorphic transformation of the state variables.
Things become different either when the output functions can be singular
or when there are more links than motors. This is why we have chosen the
two following examples, the first one corresponding to the case where
the output functions have Singularities, and the second one with 4 links
and 3 motors.
2 .1. Model of the robot arm PAMIR
The robot arm PAMIR designed by P. DURAND and P. VIETTE of ESIEE is a
robot arm with 6 degrees of freedom and whose motors and reduction
units, relatively to each link, are fixed on the corresponding
rotary
joints ( see Pig 1).
Moreover, the arm can be considered as rigid,
namely flex1bilities can be ignored.
The model presented here, includes both the robot's kinematicS and the
set
set
set
set
.th
set
1
1:
2:
3:
4:
555
mass
m1
centre of
gravity
OiGi Ri inertias
xg1
Yg1
G1
Zg,
2
m2
G2
Xg 2
Yg2
Zg?
m3
G3
Xg3
Yg3
Zg1
m4
G4
d5
0
0
r1
Y
r2 r2
x' Y
r2
Z
r3 r3
x' Y
r3
Z
o
r
a
r
t
r
u
n
k
r" r"
x' Y
r4
Z
o
r
e
a
r
556
equations of the "motoreductors" (sets made of one motor and its reduction unit). In fact, the motor controls are implemented in voltage and
not in intensity ( or torques), electrical signals being too much
unstable, and thus it results an increase of the system's dimension.
Notice that in practice, this point of view must be taken into account
if we want to avoid using local analog loops for torque controls, more
expensive and less reliable.
2.1.1. Equations of the "motoreductors"
The motors are modelized by the mechanical relations (inertias) and the
electrical relations.
We call:
e the 3-dimensional vector of the motors angular variables,
J the diagonal 3x3 matrix of motors inertias,
V the diagonal 3x3 matrix of viscous frictions coefficients,
K the diagonal 3x3 matrix of torques constants,
I the tridimensional vector of electrical intensities,
Q the tridimensional vector of resistant torques,
r the diagonal 3x3 matrix of inductors reSistances,
1 the diagonal 3x3 matrix of inductors inductances,
X the scalar coefficient of counterelectromotive forces,
R the diagonal 3x3 matrix of reduction ratios,
u the 3-dimensional vector of voltage controls applied to the motors,
9 the 3-dimensional vector of angles between 2 consecutive links (see
Fig.l ),
q the 3-dimensional vector of motor torques applied to the links.
Then we have:
( 21)
rI
+ 1I + xe -
e.-
(22 )
(23 )
R9
Equations (21) and (22) are respectively the three motors mechanical and
electrical equations, and equations (23) are those of the reduction
units.
2.1.2. Dynamics of the robot
6] , (26] ), where the state variables are
the relative angles between 2 consecqtive links on the one hand, and the
corresponding angular velocities on the other hanei.
Let us denote 9. the angle between the links i-1 et i (see Figure 1 ).
In both appli&tions presented below, one can restrict to 3 degrees of
freedom. Moreover, since in the second example the third link is topped
by a stick turning without friction on a ball joint, and since this
amounts to 4 additional state variables, we shall derive the two models
separately.
We apply the classical method
557
2.~.2.a.
Prom
(25)
second
Since we are only interested in the behavior of the end body in the
Euclidian space, the outputs are chosen as the cartesian coordinates of
the end body :
h~(9)
...
(-d~
h 2(9) - d 2
h 3 (9) '""
+ (d 3- Yo)Sin92 +
(-d~
(d 3-Yo)COS92
Grouping
x~
= (91 ,
(d 3- Yo)Sin92
(2~)
*
= -(r~(x~
x3 ... -1
-~
(rx3
-~
ZoSin9~
d 4cOS(92+93 )
( 27)
d4sin(92+93sin9~
ZOCOS9~
9 2 , 9 3 )'
x2
d4sin(92+93COS9~
x2 -
9 2 , 9 3 )'
x3 "'"
(1~,
1 2 , 1 3 )'
(28)
(r2(x~)X2X2
)(Rx2)
(9~,
-1
+ VR2 X2
u
ro(x~)
- XRx3 )
(29)
558
( 30)
~,2,3.
2 L - 2 LO
(31 )
L
e.e.A. .(e)
l;(i, j;(S l. J l., J
by
formal
as
( 29 )
calculus,
by
the
(32 ).
1.
loop
system
( 10. a),
has q
== p + L
hI ,... a 4 (94
c
e4 )
h2 .... a 3 (9 3
eC)
3
C
e )
1
h3 - al,el
( 10. b )
c
as(es - 9 5 )
(33 )
a 2 (92 _ e C )
2
where the .set points e': , i .. 1, ... ,5, are chosen such that e~+9~+9~-O ,
and 9~ - 0 to ensurel.the vertical position of the stick, and Where the
a I ' I -1, .. ,5, are reals that will be adjusted later to ensure the
stability of the whole system.
ROtice that, by making the set points vary, one could also maintain the
stick in another position, or follow a given trajectory, (for example a
trajectory, minimal in the sense of a given cost function, and avoiding
559
obstacl.es ) .
2.2. Decoupl.ing and feedback l.inearization of the robot arm PAMIR
2.2.l.. Straight l.ine fol.l.owing
To fol.l.ow a straight l.ine, without specifying the reference model
on the line, it suffices to choose the cartesian coordinates for Which
this l.ine is for example the x-axis. Hence, after decoupling, only 1
input wil.l suffice to move on the given line. Moreover, since the
reference model is not specified, one can choose a stable linear
behavior to reach a given point on this line.
Thus, we shall try to decouple and to linearize by feedback simultaneously.
Before stating the. result , it can be easily checked that the Jacobian of the transformation: e ... h( e) defined by (27) is Singular only in
the two following cases :
or
(mod IT)(34)
= tr(RKl-1 ) det J
560
them, of course, to place the poles of the closed loop system; then for
the straight line following, it remains to control this system using the
inputs V, by an affine feedback, the affine term being computed to
obtain the desired asymptote. All these calculus are parts of the classical methods in automatic control, and we shall not go into details.
Let us focus on the fact that the control law is implemented on decoupled sub-systems and thus, the synthesis is easier by pole placement of
scalar input-output transfer functions, than by the general linear quadratic control methods.
2.2 . 2. The inverted pendulum
PropoSition 2:
the characteristic numbers are all equal to 2 and the system can be globally simultaneously decoupled and feedback linearized (no singularity).
The feedback control law is, as previously, implemented by pole
placement, and the 4-dimensional unobservable part remains bounded for
bounded inputs.
The matrix A is here, given by:
G1 .
,~
= 1,2,3
i
1,2,3.
wh~re
and
where
561
"fii
200V
i = 1,2,3.
(35)
In the non perturbed case, the given point is reached_~nder 0.2 s. (at a
mean speed of 211\,/S) with a maximum error less than 10
m.
In the case of identification errors, taken between -100 % and +100 % on
all the coefficients, the_ 3point is still reached under 0.2 s. with a
m.
maximum error less than 2.10
Finally, in the perturbed case, with independant random noises of mean
intenSity equal to 0.1 (rd. or A. according to the variables), the point
;is o!!~e more reached at the same speed, with a maximum error of about
2.10
m.
Remark that, since the closed loop system has the same dimension as the
original one, the observability remains unchanged (locally) by feedback,
and the stability of the closed loop system implies trivially the stability of the original one.
2.3.2. Inverted pendulum
As previously, one places the poles of the closed loop system to stabil-
ize the stick around its vertical poSition, and to satisfy the constraints (35) on the motors.
In the simulation displayed in Figure 6, the initial deviation from the
vertical is equal to 9 degrees for 9 5 and to 0.5 degrees for 9 4 ' to
become less than 2 degrees after a few seconds.
The robustness to random noises and identification errors is similar to
the previous case.
(D
1-'.
':tJ
20
30
u2
u1
40
20--30
40
0
10
-.3g01e+01
u3
-.g400e+02
.1071e+02
-.8725e+02
.g018e+01
50
50
60
60
60
-.319ge-01
10
0
. 6745e+00
271ge+00
674ge+00
10
. 4734e+00
.4741e+00
20
20
h3
30
h2
30
h1
40
40
50
50
60
60
60
?l
zttl
;.;;
ttl
t"'
z
,...Ij
););-
ttl
:;
z
Ij
Ij
<.n
Rl
(1)
'1
Qtj
~
"l
1-'-
10
-.467ge+01
7162e+02
-.9797e+02
10
.7424e+01
-.8732e+02
.7133e+01
20
u3
u2
30
u1
30
40
40
40
50
50
50
60
60
60
.... 333ge-01
10
.6722e+00
.271ge+00
. 6748e+00
10
0
. 4732e+00
.4742e+00
@?
20
20
h3
30
h2
30
h1
40
40
50
50
60
60
el
v.
C/J
tIl
~
C/J
...,-<
C/J
Z
tIl
>
::0
t""'
Z
Z
0
::0
."
!:J
('J
>
U1
CD
~"'S
1-"
"'l
.1010e+02
-.8815e+02
1182e+02
u1
40
40
50
50
60
60
. 6752e+00
o 10
. 4730e+00
.4741e+00
.
20
30
h1
40
50
60
u.
565
. 2898e+00
.7971e+00
. 7857e+00
80 160 240
.2475e+00 xl
.1000e-Ol
.,..7043e-02
x4
80
160 240
.7353e+00 x2
-.1421e+Ol
80 160
-.1570e+Ol
. 4226e+00
-.1742e+Ol
40
80
556ge+00
80
160 240
.7154e+00 x3
240
x5
. 9595e+02
ul
-.1595e+03
u2
566
As mentioned in section 2.1.2.b, the linearizing feedback makes 4 dimensions unobservable. Nevertheless, with the outputs (33), the stability,
in the senseof uniform boundedness of the inputs and of the state variables, can be checked on simulations.
Further theoretic works are
currently done to prove this stability property.
We
CONCLUSION:
have proved .in this paper :
that formal calculus (Macsyma, Reduce, Maple, .. ) was essential,
but that the graph's methods appreciably reduce the computation of
the feedback control law,
that the package provides a mean to compare the "ideal" feedback to
"simplified" ones, and to evaluate the robustness of these control
laws,
that the Decoupling, Perturbations Rejection and Feedback linearization methods could be applied to an important class of problems
in robotics, (see also (11], [22], [29]),
that, doing so, one can recover the classical
Systems Theory (see also [11], [29]),
methods
of
Linear
computa-
567
REFERENCES
[1]
~veloppement
[2]
[3]
D. CLAUDE : Decoupling
Letters, 1982, 242,248.
[4J
[5J
[6 ]
[7]
[8]
[9]
M. FLIESS, F. IAMNABHI-IAGlUUUGuE :Application of a new functional expansion to the cubic anharmonic oscillabor. J, Math.
Phys. 23(4), 1982.
[10]
[11]
[12]
of
nonlinear
systems,
mod~lisation
Syst.
et COJIIIIande.
COntr.
Hermes,
Docteur-In~nieur.
par
de
568
[13]
J . P. GAUTHIER, G. BORNARD, S. BACHA, M. IDIR : Rejet de perturbations pour un modele non lin~aire de colonne a distiller. in
~veloppement et utilisation d'outils et modeles
math~matiques
en
automatique, analyse des systemes et traitement du signal. I. Landau coordonnateur. Ed. CNRS, Vo1.3, 1983.
(14)
[15]
[16]
[17]
B. JAKUBCZY.K, W. RESPONDEK IOn linearization of control systems. Bull. Acad. Pol. Sci., Ser. Sci. Math., 28, 1980, 517,522.
[18]
[19]
J. LEVINE, G. PIGNIE :Exact finite dimensional filters via systems realization for a class of discrete-time nonlinear systems.
Syst. COntr. Letters, V(6), 1985.
[20]
[21]
[22]
S. NICOSIA, F. NICOLO, D. LENTINI :Dynamical control of industrial robots with elastic and dissipative jOints. 8th IFAC World
congress, Kyoto, 1981.
[23]
[24]
in
differential
geometric
control
cona
569
[25]
bilin~aire
[26]
mathematics,
programming
[27]
[28]
[29]
[30]
Int.
J.
and
Syst.
physical
570
ANNEX ~:
(24)
l.,)
~2,3 ... m3 (x
g3
~3,3 - m3 (x
g3
g3
2
o
~ ..
l., )
l.,)
"'),
..
), l.
V i,j
(d 3
3
+yg3 ) + I Z
571
whil
A . . = A..
1, J
J, 1
that
the
(~)
Lagrangian
is
V i,j
We note :
572
-dS(zO+dSSin94Sin9S)(COS(92+93)COS94COS9S-Sin(92+93)Sin9S))
+I:Sin(92 +93 )Sin94
"2,2 - m4{d4(d4+2(d3-YO)COS93) + (d 3 -yO )
.22
2
2
+ dS(cOS 9 4 Sl.n 9S+COS 9 5 )
2dSd4+(d3-YO)COS93)COS9S-(d3-YO)Sin93CoS94Sin9S
I:Sin 2 9 4 COS 2 9 S
+ 1;Sin2 9 4 Sin29 S
+ I:COS 2 9 4
"2,4 - -dSm4sin94Sin9S{{d3-YO)COS93+d4+dSCOS9S)
- (I:-I;)Sin9 4 Sin9SCOS9S
"2,5
= m4dS({(d3-YO)COS93+d4)COS94COS9S-(d3-YO)Sin93Sin9S+dSCOS94)
4
+ I ZCOS94
2 2
2
.22
"3,3 - m4 {d 4+d S (COS 9 4 Sl.n 9 S+COS 9 S )+2d 4 d S COS9S )
+
"3,4 -
~4dS{d4+dScOS9S)sin94Sin9S
"3,5 - m4dS(dS+d4coS9S)COS94 +
~4,4
- m4 d:Sin 2 9 S + I:Sin 2 9 S +
"4,5 - 0
2
"5,5 - m4 d S
*
**
Nonlinear feedback control laws are proposed for large angle attitude
maneuvers of a rigid satellite by using pulsed thrusters and reaction
wheels. The kinematic equations in the quaternions parametrization
together with the Euler dynamical equations constitute the mathematical
model of the system under study. The design of the control law is based
on the nonlinear input-output decoupling input-output linearization and
linear pole placement theories. Simulation results are discussed.
Keywords: Large angle attitude control, nonlinear attitude control,
linearization, stabilization.
1. INTRODUCTION
The problem of large angle attitude maneuvering of a rigid satellite
was widely investigated, good engineering references are [1] and [2].
Several investigation have been based on Lyapunov's stability theory
and nonlinear optimization methods as pointed out in [3].
In this paper the design of feedback control laws for attitude
maneuvering is achieved, by means of the modern nonlinear control theor~
arguing as in the sequel. The mathematical model of the attitude control
problem under study has the structure of a linear analytic dynamical
system, (1). For such a class of systems, starting from the earlier
basic results in [4] and [5], it has been shown in [6] that under
suitable conditions, summarized in section 2, a nonlinear static state
feedback control law can be synthetized to solve the one-one noninteracting control problem: i.e. each input of the closed-loop system affects just one output. A by-product of the theory is that, on the open
subset of the state space on which the control law is defined, the
closed-loop system is input-output diffeomorphic to a linear one. More
precisely the closed-loop system is composed of a linear controllable
dynamics and a possibly nonlinear dynamics which do not affect the
outputs. Standard pole placement technics can now be applied to get
573
M. Fliess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonli'lear Control Theory, 573-595.
1986 by D. Reidel Publishing Company.
574
~(t)
x(O)
= f(x(t
= xo
i=l
g.(x(tu.(t)
1.
1.
E M
y.(t) = h.(x(t
1.
Q.
1.
(1)
i = l , ... ,m
where the control u. the output yare vectors in~Q., and~m respectively;
the state x belongs to an n-dimensional smooth manifold M; f and gi's,
are analytic vector field on M and h. 's are analytic functions.
l.
575
I s1J.. (x)v.1
a. (x) +
1
j=l
(2)
such that, in suitable local coordinates, the new system looks like:
Ii'm(xm)
A system of the form (3) is certainly one-one input-output decoupled, that is each input vi does not affect the outP~ts Yj when
j " i.
The ensure that each input v. can really control the corresponding
1
'"
'"
(4)
In (4) we denote by L A the usual Lie derivative i.e. the derivative
T
1J
{L
L 1 h.}
g. f 1
J
(5)
576
um(t)
d + 1
L m h
m
f
(6)
It is useful for the sequel to specify a coordinates transformation under which the feedback system (1)-(6) looks like (3) and ,moreover,
results to be characterized by a linear input-output dynamics.
hl (x)
z12
Lfh l (x)
zl
Z
d
1
Lf hl (x)
l,dl+l
zml
h (x)
m
zm2
Lfhm(x)
d
m
Lf hm(x)
m+l
m,dm+l
t (x)
and define:
z
1
(7)
T(x)
Z
t(x)
It can be shown ([6]) that, under the invertibility hypothesis of A(x)
on U C M and for suitable choices of t(x), the nonlinear map T(x) de-
577
0 . 0
O 0
zl +
zl
O ..... 01
0 ..... 0
z +
m
o.......... 01
f m+ l(zl''z m+ 1)
0 .... 0)zl
(1
0 . O)z
vI
(8)
0 .. 0
(1
+ i=l (g.)
l(zl''z m+ l)v.~
~ m+
Thus in the new coordinates the feedback system has exactly the
decoupled structure depicted in Fig. 1, moreover each i-th input-output
channel is an open loop chain of d i + 1 integrators. We also note that,
if d 1+ ... +dm+m is strictly less than n, the cedoupled system also includes an unobservable part, possibly nonlinear.
Since the possibly nonlinear part of the system in equation (8)
represented by the coordinate z 1 clearly does not affect the inputm+
output characteristics of the system, we may replace the system by the
purely linear part, whose transfer matrix is:
1
---d +1
l
0 .. 0
w(s)
1
dZ+10 ........ 0
o
s
0 ........ 0
578
procedure to obtain the stabilizing feedback control law for the system
(1). The resulting control law has the form:
u(x)
(9)
1-.
I
I
I
I
I
CO~R~LC~
Fig. 1.
We note that in (9) the control law can be modified by changing
the matrix K.
3. APPLICATION TO ATTITUDE CONTROL
In this section we synthetize the control laws both in the gas-jets and
reaction-wheels control modes.
579
Mathematical models
The most direct way to characterize an attitude maneuver of a body
consists in specifying the direction of the axis about which the body
must be rotate~ and the required angle of rotation, say ~, about that
axis (Euler's theorem). I t is well known ([ 1]) that denoting bye. 's,
1
xl
x2
-Z
x3
-WI
-w 2
-w 3
wI
w3
-w 2
xl
w2
-w3
wI
x2
w3
w2
-wI
x3
(10)
where
x
cos
~/2
I
i=O
e. sen
1
~/2
1,2,3
2
x. = 1.
1
We recall that the quaternions parametrization is convenient because the representation is free from singularities and the attitude
matrix is algebraic in the quaternion components (thus eliminating the
need for trascendental functions). Moreover for our purposes the
quaternion parametrization is suitable in characterizing the control
law needed to perform a fixed maneuvre; this will be done here after by
specifying the quaternions of the actual frame with reference to the
desired one and looking for a control law which brings x to 1 thex. 's
o
and the wi's to zero asymptotically: i. e. we look for a stabi 1 iz ing control law
for the system represented by the kinematic equations (10) and the
dynamic equations in the considered control mode.
In the gas-jets control mode, with reference to a principal axis
frame, the synamic equations take the form:
ti\
W2
w3
-1
J l (J2
-1
J 3 )W 2W3 + J l u l
J-1(J - J l )W l W3 + J -1
2 u2
2
3
J 2 )w l w2 + J -1
J -1
3 (J l
3 u3
(ll)
580
where ui's, i = 1,Z,3 are the control torques acting on the satellite.
Similarly, in the reaction wheels control mode one has:
wI
Wz
W3
s'll
s'lZ
?l3
-1
J l [(JZ-J3)WZW3+JwzWllZ-JW3Wil31
-1
JZ [ (J 3 - J l )w l w3 + J W3 Wlrl3 - J Wl W3rll 1
-1
J 3 [ (J l - J Z)wlw Z + J Wl wZrl l - J WZ wlrlZl
-1
+ J ul
1
wI
-1
+ J Uz
Z
Wz
-1
+ J u3
3
w3
-1
J lUI
-1
J ZU Z
-1
J 3u3
(lZ)
-w
-w
-w
where the rl. 's and the u. 's, k = 1,Z,3, are the angular rates and the
l
W.
's,
easily derived from (10) and (11) or from (10 and (12). As far as the
outputs are concerned we will assume, for the computation of the feedback control law discussed in section 2, that they coincide with the
quaternions x. 's, i = 1,2,3. In the simulations which we will discuss
1
(13)
Lfh.
0, it follows that d l =d 2=d 3=1. Let
gj
l
us how consider the 3 x 3 matrix in (5):
for any i,j = 1,2,3 while L
-1
-J 2 x3
-1
-1
J 3 x2
Xo
-1
-J 3 xl
-1
J 2 xl
-1
J 3 Xo
J2
581
-1
AGJ(x)
2
X
2
2
J l (x o + xl)
J l (x ox 3 + xl x 2 )
J 1 (x 3x l - x ox 2)
J 2 (x l x 2 - x o x 3 )
2
2
J 2 (x o + x 2 )
J 2 (x 2x 3 + xox l ) (14)
J 3 (xl x3 + x o x 2 )
J 3 (x 2x3 - xox l )
2
3
J 3 (x o + x 3 )
1 .... 2
1
xl + I(fsx o - f6 x 3 + f 7x 2 )
- L;"lwl
fGJ(x)=
1 .... 2
1
x 2 + I(fsx 3 + f 6X0 - f 7x l )
1 .... 2
1
x3 + I(-f s x 2 + f6 x l + f7 x o)
- L;"lwl
- L;"lwl
where 1~12
(15)
3
L w.2 and
o
K
(17)
582
*,
3
1
n ker dLi h
n
i
i=l j=O
{O}.
x.
this implies the convergence to zero of the w. 's, i = 1,2,3. Hence the
1
Moments of inertia
of R.W. (kg o m2)
200.0
0.272
150.0
0.272
100.0
0.272
o -n01se
2
(rad/sec )
2
x
2
0
y
2
0
z
0
10- 2
10- 2
10- 2
583
control law (9) with A(x), f(x) and T(x) given by (14),(15) and (16)
enables to perform automatically the fixed maneuver which consists of
a rotation of = 2 radians around the Euler axis with unitary components in the principal axes frame at time 0; the feedback gains k .. 's
1J
have been chosen to fix, for the dynamics of the linearized part of the
system, the time constants at the magnitude of some seconds consistently
with the choice of the interval of integration 10- 1 < P. < 10- 2
(Pi ~ {O
for our purposes). The needed feedback control law turns out
J W3 W1Q3 - J w1 W3Ql
u3
= J wl W2Q1
(18)
- J w2 Wl Q2
584
IS,
x.
1,2,3 is assured.
w.
IS,
1,2,3.
585
,..reactionwheels
torques
Satellite
Model
gas-jets
torques
'---
GAS- JETS
CONTROLLER
REACTION
WHEELS
CONTROlLER
Fig. 2.
f--
586
~---:--
,,
,,
,
,
,,
,
,
--.. ------_. ----_.. ----.... --. -., -_... _. -_. i------.. -.t- ._... ---- --_.. _.~
1.60
--
_. -
.. -
-]~
1.40
-.-----~-----.-.--
1.20
,
- - - _..1 _____ ___
_.
_ ,.;
_~..
_.
<,_.' _. _
-- -
-~-
_______ ~_~
- - .. -
+. -
~.
.
.
--~--.-.-.!--.------.!.--~.--.--~
--
.. -----.--
..
.,.. .. _. _. .... _. _.
.
-----_-:---------:'_--------t--_---_
.
---- -_. -_. - .. -.. ... -. . -_____ ._ .. __ __ ____ .. !:__ _______
..
,
,
,
_.- -
,
,
,
-_. -. _. ---,. . -
_.
-_
~.
.. -
-- - --- -
".
+. -
.. -
1.00
0.80
0.60
0.40
0.20
-- -
- - .. -
-_
_... - -- -
,
~~.
,
--
---
- --
-~
--
~-.---,..
_~
0.00eL.r-~~'-~~~---n~--~~~~;r~=:ra~--~~~--~La--'~~-'~.8
6.00
,
,
--.- ....
__ .. _--- .. --5.20
,
,
,
"
'
"i-- ________ __ .0______ ; ______ . __ _
--t'-_.
--"
____
4.40
:
:
:
:
:
:
:
:
:
! ____ .. _
3.60
,
,
,
,
,
2.80
U1
2.00
1.20
,
,
,
,
,
.
.
0.40 -- - --- --:. - - -- -. ---:- ---- --"-. :-- ---. -- -- :---- .. - --- i-- -- -----1-- ----- -- -: ---. _. ---. :., _.- -- - _OM: -- --- ____ _
-0.40 -.. ------1- --- --- .. -:-- --- --- --;--- . -----:-. --- ----"; __
-!. --- --.---:- ---. --- ~-1.20
1------.- 1----- --~ to. --- ----- ~-- -2.080
-
~-.
-. -
~-.
--
.-
---~-
--.~""
... ___
-- -
--. -
.~o
~-
____
-_ -
~-.-
0-
~-
-- ---
----.--- -{---- -- -
~-
---
____ -
~_
__
OM
0- __ 0
~-
__
___
______
_________ _
,--::-<E:::-:-::--:-::-::T::-:-::T---::-::j:::--::::r-:->{-:::-:-::-:-----::.
.
--i--" -----1 "--------
OM - - - - - - ; - - - - - - - -
0-
-: -_. -. -
---
-. -
--
.8
1.50
1. 30
1.'10
0.90
0.70
0.58
U2
0.30
0.10
-1.00E-01
-0.30
___
--
--~-
-~.
__ 0 ' _
-- -
-----
_.'
-.. -' ---:--------. ': ----_. ----:._- -- - ":' -' -------!- .. ------.. -----.---~.-----~-.~- i------ --i---
~ -_. -"-- -.. t-- --. -"!- --. ---""-
.~
~.
----
.... -; . -------.- ------.. +------ --i----- ----j-- .. ----- -j-~-------- j----.- ---+. ---.----
oJ_ _ ____
_~
____
20'.1
__ __ -
,
,
;rIj:JjJjj
-0.58 .
3.00
2.60
2.20
1.80
1. 40
U3
1.00
0.60
0.20
-0.20
-0.60
,
,
_____ - - __
~._
_~
__
_ _ _ _ _ _. . . . _
30'.1
- __ T- . _ -
~---
1".1
- -
~-~~.
_____ 4
____
50'. e '
- - - -..
_.
~~
______ ____
_________
~----.
_.
__
~.
__
____ _
:f:I:::!~:~:J::j:I:~::lt:)/:ji:
.---: :::::::::::::::;::::::::::r::::: :::r:
::r:::::::::
:::::::1::::: :::
-1. ....
.~~~~~~~~~r-~~er.~~~~~~r.r~
Fig. 3
587
2.00
,
,
1.ao
-----i---------- .---0----- .--------- .--------1.60
1.40
1.20 ::::::::-~-::::::::r::::::::r:::::::T::::::::r::::::::r:::::::::
. ___ --__ - ___ -___ -- _____ ". __ - _______ -___ _______ . __ __________ ___ . _____________ . -- _0- __________ . _____ _
1.00
,
.'
,
------------------ --------o.ao ---------,------ ---.,----------.,----------,----------,----------,---------0.60
0.40
---- -- ... ~- ---------~--- -- -- -- --~ --- ---- ---- -- ---- ----. -- --0.20
.'.
0.00 0
.
.
------ --1----------1----------1----------1----------1---..
__ -
.... _
I
I
__ -
_ _ _ ... _
,
,
__ -
,
,
_;. -
___ w
_ _ "0_ -
,
,
__ -
- _ , ___ w
I
I
_ _ _ _ _ _ 'O_ -
_. -
__
__ -
__ -
__ -
__
Ul
u3
10.00
8.110
6.00
4.00
2.00
0.00
+2.00
+4.00
+6.00
+a.oo
+10.0 0
10.00
-a. 00
6.00
4.00
2.00
0.00
+2.00
-4.00
+6.00
+a.oo
+10.08 0
,,
,,
,,
-- .. -.--------.------.--- .. --------,
,
,
-1----------1----------1-------
. :----------1----------;---------
10.00
a.oo
6.00
4.00
2.00
0.00
-2.00
-".00
+6.00
-a.8(J
+10.
Fig. 4
588
2. 00
1.80
~,,-.-.,----..,------~~--.,..-~~------,-----------.-~
........
1.60
1.40
. ..
.. ,
,"
,.,.,.
. -- -,-
,--
1.20
1.00
0.80
0.60
o 40
0:20
.. -- -_ ...'
.
------i-
.----~--
I'
-------i----
--: ...... --
-{, ..
. ..
,"
.-_.-.- ...... .
~.1
:~~4--::";::";:'-~~~~--
. .
50.0- 60.0
~&0':V-
90.0
._-""C"
:0.00
8.00
6.00
4.00
2.00
0.00
-2.00
-4.00
-6.00
-8.00
: "
,......
nlallHIH,-_ , .. -....
;'.
- ~ -- - -
-.:.
--------_
-10.00e.l
10.00
8.00
6.00
4.00
U3
-10.
Fig. 5
... - - -
.0
589
0.20
0.18
0.1.6
0.14
0.12
0.1e
8.eeE-02
6.eeE-02
".0eE-02
2.0eE-02
0.00e
10.0e
:
:
:
:
:
8.00 --.-- --- -~- ---------~ - -- ---- -~-- --------.~- --------- ~-- --- ---6.00 --. _____ -~---- -- ----~ .- _. ---~------ ----~--- -------4-- ----- -____ woo -~ _____ -___
... 00 ------- --~----- ----- ~ - ____ ow). _____ -___ ~-.
-_.-------:
_.---"":---."
----:------2.00
ul
,
,
.- ---,
,
. ,
0.00
-2.00
- ... 00
-6.00
-----~--
.
..
-~-----
_w
--
:---- ---.-. i-----. --- -~- ---- -----~-- -- -----:_ -__ -_____ ~ __________ ~ -________ ) _____ . ___ _
.
.
,
.
-. ----- - .
----- .
- ---" ---- --- -- ----.
--~--
---~
' "
.--------1----------1----------
._-------,----------
::::::_::E::::::T:::::::-
---------.--------.
u2
...
10.00
,
,
,
,,
8.00 -----_._- . ---------- --------..,----------""---------- ._------- ..,,---------- .., ---------- --------- .. ---------6.00 ---------i------------------1----------1---------- ---------i---------... 00 . __ ._----:------------------i--------~~ ~~ ~ ~.~~~ r~
~~ ~ ~~~~~ 1~~
~ ~ ~~~ ~~ ~
2.00
,,
,,
0.00
:
:
::
: : :
-2.00
- ... 00 ~~~ ~ ~~~~~],~~:~ ~ ~~ ~~~~_.
-6.00
. -------:----------:----------:--------- :----------1----------1----------1----------8.8' ---------..;--------- . . : - ----- .. .!---.------.!------.---.!--------- ----- .. ---- ..... --------.---------- .. ----------
-E::f
-10.00e.l
1"'.,
r:;,,',
t8.00
8.00
6.00
".00
u 3 2.00
0,00
-2.00
-4.00
-6.00
-8.00 ---------:---------- -te.00e.l
~
r:;.,.
",,.1
::
"",I
:>.,."
D' ."
:::
Fig, 6
.,
590
~.
2.00 ~----~----~----~~--~~~~~--~~--~~----~----~--~
1.80
.............................................................. j.......... j....... ,
1.60 .................. j .......... j.......... j-- ............................ j .......... j .......... j--....... .
~ :::
1.00
0.80
0.60
0.40
0.20
0. 0ea
..
---------1--
---:----
- .- . :.,; ....:___
--'-.--~----------:---
. ________ _ .I.
_ _ __ _ _ _ __
: .................. .
---------:i----------t---
J ___
--'----~---.------
~------_-
-:--------_.
.,
_____ ....
_____ ___ ..., _
,,
l. ,.......-~:.....---:rrh-.,.;....._~h_.::::;;;:;;:::~!+=::::::;;;.;t;=~;=::::m;;:;::::;;:;j. 0
.
-,---------.., ---,
-.; - ---- -- -- - ~ ---.J __________
J____
J ____ _
Ul
0.00
-2.00
-4.00
-6.00
-8.00
-10.
U2
u3
10.00
8.00
6.00
4.00
2.00
0.00
-2.00
-4.00
-6.00
-8.00
-10.
- -
.,
~-
,,
- - - - - - - 'Y' - - -
t----------:----------
10.00
8.00
6.00
4.00
2.00
0.00
-2.00
-4.00
-6.00
-8.00
-10.
Fig. 7
591
~--~----~:----~:----~:~--~:----~:----~----~:----~:----,
1.00
e.80
_. __ ----- ~--". ---_. i- __ --- ----~------ --- v~_. ---- - ---~- - - - --- ~ _. ---- ---- -- -- ... - --~. --. -- .---~--- --- ----_... --- -~-- --.. ---- -!~- -;- ----- ~- .-. -- ---- ~--. ----.---~-- - - -. - - --~---- -_ .. - - -_ .... ---. - ~ - - _. --- - --~ -- -- _.-
0.60
e.40
0.20
- -- ~ --- -~- _.- .--- - .~- _. ------.~. ---~-. _.- ~.-- ------- :----- ----- ------ -_.. ~---. --- -- .~---:
:
:
::
::
. --.- -- - ,,-_ ... -----,,-.~.- -_. - ~~--- -" ---- ~~. -.-- ----~- ---- --- --~-. ----- --- --- - ------t--- ------i----- --_. ~
-----. -- .~- -- _. --- ~- ----- -- - -~----------~- _.- - ----.~ ~.- --_. -- -1- ------~ - -.----- ---~---~--. -- -i-- __
:
::
.......... 0. . . . :. 0 :...... :..... :
.. 0 .......... : .. 0.0 .
.~
---------1-----I
---
_._--
..
0.eee.ILy--~~7r--~~,_--~L
.
.
. r--.ni,---~C;::~i;~~~tiF~~~--~~~~_J
.
.0
Ul
U3
2.00
1.60
1.20
0.80
0.40
0.00
-0.40
-0.80
-1.20
-1.60
-2.000
- - ' - - ____ A
_ _ _ _ _ _ _ _ ._~
-----,-_.-----~------ .--~------.- -~-.--------~----------~ ... -.----.!---.---. _. - - ~ __ - - - - _._ - ~ __ - __ - _0. ~ ______ _ . ~_. ________ ~. __ __ ___ ~ . __
-~ -
-:- -
-~-----
. J ______
-,-
---1---
_.
~.
1.00
0.80
0.60
0.40
0.20
0.00
-0.28
-8.40
Fig. 8
-. -
+! --- +-----
------:-------+1--.--- __
1.00
0.80
0.60
0.40
0.20
0.00
-0.20
-0.40
-0.60
-0.80
-1.000
_1 ___ -
y --
-t----- -
592
2.00 '"""'""--,.----,.--1.80
............. . ........................
1.60
.... , ................... , .......... , ..
~ -
- -
..
~.
_. -
-.
~.
1.40
1.20
1.00
0.80
0.60
0.40
...
... -
-; -
..
~ -
-;-
... -
--_._--.--------
.-
-,- -
- . - - . -,_ .. - -- - - -,_. _.
---_.-:----------:----
.......... ~
__ J _ _ _
-- -- -. - ,. - -- . - . _.
_ : __ __
_ ____
~.
__ . _ ._
......................
.
0.2l__~j~-~m~--~.:~~~~~
=~t~=~~~==~~'~=~v=9~~~
...~..~
..
.~
0.000
,0
4.00 ~--~----~----~----~~~~----~--~----~----~----,
,
,
3.20
.. ..........
..... , .......... ,........
., .......... ,...
. ..........
~
2. 40
1 .. 60
0.80
~iii
. . . ) ........ :
' ....
<.........
..............
~........
0.00
-0.80
-1.60
-2.40
3.20
,,
,,
,
,
----------.-_
, .. _----- .., ---------.
-.. ;_. _... ----:------_. --1- ----_. ---1- .-_.- --1----------1- --------- ----------: ----------:- --- - -- -- -:- - - --- -- --:- ---- -- -- - - ----- ---- -- ~~
. -.
,
,
------- .. ----------,.
----_._,--
_... __ .-
--1'---------1
. ,
--------.---------.. --_ .. _----"
~
----------.
, ---- - - --- - - _.
-- - -- ---- -_. -- - -- - -!.
,
----t---------1---------- ----------t--------
.::::::;:::::::::~.:::::::):::::::::T::::::::L::::::L.::::::::1:::::::::: :::'::::::~::::.::::'
-0.49
-0.80
=~ :~:
,,
.,
,,
-2.9~,~-,~r-~~~~~~,_~~_.~r_~~~~--~r_~.e
Fig. 9
593
2.00
,
,
1.80
,
.,
.
,
1.60 .. ---- --i-... ------1.---------1 .... ------1----- -- -~ ---------,-------~-". _0-_---- 1. 40
1.20 --------.-'. ------_.----------- ...--.-----_ ..... -.----- .... --- .. _-_ ... -._ .... ----
.
.
--- ------_ .... --_._--- .... - -------- .. ----------.-_.-.-.--- ._.
1.00 ---_.---- -.
.
----,_
..
-.
-,
-_.
--...
-"
..
---...
-. ---_. ---, _ ... _. _. --,-- ---- - --0.80
0.60 --- ~ -- - --:~ ---- -- -. -: 0.40 - -- - - --- --'. -- -... - --' ... 0.20 ---- .... - .. ---_ ... __ ----_._-_.,...
0.0'\).,~~~:~-=~'r-~:~~~~~r=~~~~~~r_~~~.0
- -
~_ -
- -
- ~ -
___
I
I
~ .... - -
_: ___ ______
_. ______
....
...
,
100.00
60.00
20.00
~1 =~::::
-100.00
-140.00
-180.00
-220.00
-260.00
_._-
- l' -
_ -
_. -
- - - - - -
.
......
~_
,
,
.. - - -
- -
- -
______
.
I
.
,
.. - -
- -
- -. - . - - -
- -
_. -
___ ___ . __
.
.
.
__ _ _
_ ____ J. _____ .
_. _. _ . __ _
-~~~r~~~~~~~~~r~~~~-~~~~r~~~~~~~~~r~~~~~~~~~
.
.
.
- -
..
-:.
..... ....
- -":._----.-.
:----------1- ---.--- --:' --.0..
_____ ":"___
__ . . : __________ : _____ _ . . : . - - .
_ J
_ __ J_
:...
-,. - .- .. - ---,,--"
_
--t-
.. _.1_.
_ ____ J
. ......... , ....... .
-300.000L.r--,~~~~,---~~--~~--~5~0~.~0--'b~0~~~0r-~(0r.~0r--v80~~.0r-~9D0L~0~~~m!.0
50.00
30.00
10.00
-10.00
~30.00
~3 -50.00
-70.00
-90.90
-110.00
-130.00
-lS0.00e.
.
.
,
. .. _--,----_._-.
_._----.,.----------,-----_
.
.
50.00
30.00
10.00
-10.00
-30.00
~2 -50.00
-70.00
-90.00
-110.00
.
---,,--.- .... _-
.------..
... +
. -.. . j-....-.-.+-...-...-j........-.j.-....-...!..........1....... __ .
-1--" --. --i---'- -----1----------;-. _..
-I::::Y:--j:r--r:j::rII
w
--
::;;~-~I:l::lI::r:::j:::l-d:.0
Fig. 10
594
..
"
~-
...
:;::1:)1:
-
~-
- - - - - ."t. - . - _. _. - -
....
- - - . - - _. - -
.....
...
...
ul
___
_________ __
10.0e
8.00
6.00
4.00
2.00
0.00
-2.00
- .... 00
-6.00
-8.00
-10.
...
u2
4.00
2.00
0.00
-2.00
- .... 00
-6 . 00
-8.00
-10.
u3
10.00
8.00
6.00
.... 00
2.00
0.00
-2.00
-4.00
-6.00
-8.00
-10.
...., - .. __ ....... .
________
__ -
______ A
_______
__
~---
_______
_________
____
_____ :
_________
,,
,,
- -1- -- -- -- -- --:- ---- _. -- -1--- -- - -- --1-'- - - - - -. -{- --- .. --- -i' - -- --. -- -1- - - . - - '.
,
I
--~-
_________ _
J, - _., - --
. +:.:::::+:::::::: ::::::::::;::::::::;:::::::T:::::::T:::::::j::::::::::
---~--
_. -
,_, -
_ ... '
_,
___ -
J-., - - ' . - -
-~--.
,., -
~-_
- - - -- - _. 0
_0
-~~ ~.:
.. _ _
_ _
__ 0
- .. ,- - - _. - ! - . -- _. - .. - : - - - -- - - - . ~ . - - -
___ -
. _., -- -- - -- - - '
~
~.
_. -
'
,
~
--- - _ ... :
'
-------- -_.. ----, --- -- -------,--i,---'- -------t---'
_.- ------ _. ----_. --~.. .-.----1- ------ --"1----- ------ -1' -----' ---
_. --. - -- . -~
Fig. 11
-j- -
r------ ----------
595
REFERENCES
[1]
[2]
[3]
[4]
[5]
[ 6]
[ 7]
[8]
[ 9]
Singh and
Araujo: 'Asymptotic reproducibility in nonlinear
systems and attitude control of Gyrostats'.IEEE Trans. Aerosp.
and Elect. Syst., March 1984.
Dadugineto
M. Fliess and M. Hazewinkel (ed;.), Algebraic and Geometric Methods in Nonlinear Control Theory, 597-621.
1986 by D. Reidel Publishing Company.
DADUGINETO
598
Level
control~--~
()
~ -
(Cb + CO)
00
Ql;
. q(t) - crt) -
cCtl.q(t)
with
599
(2.1)
600
DADUGINETO
(Fig. 2):
Paper pulp
basic weight
flow - rate
R./h
(g/ mZ)
machine
(m/mn)
Time
constant
(sec)
~ain
(g/m ). Cl/h)
30
25
21
18
15
14
Time-delay
(sec)
1
0.9
86
B4
0.75
0.7
0.65
78
76
74
O.B
eo
601
x(t)
x(t)
yet)
x(t) + wet)
(3.
DADUGINETO
602
x(k + 1) - x(k).e((a+n.u(k.4T) +
+ [1 - e(a+n.u(k).4T)
J.( a+n.ulkj
b.u(k)
"'T'T"
{C, ... R - 1}
41 (t)
r
1 -
2r
ex
These 2R functions set an orthonormal system in the
closed interval t(C R 1]. Each function is constant during the
time interval 4T-2- . Each term of equation (3.1) can be
expanded in a serie:
1
J x(t).lIIl(t).dt C
- OU)
N-1
1II1(k).[x(k + 1) - x(k)]
k-D
(3.2)
Jax(tl~l(t).dt -
603
N-1
~
k-o
DCk).P(k,l) +
10
DCkl,PCk,U - XU)
C3.3)
N-1
Jau(tl..l (tl.dt
1
~
k-o
C3.4)
- UU)
1
JauCt).x(t)~lCt).dt -
N-1
~
k-o
UCk).X(k
1) +
10
~
N
( a
U(k).X(k
1) - WCl)
C3.5)
C3.6)
n )
DADUGINETO
604
The computation of the eQuation (3.5) is greatly simplif ied by a judicious choice of the control signal (for example a
Walsh function). So, the transformation of the equation (3.1)
can easily be expressed in machine code and then can be
computed very fastly. So, it is very interesting in the cases
of on-line estimations.
(1)
ill
~
I/,
(D Q"3.SL/H
.{o
(2)
Q~13.
5L/H
I
lJJ
'<t
process
,.
model
SEC
o~--~--~----~--~--------~--~--~
200
600
1000
1400
605
x(k + 1) - [AO +
y(k)
~
i-1
Pi
(U ,
- C.x(k)
u. (k) R
1
; y R
a(Q).l:(k) + B(Q).u(k)
".l:(k)
Y (k)
(4. 1 )
{ a(Q) -
1)
i"'O
q
B(Q)
~
i"'O
(4.2)
606
DADUGINETO
With (4.1) and (4.2) and the introduction of a supplementary state component we obtain :
[~(:+1) J -[[ :0 OJ
i=O
y(k)
U(k).Qi(k)
Qi(k)
i""l
[:i :}
[ :1J]
a
a
[~:klJ
(4.3)
knowled~allo~
...
Ail,'D' -
[l::
607
., -[iJ
C -C1 O:J
Thus, we write a relationship between the output y(k) and
the previous inputs of u and Q
n
Q
y(k) -
~ ~
~ ~ (a 1j . Yb (k - 1).Q(k - i)j+
1-1
j-o
(4.4)
608
DADUGINETO
elk)
a 10
a nq
b 10
e(k+K)
nq
Y- X
+ E
e- (
-1
- . - 9 n z
-n
- G(z-l) .b(k)
q
with
9 i ~ ~ a ij Qj
j""O
Q.
b(k)
= elk)
~
i'""l
9i.b(k - i)
~b(k+n-l) ~
and we compute
b(j) - e(j) +
e(k+n-l)
~
i-l
9i.b(j-i)
for
j -
k+n to K.
609
x(k+1)
J.
x(k)
C x(k)
[one
0.025
-2'-----2-----~----2---
0.020
o.
~,/....--- ~
015
//5
----- - -
~-
I/;
0.010
~-t--
--+--
0.005
1000
2000
)0:
3000
610
DADUGINETO
Cone
o. OZS
...-'f':"---
O. OZO
O. Oi.S
,-/'"
"/
-c-
-j.-- - -
/~
0.0i.0
-+-- +--
0.005
i.000
-e----i!----
ZOOO
3000
Cone
O. OZ5
O.OZO
O.Oi.S
0 . 0i.0
-+-
- -+--
0 . 005
i.000
ZOOO
3000
611
Gain
0.025
0.020
0.0.1,5
0.0.1,0
--t--
--+--
0.005
YO)
.1.0
.1.2
Ti
110
conotant
(8)
800
500
400
--t--
200
--+--
YO)
.1.0
.1,2
612
DADUGINETO
~y(t)
~c.e(t
- Ly(Q
(4.5)
613
rCa) ==
[LX~a)J
+ int
and
(a) == Ly(a)//lIT
AT
a(av ) u (k )
+ 1) - a (a v ) . E (k) +
~ E (k
y(k)
y(a
(4.6)
).E(k)
with
n
nC ( [
a(a )
v
.~
I
v)
0
0
0
0
"-
nl
/
[( rCa ) + 1).n ]
v
C
[( rCa ) + 1). n ]
v
nd
614
DADUGINETO
[(r(Qy ) + 1).nc 1
"t(Q y ) -
[0
-y
'-
1 (Qy)
"tr(Q ) -
"trCQ
)(Qy)]
..............
n
l'
/'
[(r(Qy ) + n.n 1
c
The matrices a(Qy)' S(Qy) and "t(Qy) may be computed with
the relationships:
aO(Qy) - exp[ac (Qy).6Tl
1
,(Q
. y) - [
y -
"trCQ )
\,."
z(Qy ), 1
z(Qy ),
.... ,
1 - z(Qy )]
c times
nc times
4.4.2. DeriYing the unique non-linear model. In the method of
section 4.1., the state-space dimension n of the N linear
systems is assumed to be constant for all Q [Qmin' Qmaxl.
Because of the variation of the time-delay, this assumption is
not valid for system (4.6). If we consider nm x - max(ndi
(1-1, ,N)}. i t will be possible to rewri te ~he N linear
realizations with the same dimension by increasing the state-
615
_~CQ~~/AT ]
----
[ [1
0 ... 0 ]
.. ....
times
~~------
~""'"
-----~
max
GNnnc -1+int[~Q)/AT]
max
~Q)//AT
AT
n
o ..
0 ]
~Nnnc +int[-77Q)/AT]
max
times
with Nn
upper nilpotent matrix of dimension n
.n
max max
max
DADUGINETO
616
11
[~Ck
+
eCk +
12
g]- [~-o
P"j(QCk)l.A j +
~ P2,jCQ(k))'UCkl.B j ] .
. [l:Ck)]
eCk)
[ [ [ [, - .Y(Q~;'4T ] 0 ... 0 ]
----
y(k)
nc times
~Nnnc -1+int[~Q)/4T]
[ [YCQ)//4T
max
--4T
a ...
0]
times
n +int[.YCQ)/4Tl
LNnc
max
with p. 0 - 1
1,
and
9(k)
1 ,
~k
617
(s/m2)
........ _ - - ;..
.....,.....-1.."............~o-b
.._ ..-;..
..
1GI.81G1
1GI.61G1
1GI.41G1
6"."
IGI. "
Error
P ..... PER
Sp d
M..... CHINE
.25121 m/mn
L. .L.:i.n_Q'" Mgd.l
........... FFin_ Mgd.l
E .R.1Q~:i.v_ "',..0,..
121G1. "
18". "
24"."
12". "
18". 121
24"."
Tim.
(5)
(:x)
1GI.91
6121.121
-1.51
Fig. 9.
w.
<g/m2)
1GI.81G1
I~
IGI. 6"
".4"
PAPER
Sp __ d
L.
I
f
1GI.2"
". "
6121."
M.....CHINE
.35e1 rn/mn
.L.1n_Cl,.. Mod.l
12". "
1 B". "
24".121
1GI.91
-. 91 "
"
6121.121
-1.51
Fig.
618
B .. W.
DADUGINETO
Cg/m2)
121.8121
.x-'= ....
121.6121
I
I
121.4121
/'
----..-'-..-.,-..-.-..-.-..-.-..
PAPER
i
I
121.2121
MACHINE
Sp __ d
.45121 m/mn
L Ll.n_ar- Modal
A .A.f'.f'l.n_ Mod_l
121.121
6121.121
12121.121
-.91121 121
6121.121
12121.121
18121.121
24121.121
.R_la~1Y_
Err-or-
:3121121.121
121.91
I
I
I
-E-E-e-e-e--e-&-e-e-~<E-E-.-..e-e:-e-e-e--E
18121.121
24121.121
:3121121.121
TiM.
<S>
-1.51
B .. W..
(g/m2)
121.8121
121.6121
//'
--_._._._._.--.--.-.-
121.4121
121.2121
PAPER
MACHINE
Sp __ d
.325 M/mnft"
A .A.f'.pin_ Mod_l
.I
121.121
6121.121
12121.121
18121.121
24121.121
:3121121.121
Tim_
(S)
619
<g.lm2>
121.8121
.1(""
__
.--.-.-.--;...-.-----Io~
II
121.6121
121.40
PAPER
0.20
MACHINE
Sp __ d
.425 m/mn
" ,I\,p.p,,.,. Mod_l
I
121. 121
6121.121
12121.0
18121.121
24121.121
3121121.121
5. CXJICLUSION
AN)
PROSPECTS
DADUGINETO
620
[1] DANG VAN MIEN H., NORMAND-CYROT D., 'Non-linear stateaffine identification methods; applications to electrical
power plants', Automatica, Vol. 20, N' 2, p. 175-188, 1984.
[2] FLIESS M., NORMAND-CYROT D., 'Vers une approche algebrique
des systemes non-lineaires en temps discret', Analysis and
Optimization of systems (A. Bensoussan et J.L. Lions, ed.),
Versailles, Lectures Notes in Control and Information Sciences, Vol. 28, Springer Verlag, Berlin, p. 594-609, 1980.
[3] NEYRAN B., SAADE-CASTRO J., THOMASSET D., REYNAUD R.,
'Modelisation dynamique non-lineaire d'un procede chimique de
neutralisation', Congres SEE, Nice, 1984.
[4] HAMMARSTRDM L.G., GROS K.S., 'Adaptation of optimal
control theory to systems with time-delays', International
Journal of Control, Vol. 32, N' 2, p. 329-357, 1980.
[5] THDMASSET D., 'Representation d'etat discrete d'un systeme
multivariable retarde generalise', RAIRO Systems Analysis and
Control, Vol 17, N' 1, P 23-3B, 1983.
[6] KARAMAN V., FRICK P., MOHLER R., 'Bilinear identification
by Walsh functions', I.E.E.E. Transactions on Automatic
Control, Vol. 23, N' 4, p. 709-713, 1978.
[7] DUFOUR J., THDMASSET D., 'Definition and synthesis of
state-affine control algorithms: application to a varylinear
system', Int. Conference on Computers, Systems and Signal
Processing, Bangalore ,India, 1984.
621
M. Fliess and M. Hazewinkel (eds.), Algebraic and Geometric Methods in Nonlinear Control Theory, 623-632.
1986 by D. Reidel Publishing Company.
S. BELGHITH ET AL.
624
625
q
with
-1
Aq + BU I + F(q,t) u 2 + w(t)
all
a 21
a l2
a 22
a 23
a 32
a 33
a 34
(1)
an- l ,n-2
an-l,n-l
an-l,n
, ..... ,
where the state q = (ql, ,qn) belongs tomn and where the function
f(ql,q2, ... ,qn,t) : lRn xm -+m, wl(t) : m -+m, wn(t) : m -+lR are
analytic.
The set of admissible controls U = (u l ' u 2) is the set of all piecewise analytic functions with a finite.number of switchings, u l (.) Em
and with the control constraints : u;~n S u 2 (.) .:S u;ax
(2)
T o
2
{[a~ + Bq2 + YU 1 - T2 (t)]
+
o
K u 1} dt
(3)
s. BELGHITH ET AL.
626
Here Ti(t) a given function. We shall also write T2 for a.ql + 13q2 + YU 1
Remark. In the example under study, the coefficients a .. , a. and the
1J
= h(q(t
subject to
T fixed.
(4)
=I
i=l
p.(t) F1 (q(t),u(t),t)
~,F>
and
p(t) = -
aH
ail
p(T)
= ~~
(5)
(q(T
In this problem, the Maximum Principle gives the first order necessary
conditions for optimality.
aH (p(t),q(t),u(t),t) = 0
aU.
1, ,m,Vt [o,T 1
~ aH
dt
au.1
(p(t),q(t),u(t),t) =
Vt [ 0, T], i = 1, .. ,m
627
u.
1.
>
1.
0,
fields :
n
J.
at
and
k=l
aqk
k=l
This result clarifies the link between the first order Volterra kernel
associated with the functional J and the Hamiltonian H [4]. This
formulation is also useful from a practical point of view : The
total derivatives of aH can then clearly be systematically computed.
au.
1.
(6)
n
+
1.
i~Z
p. [a. I . q. I + a . q. + a . . I q. I + w. (t)]
1.
1.-,1.
1.-
1.,1.
1.
1.,1.+
1.+
1.
PI
=-
af
- (all + aqluz)PI-aZI P2
61 1)
628
S. BELGHITH ET AL.
P2
o
p.
3, x,n
(n )
~
aH
aU I
o
Assume moreover that ID
f(qI,q2, ,~,t)
0 , Vt E [a,b]
(7)
a.K
= cte
ya 2I -a.a 2
(-I) (n-I)
(_I)n an,n-Ian-l,n
an-l,n-l +
a
n,n
~ In the practical example under study, this assumption always holds.
In any case a system (I) such that f(ql, qn,t) vanishes on a subinterval of [o,T], becomes a linear-quadratic one in ul on this subinterval
and can thus be solved in the usual way.
629
More precisely, this relation is obtained by solving the following system of (n-l) equations in (n-2) unknowns :
Sa21 - aa 22
- a 32P3 + ya
21 - aa 2
- a 23 _ _-=a_K_ _
ya 21 - aa 2 - a 33 P3 - a 43 P4
o
4, ... ,n
But this condition which depends only on the parameters of the system
does not hold. Hence no control u2 satisfying (6) and (7) can be
optimal. We can now state :
Theorem: The optimal control u = (ul, u2) for the problem (1)-(2)-(3)
satisfying (7), is bang-bang as far as the second component
u 2 is concerned.
Remark
(i) We assume here a priori that the optimal control is piecewise analytic with a finite number of switching points.
There is actually no explicit hypothesis for the general
problem which could ensure that the optimal control is
"regular". Note nowever Sussmann' s important result [10].
on the time-optimal control-linear problem.
(ii) The bang-bang character is here proved using only the first
order necessary conditions. It is worth mentionning thqt
in general one needs higher order tests which can be also
expressed in terms of the Lie brackets of vector fields
associated with the system (4) [7,8].
v-
SIMULATION
630
S. BELGHITH ET AL.
changes sign.
an d
compute.the.t~mes
~f e11
a
um
2 x
u~~n
if not
e21
8'12
and2
8 2 where (/) ( t ) changes
s~gn.
Thus one
f~nds
3 on.
The time variations (Fig. 2). of the state variables (ql q6). of
the control vector (ul. u2). and of the inside air temperature T2. are
in agreement with what could be physically expected. We wish to emphasize the following points :
- The temperature T2 is fairly constant. below the standard of Ti.
- The control ul varies smoothly according to the low-pass filter model.
- The control u2 is as expected :
theshulters are open during the day. closed during the night and
the switching points occur in order to maximize the solar energy supply
to the collector wall. Furthermore the use of the 2 - dimensional control (ul. u2) instead of the single control ul. allows a 12 % saving
in costs.
.
631
U Uz
.1
..
..
..
I.'
..
..
"
tlhl
...
zo
...
...
~ !~
.,hl
REFERENCES
[1] M. ATHANS and P. FALB, "Optimal Control", Mac Graw Hill, NY, 1966.
[2] C. BENARD, "Optimisation de la representation reduite d'une paroi
thermique", submitted to Int. J. of. Heat and Mass Transfer.
[3] E. BOILEAU, C. BENARD and B. GUERRIER, "Comparaison de differentes
approximations des fonctions de transfert d~une paroi thermique"
Revue Generale de Thermique, 257, 1983, p. 391-404.
[4] M. FLIESS, "Lie brackets and optimal non linear feedback regulation"
Proc IXth IFAC World Congress, Budapest, july 1984.
[5] R. GABASOV and F.M. KIRILLOVA, "High order necessary conditions
for optimality." SIAM J. Control, 10, 1972, pp. 127-168.
632
S. BELGHITH ET AL.
[7] A.J. KRENER, "The High Order Maximal Principle and its application
to singular extremals", SIAM J. Contr. Optimization, IS, 1977,
p. 256- 293.
[8] F. LAMNABHI-LAGARRIGUE, "Series de Volterra et Commande OptiInale
Singuliere", these d'Etat, Universite Paris-Sud, Orsay, 1985.
[9] L. PONTRYAGIN, V. BOLTYANSKII, R. GAMKRELIDZE, E. MISCHTCHENKO,
The mathematical theory of Optimal processes, NY, John Wiley, 1962.
[10] H.J SUSSMANN, "A Bang-Bang theorem with bounds on the number of
switchings", SIAM J. Contr. Optimization, 17, 1979, p. 629-651.
INDEX
accessible
accessible
ad-conditions
adaptive model-following control amfc
adjoint equation
affine connection
affine system
algebraically observable
almost periodic vector field
analytic system
approximate linearization of nonlinear dynamics
by state feedback and coordinate change
approximate linearization
approximation of nonlinear systems
approximation order
a ttainal be set
attitude control of satellites
attitude control
attitude control
baker-campbell-hausdorff formula
balanced actions
banach-lie groups
banach-lie theory
bang-bang extremals
bang-bang extremals
bang-bang principle
bang-bang property
bang-bang solutions
basic vector fields
bass-gura formula
behavior of the transfer functions matrix at the infinity
bilinear system
billinear identification
boIza problem
boundary layer system
brownian motion
brunovsky canonical form
brunovsky canonical form
brunovsky canonical form
brunovsky form
c.a.d. nonlinear systems decoupling
c.i.f. linear system
calculus of variations
calculus ov variations
campbell-baker-hausdorff formula
633
13
398
350
527
326
209
349
47
496
62
184
182
511
216
423
65
573
578
455
459
496
491
334
339
361
627
623
84
298
126
514
604
389
162
500
196
235
266
86
545
191
348
378
334
634
cascade decomposition
causal derivation
causal
characteristic number
characteristic numbers
chemica:l neutralization pilot plant
christoffel symbols
christoffel symbols
closed-loop linearization
closed-loop transfer function family
co-state
codistribution
compensation of nonlinearities by feedback
complete system
cones, lie wedges
conjugate points
conserved quantity
conserved quantity
controllability lie rank condition
controllability
controllable
controlled invariance with singular control
controlled invariant distribution
controlled invariant distribution
controlled invariant sub manifolds
conversation law
cost functional
crouch's theory of finite series
crouch-byrnes conditions
curvature tensor
decomposition
decoupling feedback robot
decoupling perturbations rejection
decoupling via dynamic state-feedback
differential operator
differential representation
dimension minimal
dimension of a distribution
dimension
directional derivative
discrete-time
discretization
distribution
distribution
distribution
INDEX
276
374
4
547
191
598
210
79
302
304
390
122
525
258
72
332
393
397
259
421
51
149
405
433
402
393
325
27
60
84
257
559
546
142
416
35
13
116
419
372
448
423
116
122
419
INDEX
distribution
dynamic compensator
dynamic output feedback
dynamic precompensation
dynamic state-feedback
elliptic point
energy management
envelopes
equivalence to polynomial forms
estimation theory
euler-lagrange equations
euler-lagrange equations
exact linearization via dynamic state-feedback
exponential lie series
extended hamiltonian
extended kalman filter
extended linearization
extended-linearization inverse systems
extended-linearization
extremal
extrinsic geometry
falb-wolovich rank condition
family of extremals
fast walsh transform
feedback control problem
feedback group
feedback linearizable
feedback linearization
feedback linearization
feedback linearization
feedback linearization
feedback linearized and output block decoupled system
feedback transformed
feedback
finite automaton
finite lie rank
finite zero dynamics
finite zeroes
flat
fliess' local approach
fold point
formal power series
formal realization
formal vector field
free monoid
635
443
122
318
313
122
364
624
327
257
487
358
397
135
334
399
298
287
292
285
366
77
101
326
603
299
245
524
227
523
545
546
228
396
432
516
33
169
160
84
17
364
17
19
35
34
636
INDEX
160
395
450
367
366
413
374
327
344
553
580
578
90
374
33
512
360
266
175
160
15
270
243
22
257
7
243
419
205
172
172
15
550
60
375
496
326
513
374
382
489
364
277
597
INDEX
637
184
426
47
160
487
69
101
5
46
166
443
443
79
419
432
147
317
172
299
558
560
561
468
106
117
500
553
499
383
496
384
79
263
493
495
488
160
143
383
512
372
444
36
57
18
638
INDEX
lie rank
36
lie rank
447
lie sub semi groups
72
lie-brackets
374
linear analytic realization
417
linear canonical form
248
linear connection
79
linear delayed process with variable parameters
599
127
linear model matching problem
linear tychonov theory
164
linearization families
286
linearization of a nonlinear dynamics through diffeomorphism and feedbackl83
linearization to controllable canonical form
245
linearization
181
linearization
-576
local accessibility rank condition
69
local analytic realization
19
local canonical form
115
local construction
14
local controllability
55
local diffeomorphism
244
local feedback linearization
234
431
local input-output decoupling
local linearization
311
local realization
33
local realization
8
locally accessible rank condition
58
locally accessible
56
locally controllable
56
locally controlled invariant distribution
109
locally controlled invariant with singular control
148
locally minimal
13
locally weakly controllable
423
locally
433
306
macsyma
525
macsyma
545
macsyma
500
malliavin calculus
123
maximal controlled invariant distribution algorithm
147
maximal controlled invariant distribution algorithm
502
maximal lemma
626
maximum principle
326
maximum principle
353
maximum principle
381
maximum principle
INDEX
maximum principle
mayer fields
mayer problem
minimal time problems
minimal
minimaJity
minimum phase systems
minimum phase
moment map
morgan's problem
morse series
morse theory
morse-bott theory
multiplier
necessary condition
nerode factorization
neural-endocrinian model
noether's theorem
non commutative power series
noncommutative power series
noninteracting condition
nonlinear canonical form
nonlinear compensators
nonlinear discrete time system
nonlinear discretetime system
nonlinear luenberger observer
nonljnear model matching problem
nonlinear observer
nonlinear observers
normal extremal
normal rank
null extremal
null normal extremal
observability indices
observability of the linearization family
observable
observable
observable
observation algebra
observation field
optimal control problem
optimal control problem
optimal control
optimal control
optimal control
639
389
359
626
349
51
398
160
172
495
101
492
489
491
326
390
15
192
397
499
512
III
269
304
432
411
270
148
89
297
326
11
326
326
80
288
13
51
80
46
46
389
396
347
381
392
640
optimal hamiltonian
orbit theorems
output q-reproducible system
output controllability
output feedback
paper machine
parallel decomposition
parameter identification
parameterized vector field
perturbations rejection
phase portrait
poincare-bendixon theorem
poincare polynomial
poisson bracket
pole assignment
polynomial affine system
polynomial systems
polynomial vector field
popov-kronecker invariants
power series approach
power system stabilization
power systems
projective hilbert space
pseudo-hamiltonian
pseudocompensation
pseudolinearization
pseudolinearization
pseudolinearization
rank volterra series
rank
rational generating series
reachable set
reaction-wheels control mode
reaction-wheels control
realization Euclidean
realization ~ complete
realization of F
realization theory for nonlinear systems
reduce
regular controllability distribution
regular optimal control
regular point
response map
response map
restricted block decoupling
INDEX
393
441
132
107
301
616
276
601
472
545
163
252
492
393
576
418
45
47
110
22
534
523
493
389
316
209
295
312
25
7
513
56
583
578
10
7
5
3
525
109
395
123
51
434
INDEX
641
574
83
132
554
533
545
530
523
554
161
476
424
471
441
142
6
24
173
473
424
37
362
259
382
402
336
392
116
443
306
493
464
165
299
605
108
108
182
121
361
62
4
107
169
464
642
structure at infinity
sufficient conditions for optimality
sussmann conditions
swithching times
symmetries hamiltonian system
symmetries
symmetry
symplectic submanifold
system inversion
system simulation
system's graph algorithm
the duffing equation
theorem poincare-birkhoff-witt
thermal control
thermal model
thorn stratified mapping
torsion free
torsion tensor
trace class
transition
truncated automaton
two-step-transformation
tychonov's theorem
variety
varylinear systems
volterra kernel
volterra kernel
volterra kernels
volterra kernels
volterra linearization
volterra series approach
volterra series
walsh functions
weak controllability
weakly controllable
weakly controllable
word
word
zeitz canonical form
zero at infinity
zero dynamics
zero dynamics
zero structure at infinity
zeros at infinity
INDEX
147
356
66
629
392
55
393
405
292
552
550
519
38
623
624
362
84
84
489
517
516
93
163
46
605
375
377
26
374
185
23
289
601
259
423
53
34
512
258
168
160
171
126
126