Académique Documents
Professionnel Documents
Culture Documents
Linear Transformations
6.1 Introduction to Linear Transformations
6.2 The Kernel and Range of a Linear Transformation
6.3 Matrices for Linear Transformations
6.4 Transition Matrices and Similarity
6.5 Applications of Linear Transformations
6.1
W,
V ,W : vector spaces
V: the domain ( ) of T
W: the codomain ( )
of T
Image of v under T ( T v ):
T ( v) w,
then w is called the image of v under T
(For each v, there is only one w)
The range of T (T ):
The set of all images of vectors in V (see the figure on the
next slide)
6.2
The preimage of w (w ):
The set of all v in V such that T(v)=w
(For each w, v may not be unique)
The graphical representations of the domain, codomain, and range
For example, V is R3, W is R3, and T
is the orthogonal projection of any
vector (x, y, z) onto the xy-plane, i.e.
T(x, y, z) = (x, y, 0)
(we will use the above example many
times to explain abstract notions)
Then the domain is R3, the codomain
is R3, and the range is xy-plane (a
subspace of the codomian R3)
(2, 1, 0) is the image of (2, 1, 3)
The preimage of (2, 1, 0) is (2, 1, s),
6.3
where s is any real number
(a) Find the image of v=(-1,2) (b) Find the preimage of w=(-1,11)
Sol:
(a) v ( 1, 2)
T ( v) T ( 1, 2) ( 1 2, 1 2(2)) ( 3, 3)
(b) T ( v) w ( 1, 11)
T (v1 , v2 ) (v1 v2 , v1 2v2 ) (1, 11)
v1 v2 1
v1 2v2 11
v1 3, v2 4 Thus {(3, 4)} is the preimage of w=(-1, 11)
6.4
Linear Transformation ( ):
V , Wvector spaces
T : V WA linear tra nsformatio n of V into W if the
following two properties are true
(1) T (u v ) T (u) T ( v ), u, v V
(2) T (cu) cT (u),
c R
6.5
Notes:
(1) A linear transformation is said to be operation preserving
(because the same result occurs whether the operations of addition
and scalar multiplication are performed before or after the linear
transformation is applied)
T (u v ) T (u) T ( v )
Addition
in V
Addition
in W
T (cu) cT (u)
Scalar
multiplication
in V
Scalar
multiplication
in W
6.6
T (u v ) T (u1 v1 , u 2 v2 )
((u1 v1 ) (u2 v2 ), (u1 v1 ) 2(u2 v2 ))
((u1 u 2 ) (v1 v2 ), (u1 2u2 ) (v1 2v2 ))
(u1 u2 , u1 2u 2 ) (v1 v2 , v1 2v2 )
T (u) T ( v )
6.7
6.8
(c) f ( x) x 1
f ( x1 x2 ) x1 x2 1
f ( x1 ) f ( x2 ) ( x1 1) ( x2 1) x1 x2 2
f ( x1 x2 ) f ( x1 ) f ( x2 ) (f(x) = x+1 is not a linear transformation,
although it is a linear function)
In fact, f (cx) cf ( x)
6.9
6.10
Zero transformation ( ):
T :V W
T ( v ) 0, v V
Identity transformation ( ):
T :V V
T ( v ) v, v V
Theorem 6.1: Properties of linear transformations
T : V W , u, v V
(1) T (0) 0 (T(cv) = cT(v) for c=0)
(2) T ( v ) T ( v) (T(cv) = cT(v) for c=-1)
(3) T (u v) T (u) T ( v) (T(u+(-v))=T(u)+T(-v) and property (2))
(4) If v c1v1 c2 v2 cn vn ,
then T ( v) T (c1v1 c2 v2 cn vn )
c1T (v1 ) c2T (v2 ) cnT (vn )
(Iteratively using T(u+v)=T(u)+T(v) and T(cv) = cT(v))
6.11
Sol:
(2,3,2) 2(1,0,0) 3(0,1,0) 2(0,0,1)
According to the fourth property on the previous slide that
T
(
c
v
c
v
c
v
)
c
T
(
v
)
c
T
(
v
)
c
T
(
v
)
1 1
2 2
n n
1
1
2
2
n
n
6.12
0
3
2
3
1
The function T : R R is defined as T ( v ) Av 2
v1
v2
(a) v (2, 1)
R 2 vector R 3 vector
0
3
T ( v ) Av 2
1
T (2,1) (6,3,0)
2
1
6
3
0
(vector addition)
(scalar
6.13
multiplication)
Note:
R n vector
a11
a21
Av
am1
a12 a1n
a22 a2 n
am 2 amn
T ( v ) Av
R m vector
vn am1v1 am 2 v2 amnvn
T : Rn Rm
6.14
sin
cos
T(v
)
6.15
cos
sin
x cos
y sin
r cos cos r sin sin
r sin( )
T ( v ) Av
sin
cos
sin
cos
r cos
r sin
Ex 8: A projection in R3
The linear transformation T : R 3 R 3 is given by
1 0 0
A 0 1 0
0
0
0
is called a projection in R3
1 0 0 x
If v is ( x, y, z ), Av 0 1 0 y
0 0 0 z
x
y
6.17
T ( A) AT
(T : M mn M nm )
function:
domain:
codomain:
image of v under T: T v
range of T: T
preimage of w: w
linear transformation:
linear operator:
zero transformation:
identity transformation:
6.19
Sol:
0 0
ker(T ) 0 0
0
0
R
)
2
1 2 3 x
3
ker(T ) ?
Sol:
ker(T ) {( x1 , x2 , x3 ) | T ( x1 , x2 , x3 ) (0, 0), and ( x1, x2 , x3 ) R 3}
T ( x1 , x2 , x3 ) (0,0)
1 1 2
1 2
3
x1
x2 0
0
x3
6.22
1 1 2 0
G.-J. E.
1 2 3 0
1 0 1 0
0 1 1 0
x1 t
1
x2 t t 1
x
t
3
1
ker(T ) {t (1,1,1) | t is a real number }
span{( 1,1,1)}
6.23
T (u v ) T (u) T ( v ) 0 0 0
T (cu) cT (u) c0 0
(u ker(T ) cu ker(T ))
T is a linear transformation
2 0
1 3
0 2
0 0
1 1
1 0
0 1
2 8
1 1 0
1 3 1 0 0
G.-J. E.
1 0 2 0 1 0
0 0 0 2 8 0
1
2
x1
2 s t
x
s 2t
x x3
s s
4
t
4
x5
1 0
0 1 1 0 2 0
0 0 0 1 4 0
0 0 0 0 0 0
s
t
1 0
2 1
1 2
1 t 0
0 4
0 1
ker(T ) NS ( A) x | Ax 0, x R n (subspace of R n )
6.27
6.28
Pf:
Q T (0) 0 (Theorem 6.1)
range (T ) is a nonempty subset of W
Since T (u) and T ( v ) are vectors in range( T ), and we have
because u v V
T (u) T ( v ) T (u v ) range (T )
T is a linear transformation
Notes:
T : V W is a linear transformation
6.30
a11
Ax x1 21 x 2
M
a
m1
a 12
a 22
L
M
a m 2
a1n
a
xn 2 n b
M
amn
Therefore, the system Ax=b is consistent iff we can find (x1, x2,, xn)
such that b is a linear combination of the column vectors of A, i.e.b CS ( A)
Thus, we can conclude that the range consists of all vectors b, which is a linear
combination of the column vectors of A or said b CS ( A) . So, the column space
6.31
of the matrix A is the same as the range of T, i.e. range(T) = CS(A)
1 0 0
1 0 0
A 0 1 0 , such that 0 1 0
0 0 0
0 0 0
x
x
y y
z
0
x1
1
0 0
CS ( A) x1 0 x 2 1 x3 0 x2 , where x1 , x2 R
0
0 0
0
6.32
2 0
1 3
0 2
0 0
1 1
1 0
0 1
2 8
1 1
1 0
A
G.-J. E.
1 0 2 0 1
0
0
0
2
8
c1 c2 c3 c4 c5
1
2
2
1
0
3
1
0
0
0
w1
0 2 0 1
1 1 0 2
B
0 0 1 4
0 0 0 0
w2 w3 w4 w5
Note:
If T : R n R m is a linear transformation given by T (x) Ax, then
rank(T ) dim(range(T )) dim(CS ( A)) rank( A)
nullity(T ) dim(ker(T )) dim( NS ( A)) nullity( A)
The dimension of the row (or column) space of a matrix A is called the rank of
NS ( A) {x | Ax 0}
A
The dimension of the nullspace of A (
) is called the nullity6.35
Pf:
6.37
Sol:
rank (T ) rank ( A) 2
nullity (T ) dim( domain of T ) rank (T ) 3 2 1
The rank is determined by the number of leading 1s, and the
nullity by the number of free variables (columns without leading
1s)
6.38
Sol:
(a) dim(domain of T ) n 5
dim(kernel of T ) n dim(range of T ) 5 2 3
One-to-one ( ):
A function T : V W is called one-to-one if the preimage of
every w in the range consists of a single vector. This is equivalent
to saying that T is one-to-one iff for all u and v in V , T (u) T ( v)
implies that u v
one-to-one
not one-to-one
6.40
Pf:
() Suppose T is one-to-one
Then T ( v) 0 can have only one solution : v 0
i.e. ker(T ) {0}
u v ker(T ) u v 0 u v
T is one-to-one (because T (u) T ( v) implies that u v )
6.41
6.42
Onto ( ):
A function T : V W is said to be onto if every element
in W has a preimage in V
The definition
of onto linear
transformations
6.43
Consequently, T is onto
Therefore, T is one-to-one
6.44
Ex 11:
The linear transformation T : R n R m is given by T (x) Ax. Find the nullity
and rank of T and determine whether T is one-to-one, onto, or neither
1 2 0
(a) A 0 1 1
0 0 1
1 2
(b) A 0 1
0 0
Sol:
= dim(R ) =
n
n
(c) A
= dim(range
of T)
= # of
leading 1s
1 2 0
0 1 1
= (1) (2) =
dim(ker(T))
T:RnRm
dim(domain
of T) (1)
rank(T)
(2)
nullity(T
)
(a) T:R3R3
(b) T:R2R3
(c) T:R3R2
(d) T:R3R3
1 2 0
(d) A 0 1 1
0 0 0
If nullity(T)
=
dim(ker(T))
=0
If rank(T) =
dim(Rm) = m
1-1
onto
Yes
Yes
Yes
No
No
Yes
No
No
6.45
Isomorphism ( ):
A linear tra nsformatio n T : V W that is one to one and onto
is called an isomorphis m. Moreover, if V and W are vector spaces
such that there exists an isomorphis m from V to W , then V and W
are said to be isomorphic to each other
Q T is onto
dim( range of T ) dim(W ) n
Thus dim(V ) dim(W ) n
Note
Theorem 6.9 tells us that every vector space with dimension
n is isomorphic to Rn
one-to-one:
onto:
isomorphic space:
6.50
2 1 1
(2) T ( x) Ax 1 3 2
0 3 4
x1
x
2
x3
It is simpler to write
It is simpler to read
6.51
T (e1 ) 21 , T (e 2 )
M
a
m1
a 12
a1n
a
a 22
, L , T (e n ) 2 n ,
M
M
a m 2
amn
A T (e1 ) T (e 2 ) L T (e n )
M M
am1 am 2
L
L
O
L
a1n
a2 n
,
M
amn
6.52
Pf:
v1
v 2 v1
M
1
0
v 2
0
1
L
M
0
0
v n
v1e1 v2e 2 L vne n
M
1
a11
a21
Av
am1
a12
a22
am 2
6.53
a11
a 12
a1n
a
a
a
v1 21 v 2 22 vn 2 n
am1
a m 2
amn
v1T (e1 ) v2T (e 2 ) vnT (e n )
Therefore, T ( v ) Av for each v in R n
Note
Theorem 6.10 tells us that once we know the image of every
vector in the standard basis (that is T(ei)), you can use the
properties of linear transformations to determine T(v) for any
v in V
6.54
Sol:
Vector Notation
T (e1 ) T (1, 0, 0) (1, 2)
T (e 2 ) T (0, 1, 0) (2, 1)
Matrix Notation
T (e1 ) T ( 0 )
1
2
0
2
T (e 2 ) T ( 1 )
1
0
0
T (e3 ) T ( 0 )
0
0
6.55
A T (e1 ) T (e 2 ) T (e3 )
1 2 0
2
1
0
Check:
x
x
1 2 0 x 2 y
A y
y
2 1 0 2 x y
z
z
i.e., T ( x, y, z ) ( x 2 y, 2 x y )
1x 2 y 0 z
2 x 1y 0 z
Sol:
T ( x , y ) ( x , 0)
1 0
A T (e1 ) T (e 2 ) T (1, 0) T (0, 1)
0
0
Notes:
(1) The standard matrix for the zero transformation from Rn into Rm
is the mn zero matrix
(2) The standard matrix for the identity transformation from Rn into
Rn is the nn identity matrix In
6.57
6.58
Pf:
(1) (T is a linear transformation)
Let u and v be vectors in R n and let c be any scalar. Then
T (u v) T2 (T1 (u v )) T2 (T1 (u) T1 ( v))
T2 (T1 (u)) T2 (T1 ( v)) T (u) T ( v)
T (cv ) T2 (T1 (cv )) T2 (cT1 ( v )) cT2 (T1 ( v )) cT ( v )
Note:
T1 T2 T2 T1
6.59
Sol:
2 1 0
A1 0 0 0 (standard matrix for T1 )
1 0 1
1 1 0
A2 0 0 1 (standard matrix for T2 )
0 1 0
6.60
1 1 0 2 1 0 2 1 0
A A2 A1 0 0 1 0 0 0 1 0 1
0
1
0
1
0
1
0
0
0
2 1 0
A' A1 A2 0 0 0
1 0 1
1 1 0 2 2 1
0 0 1 0 0 0
0 1 0 1 0 0
6.61
Note:
If the transformation T is invertible, then the inverse is
unique and denoted by T1
6.62
(1) T is invertible
(2) T is an isomorphism
(3) A is invertible
Note:
6.63
2 x1 3 x2 x3
3x1 3 x2 x3
2 x1 4 x2 x3
2 3 1 1 0 0
A I 3 3 3 1 0 1 0
2
4
1
0
0
1
6.64
G.-J. E.
1 0 0 1 1 0
0 1 0 1 0 1
0 0 1 6 2 3
A1
0
1 1
A1 1 0
1
0
1 1
T 1 ( v ) A1 v 1 0
1
In other word s,
x1 x2
x1
x2
x1 x3
x
6
x
2
x
3
x
2
3
3 1
T 1 ( x1 , x2 , x3 ) ( x1 x2 , x1 x3 , 6 x1 2 x2 3x3 )
Check T-1(T(2, 3, 4)) = T-1(17, 19, 20) = (2, 3, 4)
6.65
(a linear transformation)
B {v1 , v 2 ,L , v n }
c1
c
if v can be represeted as c1 v1 c2 v 2 L cn v n , then [v]B 2
M
cn
T ( v) B ' A[ v]B ,
where A is called the matrix of T relative to the bases B and B (T
B B' )
6.66
T ( v1 ) B '
a11
a
21 ,
M
am1
T ( v 2 ) B '
a 12
a
22 ,L ,
M
a m 2
a1n
a
2n
T ( v n ) B '
amn
a11 a12
a
a22
21
M M
am1 am 2
L
L
O
L
a1n
a2 n
M
amn
Sol:
T (1, 2) (3, 0) 3(1, 0) 0(0, 1)
T ( 1, 1) (0, 3) 0(1, 0) 3(0, 1)
T (1, 2) B ' 03 , T (1, 1) B ' 03
6.69
Ex 6:
For the L.T. T : R 2 R 2 given in Example 5, use the matrix A
to find T ( v), where v (2, 1)
Sol:
v (2, 1) 1(1, 2) 1(1, 1)
B {(1, 2), (1, 1)}
1
v B
1
3 0 1 3
T ( v ) B ' A v B
0 3 1 3
T ( v ) 3(1, 0) 3(0, 1) (3, 3)
B' {(1, 0), (0, 1)}
Check:
T (2, 1) (2 1, 2(2) 1) (3, 3)
6.70
Notes:
(1) In the special case where V W (i.e., T : V V ) and B B ',
the matrix A is called the matrix of T relative to the basis B
(T
B
)
(2) If T : V V is the identity transformation
B {v1 , v 2 ,L , v n }: a basis for V
the matrix of T relative to the basis B
A T ( v1 ) B T ( v 2 ) B L
T ( v n ) B
1 0 L
0 1 L
M M O
0 0 L
0
0
In
1
6.71
6.72
( a linear transformation)
( a basis of V )
T ( v n ) B
( matrix of T relative to B)
(T B )
T (w n ) B '
P w1 B
P v1 B '
1
w2 B
v2 B '
L
L
w n B
v n B '
6.73
indirect
direct
6.74
Sol:
(1) A ' T (1, 0) B '
T (1, 1) B '
3
T (1, 0) (2, 1) 3(1, 0) 1(1, 1) T (1, 0) B '
1
2
T (1, 1) (0, 2) 2(1, 0) 2(1, 1) T (1, 1) B '
2
3 2
A' T (1, 0) B ' T (1, 1) B '
1 2
6.75
(2) standard matrix for T (matrix of T relative to B {(1, 0), (0, 1)})
2 2
A T (1, 0) T (0, 1)
1 3
transition matrix from B ' to B
1 1
P (1, 0) B (1, 1) B
0
1
(0, 1) B '
1 1
0
1
1 1 2 2 1 1 3 2
A' P AP
0
1
1
3
0
1
1
2
6.76
R
relative to B.
3 7
Find the matrix of T relative to B '
2
Sol:
Because the specific function is unknown, it is difficult to apply
the direct method to derive A, so we resort to the indirect
method where A = P-1AP
3 2
transition matrix from B ' to B : P (1, 2) B (2, 2) B
1
1
transition matrix from B to B ': P (3, 2) B ' (4, 2) B '
2
2 3 3 7 2 1 1 3
2
3
6.77
v B'
3
1
3 2 3 7
v B P v B ' 2 1 1 5
2 7 7 21
T ( v) B A v B 3 7 5 14
1 2 21 7
1
T ( v) B ' P T ( v) B 2 3 14 0
2 1 3 7
or T ( v ) B ' A' v B '
1
3
1
0
6.78
Similar matrix ( ):
For square matrices A and A of order n, A is said to be similar
to A if there exist an invertible matrix P s.t. A=P-1AP
Theorem 6.13: (Properties of similar matrices)
Let A, B, and C be square matrices of order n.
Then the following properties are true.
(1) A is similar to A
(2) If A is similar to B, then B is similar to A
(3) If A is similar to B and B is similar to C, then A is similar to C
Pf for (1) and (2) (the proof of (3) is left in Exercise 23):
(1) A I n AI n (the transition matrix from A to A is the I n )
(2) A P 1 BP PAP 1 P( P 1BP ) P 1 PAP 1 B
Q 1 AQ B (by defining Q P 1 ), thus B is similar to A
6.79
Ex 4: (Similar matrices)
2 2
3
(a) A
and A '
1
3
2
2 are similar
1 1
1
because A' P AP, where P
0
1
2 7
2 1
(b) A
and A '
are similar
3 7
1 3
3 2
because A' P AP, where P
6.80
Sol:
The transitio n matrix from B' to the standard matrix
P (1, 1, 0) B
P 1
1
2
1
2
(1, 1, 0) B
1
2
12
0 0
0
0
1 1 0
(0, 0, 1) B 1 1 0
0 0 1
6.81
A ' P 1 AP
1
2
1
2
1
2
0
0
1
1 3 0
3 1 0
0 0 2
1 0
1 1 0
0 0 1
1
12
0 0
0
4 0
(A is a diagonal matrix, which is
0 2 0
simple and with some
computational advantages)
0 0 2
6.82
d1k 0
k
0
d
2
(1) D k
M M
0 0
1
d1
L
L
O
L
0 L
1
0 d2 L
(3) D
M M O
0 0 L
0
M
k
d n
d1
0
0 L
d2 L
M M O
0 0 L
0
M , d i 0
1
d n
0
0
d n
(2) DT D
6.83
matrix of T relative to B: T B
similar matrix:
6.84
6.85