Tensor Theory Introduction and Definitions

Tensor Theory

Introduction and definitions

In an n-dimensional space $V_n$ (called a "manifold" in mathematics), points are specified by assigning values to a set of n continuous real variables $x^1, x^2, \ldots, x^n$, called the coordinates. In many cases these will run from $-\infty$ to $+\infty$, but the range of some or all of them can be finite.

Examples: In Euclidean space in three dimensions, we can use Cartesian coordinates x, y and z, each of which runs from $-\infty$ to $+\infty$. For a two dimensional Euclidean plane, Cartesians may again be employed, or we can use plane polar coordinates $r, \theta$ whose ranges are 0 to $\infty$ and 0 to $2\pi$ respectively.

Coordinate transformations. The coordinates of points in the manifold may be assigned in
a number of different ways. If we select two different sets of coordinates,

$x^1, x^2, \ldots, x^n$ and $x'^1, x'^2, \ldots, x'^n$,

there will obviously be a connection between them of the form

$$x'^r = f^r(x^1, x^2, \ldots, x^n), \qquad r = 1, 2, \ldots, n, \tag{1}$$

where the f's are assumed here to be well behaved functions. Another way of expressing the
same relationship is

$$x'^r = x'^r(x^1, x^2, \ldots, x^n), \qquad r = 1, 2, \ldots, n, \tag{2}$$

where $x'^r(x^1, x^2, \ldots, x^n)$ denotes the n functions $f^r(x^1, x^2, \ldots, x^n)$, $r = 1, 2, \ldots, n$.

Recall that if a variable z is a function of two variables x and y, i.e. z = f (x, y), then the
connection between the differentials dx, dy and dz is

$$dz = \frac{\partial f}{\partial x}\,dx + \frac{\partial f}{\partial y}\,dy. \tag{3}$$

Extending this to several variables therefore, for each one of the new coordinates we have

$$dx'^r = \sum_{s=1}^{n} \frac{\partial x'^r}{\partial x^s}\,dx^s, \qquad r = 1, 2, \ldots, n. \tag{4}$$

The transformation of the differentials of the coordinates is therefore linear and
homogeneous, which is not necessarily the case for the transformation of the coordinates
themselves.
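
As a numerical illustration of equation (4), the sketch below takes the change from Cartesian coordinates (x, y) to plane polars (r, theta) as the transformation. The function names and the sample point are illustrative choices, not part of the notes. The coordinates themselves are related non-linearly, yet a small displacement transforms linearly through the matrix of partial derivatives, up to second-order terms.

```python
import math

def to_polar(x, y):
    """Coordinate transformation (1): Cartesian (x, y) -> polar (r, theta)."""
    return math.hypot(x, y), math.atan2(y, x)

def jacobian(x, y):
    """Partial derivatives J[r][s] = dx'^r / dx^s evaluated at (x, y)."""
    r2 = x * x + y * y
    r = math.sqrt(r2)
    return [[x / r, y / r],
            [-y / r2, x / r2]]

def transform_differentials(x, y, d):
    """Equation (4): dx'^r = sum over s of (dx'^r / dx^s) dx^s."""
    J = jacobian(x, y)
    return [sum(J[r][s] * d[s] for s in range(2)) for r in range(2)]

x, y = 1.0, 2.0            # sample point (assumed for illustration)
d = [1e-6, -2e-6]          # small Cartesian displacement (dx, dy)
d_polar = transform_differentials(x, y, d)

# Compare with the actual change in (r, theta) between the two points.
r0, theta0 = to_polar(x, y)
r1, theta1 = to_polar(x + d[0], y + d[1])
exact_change = [r1 - r0, theta1 - theta0]
```

The two results agree to first order in the displacement; the residual is of second order in dx and dy.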

Range and Summation Conventions. Equations such as (4) may be simplified by the use
of two conventions:

Range Convention: When a suffix is unrepeated in a term, it is understood to take all values
in the range 1, 2, 3.....n.

Summation Convention: When a suffix is repeated in a term, summation with respect to that
suffix is understood, the range of summation being 1, 2, 3.....n.

With these two conventions applying, equation (4) may be written as

$$dx'^r = \frac{\partial x'^r}{\partial x^s}\,dx^s. \tag{5}$$

Note that a repeated suffix is a "dummy" suffix, and can be replaced by any convenient
alternative. For example, equation (5) could have been written as

$$dx'^r = \frac{\partial x'^r}{\partial x^m}\,dx^m, \tag{6}$$

where the summation with respect to s has been replaced by the summation with respect to
m.

Contravariant vectors and tensors. Consider two neighbouring points P and Q in the manifold whose coordinates are $x^r$ and $x^r + dx^r$ respectively. The vector $\overrightarrow{PQ}$ is then described by the quantities $dx^r$, which are the components of the vector in this coordinate system. In the dashed coordinates, the vector $\overrightarrow{PQ}$ is described by the components $dx'^r$, which are related to $dx^r$ by equation (5), the differential coefficients being evaluated at P. The infinitesimal displacement represented by $dx^r$ or $dx'^r$ is an example of a contravariant vector.

Defn. A set of n quantities $T^r$ associated with a point P are said to be the components of a contravariant vector if they transform, on change of coordinates, according to the equation

$$T'^r = \frac{\partial x'^r}{\partial x^s}\,T^s, \tag{7}$$

where the partial derivatives are evaluated at the point P. (Note that there is no requirement that the components of a contravariant tensor should be infinitesimal.)

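Defn. A set of $n^2$ quantities $T^{rs}$ associated with a point P are said to be the components of a contravariant tensor of the second order if they transform, on change of coordinates, according to the equation

$$T'^{rs} = \frac{\partial x'^r}{\partial x^m}\frac{\partial x'^s}{\partial x^n}\,T^{mn}. \tag{8}$$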

Obviously the definition can be extended to tensors of higher order. A contravariant vector is
the same as a contravariant tensor of first order.

Defn. A contravariant tensor of zero order transforms, on change of coordinates, according to
the equation

$$T' = T, \tag{9}$$

i.e. it is an invariant whose value is independent of the coordinate system used.

Covariant vectors and tensors. Let $\phi$ be an invariant function of the coordinates, i.e. its value may depend on position P in the manifold but is independent of the coordinate system used. Then the partial derivatives of $\phi$ transform according to

$$\frac{\partial \phi}{\partial x'^r} = \frac{\partial \phi}{\partial x^s}\,\frac{\partial x^s}{\partial x'^r}. \tag{10}$$

Here the transformation is similar to equation (7) except that the partial derivative involving
the two sets of coordinates is the other way up. The partial derivatives of an invariant
function provide an example of the components of a covariant vector.
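
The contrast between the two transformation laws can be checked numerically. The sketch below is an illustration under assumed choices: the invariant function is taken to be phi = x^2 + y^2, the coordinate change is Cartesian to plane polar, and all names and sample values are hypothetical. It transforms the gradient of phi with the matrix of dx^s/dx'^r, transforms an arbitrary vector with the matrix of dx'^r/dx^s as in equation (7), and compares the inner product of the two in both coordinate systems.

```python
import math

def jac_forward(x, y):
    """J[r][s] = dx'^r / dx^s for (x, y) -> (r, theta)."""
    r2 = x * x + y * y
    r = math.sqrt(r2)
    return [[x / r, y / r],
            [-y / r2, x / r2]]

def jac_inverse(r, theta):
    """K[s][r] = dx^s / dx'^r, from x = r cos(theta), y = r sin(theta)."""
    return [[math.cos(theta), -r * math.sin(theta)],
            [math.sin(theta), r * math.cos(theta)]]

x, y = 1.5, -0.5                      # sample point (assumed)
r, theta = math.hypot(x, y), math.atan2(y, x)
J, K = jac_forward(x, y), jac_inverse(r, theta)

grad_cart = [2 * x, 2 * y]            # d(phi)/dx^s for phi = x^2 + y^2
vec_cart = [0.3, 0.7]                 # arbitrary contravariant components T^s

# Covariant law: d(phi)/dx'^r = (dx^s/dx'^r) d(phi)/dx^s
grad_polar = [sum(K[s][i] * grad_cart[s] for s in range(2)) for i in range(2)]
# Contravariant law (7): T'^r = (dx'^r/dx^s) T^s
vec_polar = [sum(J[i][s] * vec_cart[s] for s in range(2)) for i in range(2)]

inner_cart = sum(grad_cart[s] * vec_cart[s] for s in range(2))
inner_polar = sum(grad_polar[i] * vec_polar[i] for i in range(2))
```

Since phi = r^2 in polars, the transformed gradient should come out as (2r, 0), and the inner product should take the same value in both systems.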

Defn. A set of n quantities $T_r$ associated with a point P are said to be the components of a covariant vector if they transform, on change of coordinates, according to the equation

$$T'_r = \frac{\partial x^s}{\partial x'^r}\,T_s. \tag{11}$$

By convention, suffices indicating contravariant character are placed as superscripts, and those indicating covariant character as subscripts. Hence the reason for writing the coordinates as $x^r$. (Note however that it is only the differentials of the coordinates, not the coordinates themselves, that always have tensor character. The latter may be tensors, but this is not always the case.)

Extending the definition as before, a covariant tensor of the second order is defined by the
transformation

$$T'_{rs} = \frac{\partial x^m}{\partial x'^r}\frac{\partial x^n}{\partial x'^s}\,T_{mn} \tag{12}$$

and similarly for higher orders.

Mixed tensors. These are tensors with at least one covariant suffix and one contravariant suffix. An example is the third order tensor $T^r_{st}$, which transforms according to

$$T'^r_{st} = \frac{\partial x'^r}{\partial x^m}\frac{\partial x^n}{\partial x'^s}\frac{\partial x^p}{\partial x'^t}\,T^m_{np}. \tag{13}$$

Another example is the Kronecker delta defined by

$$\delta^r_s = \begin{cases} 1, & r = s \\ 0, & r \neq s. \end{cases} \tag{14}$$

It is a tensor of the type indicated because (a) in an expression such as $B^{mn\ldots}_{pq\ldots}\,\delta^t_m$, which involves summation with respect to m, there is only one non-zero contribution from the Kronecker delta, that for which m = t, and so $B^{mn\ldots}_{pq\ldots}\,\delta^t_m = B^{tn\ldots}_{pq\ldots}$; (b) the coordinates in any coordinate system are necessarily independent of each other, so that $\partial x^r/\partial x^s = \delta^r_s$ and $\partial x'^r/\partial x'^s = \delta'^r_s$; so these two properties taken together imply that

$$\delta'^r_s = \frac{\partial x'^r}{\partial x^m}\frac{\partial x^n}{\partial x'^s}\,\delta^m_n. \tag{15}$$

Notes. 1. The importance of tensors is that if a tensor equation is true in one set of coordinates it is also true in any other coordinates, e.g. if $T_{mn} = 0$ (which, since m and n are unrepeated, implies that the equation is true for all m and n, not just for some particular choice of these suffices), then $T'_{rs} = 0$ also, from the transformation law. This illustrates the fact that any tensor equation is covariant, which means that it has the same form in all coordinate systems.

2. A tensor may be defined at a single point P within the manifold, or along a curve,
or throughout a subspace, or throughout the manifold itself. In the latter cases we
speak of a tensor field.

Tensor algebra

Addition of tensors. Two tensors of the same type may be added together to give another tensor of the same type, e.g. if $A^r_{st}$ and $B^r_{st}$ are tensors of the type indicated, then we can define

$$C^r_{st} = A^r_{st} + B^r_{st}. \tag{16}$$

It is easy to show that the quantities $C^r_{st}$ form the components of a tensor.

Symmetric and antisymmetric tensors. $A^{rs}$ is a symmetric contravariant tensor if $A^{rs} = A^{sr}$ and antisymmetric if $A^{rs} = -A^{sr}$. Similarly for covariant tensors. Symmetry properties are conserved under transformation of coordinates, e.g. if $A^{rs} = A^{sr}$, then

$$A'^{mn} = \frac{\partial x'^m}{\partial x^r}\frac{\partial x'^n}{\partial x^s}\,A^{rs} = \frac{\partial x'^m}{\partial x^r}\frac{\partial x'^n}{\partial x^s}\,A^{sr} = A'^{nm}. \tag{17}$$

Note however that for a mixed tensor, a relation such as $A^s_r = A^r_s$ does not transform to give the equivalent relation in the dashed coordinates. The concept of symmetry (with respect to a pair of suffices which are either both subscripts or both superscripts) can obviously be extended to tensors of higher order.

Any covariant or contravariant tensor of second order may be expressed as the sum of a
symmetric tensor and an antisymmetric tensor, e.g.

$$A^{rs} = \tfrac{1}{2}\left(A^{rs} + A^{sr}\right) + \tfrac{1}{2}\left(A^{rs} - A^{sr}\right). \tag{18}$$
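
The decomposition (18) is easy to carry out on an explicit array of components. The following is a minimal sketch in which a second order tensor is stored as a nested list; the function name `decompose` and the sample components are illustrative assumptions.

```python
def decompose(A):
    """Split components A[r][s] into symmetric and antisymmetric parts, as in (18)."""
    n = len(A)
    sym = [[0.5 * (A[r][s] + A[s][r]) for s in range(n)] for r in range(n)]
    anti = [[0.5 * (A[r][s] - A[s][r]) for s in range(n)] for r in range(n)]
    return sym, anti

# Sample components (assumed for illustration).
A = [[1.0, 2.0, 3.0],
     [4.0, 5.0, 6.0],
     [7.0, 8.0, 9.0]]
S, N = decompose(A)

# Adding the two parts back together recovers the original components.
rebuilt = [[S[r][s] + N[r][s] for s in range(3)] for r in range(3)]
```

Here S satisfies S[r][s] = S[s][r], N satisfies N[r][s] = -N[s][r], and the diagonal of N vanishes.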

Multiplication of tensors. In the addition of tensors we are restricted to tensors of a single
type, with the same suffices (though they need not occur in the same order). In the
multiplication of tensors there is no such restriction. The only condition is that we never
multiply two components with the same suffix at the same level in each. (This would imply
summation with respect to the repeated suffix, but the resulting object would not have tensor
character - see later.)

To multiply two tensors, e.g. $A_{rs}$ and $B^m_n$, we simply write

$$C^m_{rsn} = A_{rs}\,B^m_n. \tag{19}$$

It follows immediately from their transformation properties that the quantities $C^m_{rsn}$ form a tensor of the type indicated. This tensor, in which the symbols for the suffices are all different, is called the outer product of $A_{rs}$ and $B^m_n$.

Contraction of tensors. Given a tensor $T^m_{np}$, then

$$T'^m_{np} = \frac{\partial x'^m}{\partial x^r}\frac{\partial x^s}{\partial x'^n}\frac{\partial x^t}{\partial x'^p}\,T^r_{st}. \tag{20}$$

Hence replacing n by m (and therefore implying summation with respect to m)

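$$\begin{aligned}
T'^m_{mp} &= \frac{\partial x'^m}{\partial x^r}\frac{\partial x^s}{\partial x'^m}\frac{\partial x^t}{\partial x'^p}\,T^r_{st} \\
&= \frac{\partial x^s}{\partial x^r}\frac{\partial x^t}{\partial x'^p}\,T^r_{st} \\
&= \delta^s_r\,\frac{\partial x^t}{\partial x'^p}\,T^r_{st} \\
&= \frac{\partial x^t}{\partial x'^p}\,T^s_{st}
\end{aligned} \tag{21}$$

so we see that $T^m_{mp}$ behaves like a tensor $A_p$. The upshot is that contraction of a tensor (i.e. writing the same letter as a subscript and a superscript) reduces the order of the tensor by 2 and yields a tensor whose type is indicated by the remaining suffices.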

Note that contraction can only be applied successfully to suffices at different levels. We may of course construct, starting with a tensor $A^p_{qrs}$ say, a new set of quantities $A^p_{qrr}$; but these do not have tensor character (as one can easily check) so are of little interest.
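
The difference between contracting suffices at different levels and at the same level can be seen in a small numerical check. The sketch below assumes a linear change of coordinates, so that the partial derivatives are constant matrices J (for dx'/dx) and K (for dx/dx'); the matrices and components are arbitrary illustrative values. The same array of numbers is treated first as a mixed tensor and then as a covariant tensor.

```python
def transform_mixed(J, K, T):
    """Mixed law: T'^r_s = (dx'^r/dx^m)(dx^n/dx'^s) T^m_n, with T[m][n] = T^m_n."""
    n = len(T)
    return [[sum(J[r][m] * K[k][s] * T[m][k] for m in range(n) for k in range(n))
             for s in range(n)] for r in range(n)]

def transform_covariant(K, T):
    """Covariant law (12): T'_{rs} = (dx^m/dx'^r)(dx^n/dx'^s) T_{mn}."""
    n = len(T)
    return [[sum(K[m][r] * K[k][s] * T[m][k] for m in range(n) for k in range(n))
             for s in range(n)] for r in range(n)]

def contract(T):
    """Sum over a repeated pair of suffices."""
    return sum(T[i][i] for i in range(len(T)))

J = [[2, 1], [1, 1]]      # dx'^r/dx^s (assumed constant)
K = [[1, -1], [-1, 2]]    # dx^s/dx'^r, the inverse of J
T = [[1, 2], [3, 4]]      # sample components

before = contract(T)
mixed_after = contract(transform_mixed(J, K, T))             # one upper, one lower suffix
same_level_after = contract(transform_covariant(K, T))       # both suffices lower
```

With these values the contraction of the mixed tensor is 5 in both coordinate systems, whereas summing over two subscripts gives 5 before the transformation and 7 after it, so the latter is not an invariant.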


Having constructed the outer product $C^m_{rsn} = A_{rs}\,B^m_n$ in the example above, we can form the corresponding inner products $C^m_{msn} = A_{ms}\,B^m_n$ and $C^m_{rmn} = A_{rm}\,B^m_n$. Each of these forms a covariant tensor of second order.

Tests for tensor character. The direct way of testing whether a set of quantities form the
components of a tensor is to see whether they obey the appropriate tensor transformation law
when the coordinates are changed. There is also an indirect method however, two examples
of which will now be given:

Theorem 1. Let $X^r$ be the components of an arbitrary contravariant vector. Let $A_r$ be another set of quantities. If $A_r X^r$ is an invariant, then $A_r$ form the components of a covariant vector.
Proof: Since $X^r$ is a tensor, it obeys the tensor transformation law. Invariance of $A_r X^r$ means that

$$A_r X^r = A'_s X'^s = A'_s\,\frac{\partial x'^s}{\partial x^r}\,X^r \tag{22}$$

and so

$$\left(A_r - A'_s\,\frac{\partial x'^s}{\partial x^r}\right) X^r = 0. \tag{23}$$

Hence, since $X^r$ is an arbitrary tensor,

$$A_r = \frac{\partial x'^s}{\partial x^r}\,A'_s, \tag{24}$$

which is the covariant transformation law (11) with the roles of the two coordinate systems interchanged. QED

As an extension of this theorem, it is easy to show that any set of functions of the coordinates, whose inner product with an arbitrary covariant or contravariant vector is a tensor, are themselves the components of a tensor. For example, if $A^{rs} X_s$ is a tensor $B^r$ for an arbitrary covariant vector $X_s$, then $A^{rs}$ is a second order contravariant tensor.

Theorem 2. If $a_{rs} X^r X^s$ is invariant, $X^r$ being an arbitrary contravariant vector and $a_{rs}$ being symmetric in all coordinate systems, then $a_{rs}$ are the components of a covariant tensor of second order.
Proof: From our assumption about the invariance of $a_{rs} X^r X^s$,

$$a_{mn} X^m X^n = a'_{rs}\,X'^r X'^s = a'_{rs}\,\frac{\partial x'^r}{\partial x^m}\frac{\partial x'^s}{\partial x^n}\,X^m X^n. \tag{25}$$

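Hence

$$b_{mn} X^m X^n \equiv \left(a_{mn} - a'_{rs}\,\frac{\partial x'^r}{\partial x^m}\frac{\partial x'^s}{\partial x^n}\right) X^m X^n = 0. \tag{26}$$

Since $X^m$ is arbitrary and the total coefficient of $X^m X^n$ is $b_{mn} + b_{nm}$, we deduce that $b_{mn} + b_{nm} = 0$, i.e.

$$\begin{aligned}
a_{mn} + a_{nm} &= a'_{rs}\,\frac{\partial x'^r}{\partial x^m}\frac{\partial x'^s}{\partial x^n} + a'_{rs}\,\frac{\partial x'^r}{\partial x^n}\frac{\partial x'^s}{\partial x^m} \\
&= \left(a'_{rs} + a'_{sr}\right)\frac{\partial x'^r}{\partial x^m}\frac{\partial x'^s}{\partial x^n}
\end{aligned} \tag{27}$$

on interchanging the summation variables r and s in the second term. But $a_{mn} = a_{nm}$ in all coordinate systems, hence

$$a_{mn} = a'_{rs}\,\frac{\partial x'^r}{\partial x^m}\frac{\partial x'^s}{\partial x^n}. \quad \text{QED} \tag{28}$$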

The metric tensor

The Euclidean space. Consider first the familiar Euclidean space in three dimensions, i.e. a space in which one can define Cartesian coordinates x, y and z so that the distance $d\ell$ between two neighbouring points $(x, y, z)$ and $(x + dx,\ y + dy,\ z + dz)$ is given by

$$d\ell^2 = (dx)^2 + (dy)^2 + (dz)^2. \tag{29}$$

If we choose any other coordinates $x^1, x^2, x^3$ to identify points in this space, the original coordinates will be functions of these new coordinates, and their differentials will be linear combinations of the differentials of the new coordinates. Thus in terms of the latter coordinates,

$$d\ell^2 = a_{mn}\,dx^m\,dx^n \tag{30}$$

where the $a_{mn}$ will be functions of $x^m$. (For example in spherical polar coordinates $x^1 = r$, $x^2 = \theta$, $x^3 = \phi$ we have $a_{11} = 1$, $a_{22} = r^2$, $a_{33} = r^2 \sin^2\theta$ and all other a's are zero.)
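
The spherical polar values quoted above can be recovered directly from the Cartesian expression for the distance. The sketch below assumes the usual relations x = r sin(theta) cos(phi), y = r sin(theta) sin(phi), z = r cos(theta), and forms the coefficients as sums of products of the partial derivatives of the Cartesian coordinates with respect to the new ones. The function names and the sample point are illustrative.

```python
import math

def embedding_jacobian(r, theta, phi):
    """E[k][m] = d(Cartesian coordinate k)/d(x^m), with x^m = (r, theta, phi)."""
    st, ct = math.sin(theta), math.cos(theta)
    sp, cp = math.sin(phi), math.cos(phi)
    return [[st * cp, r * ct * cp, -r * st * sp],   # derivatives of x
            [st * sp, r * ct * sp, r * st * cp],    # derivatives of y
            [ct, -r * st, 0.0]]                     # derivatives of z

def metric(r, theta, phi):
    """Coefficients a_mn: the sum over k of E[k][m] E[k][n]."""
    E = embedding_jacobian(r, theta, phi)
    return [[sum(E[k][m] * E[k][n] for k in range(3)) for n in range(3)]
            for m in range(3)]

r, theta, phi = 2.0, 0.6, 1.1        # sample point (assumed)
a = metric(r, theta, phi)
expected_diagonal = [1.0, r ** 2, (r * math.sin(theta)) ** 2]
```

Up to rounding error, the diagonal entries reproduce 1, r^2 and r^2 sin^2(theta), and the off-diagonal entries vanish.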

We now show that $a_{mn}$ is a covariant tensor of second order. The proof goes as follows:

(a) $a_{mn}$ may be taken to be symmetric since each $a_{pq}$ occurs only in the combination $a_{pq} + a_{qp}$ on the RHS of (30).

(b) $d\ell^2 = a_{mn}\,dx^m\,dx^n$ is invariant, since the distance between two points does not depend on the coordinates used to evaluate it.

(c) By keeping one point fixed and letting the second point vary in the neighbourhood of the first, $dx^r$ may be considered an arbitrary contravariant vector.

Hence, using Theorem 2 above, $a_{mn}$ is a covariant tensor of second order. It is called the metric tensor for the Euclidean 3-space. A similar tensor obviously exists in the case of a two dimensional Euclidean space.

Riemannian space. A manifold is said to be Riemannian if there exists within it a covariant tensor of the second order which is symmetric. This tensor is called the metric tensor and normally denoted by $g_{mn}$. Its significance is that it can be used to define the analogue of "distance" between points, and the lengths of vectors. We will assume that all manifolds that we will be dealing with from now on are Riemannian.

Defn. The interval ds between the neighbouring points $x^r$ and $x^r + dx^r$ is given by

$$ds^2 = g_{mn}\,dx^m\,dx^n. \tag{31}$$

This is of course invariant. In the familiar Euclidean space where $g_{mn}$ is just the $a_{mn}$ above, $ds^2 = d\ell^2 \geq 0$, being zero only when the two points coincide. In other cases however, e.g. in spacetime in relativity theory, $ds^2$ may take on negative values, so that $ds$ itself is not necessarily real. If ds = 0 for $dx^r$ not all zero, the displacement $dx^r$ is called a null displacement. Note that there is no requirement that ds should necessarily have the physical dimensions of length.

The conjugate metric tensor. From the covariant metric tensor $g_{mn}$ we can construct a contravariant tensor $g^{mn}$ defined by

$$g^{mn}\,g_{np} = \delta^m_p. \tag{32}$$

To show that $g^{mn}$ is a tensor, we note that, for any contravariant vector $V^p$, $g^{mn}\,g_{np}\,V^p = \delta^m_p\,V^p = V^m$. This means that the inner product of $g^{mn}$ with the arbitrary covariant vector $g_{np}\,V^p$ is a tensor, $V^m$, and so we deduce that $g^{mn}$ is indeed a tensor of the type indicated. It is said to be conjugate to $g_{mn}$. It is easily shown that when the metric tensor is diagonal, i.e. when $g_{mn} = 0$ for $m \neq n$, the conjugate tensor is also diagonal, with each diagonal element satisfying $g^{nn} = 1/g_{nn}$ (no summation over n).

The following theorem can be proved, but will just be quoted here: if g is the determinant of the matrix $g_{mn}$ (i.e. choosing to write the components of the tensor $g_{mn}$ in the form of a matrix array), then

$$g^{mn}\,\frac{\partial g_{mn}}{\partial x^r} = \frac{\partial}{\partial x^r} \ln g. \tag{33}$$
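
Although the theorem is only quoted, it can be checked numerically in a simple case. The sketch below assumes the diagonal spherical polar metric with entries 1, r^2, r^2 sin^2(theta), for which the conjugate tensor is also diagonal with entries 1/g_nn, and estimates both sides of (33) by central differences. The helper names, the step size h and the sample point are illustrative assumptions.

```python
import math

def metric_diag(r, theta):
    """Diagonal entries g_11, g_22, g_33 of the spherical polar metric."""
    return [1.0, r * r, (r * math.sin(theta)) ** 2]

def log_det(r, theta):
    """ln g, where g is the determinant of the diagonal metric."""
    g11, g22, g33 = metric_diag(r, theta)
    return math.log(g11 * g22 * g33)

def shifted(point, axis, h):
    """Copy of point = (r, theta) with one coordinate displaced by h."""
    p = list(point)
    p[axis] += h
    return p

def lhs(point, axis, h=1e-6):
    """g^{mn} dg_{mn}/dx^r for a diagonal metric, by central differences."""
    g = metric_diag(*point)
    gp = metric_diag(*shifted(point, axis, h))
    gm = metric_diag(*shifted(point, axis, -h))
    return sum((gp[n] - gm[n]) / (2 * h) / g[n] for n in range(3))

def rhs(point, axis, h=1e-6):
    """d(ln g)/dx^r by the same central difference."""
    return (log_det(*shifted(point, axis, h))
            - log_det(*shifted(point, axis, -h))) / (2 * h)

point = (2.0, 0.7)   # (r, theta); axis 0 differentiates in r, axis 1 in theta
```

Analytically both sides equal 4/r for the r derivative and 2 cot(theta) for the theta derivative, since g = r^4 sin^2(theta).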

Raising and lowering suffices. Given a tensor $T^m{}_{rs}$, we may form another tensor $T_{nrs}$ defined by

$$T_{nrs} = g_{nm}\,T^m{}_{rs}. \tag{34}$$

Note that

$$g^{mn}\,T_{nrs} = g^{mn}\,g_{nt}\,T^t{}_{rs} = \delta^m_t\,T^t{}_{rs} = T^m{}_{rs}. \tag{35}$$

The tensor $T_{nrs}$ may therefore be regarded as possessing a special relationship with the original tensor $T^m{}_{rs}$ in that either of them may be found from the other by the operation of forming the inner product of the first with the metric tensor or its conjugate. For this reason, the same symbol is used (T in this instance), and we describe the above processes by saying that in (34) we have "lowered the suffix m", and that in (35) we have "raised the suffix n". The process of raising or lowering suffices can be extended to cover all the indices of a tensor. For example we can raise one or both of the suffices in the tensor $T_{mn}$, generating the corresponding tensors $T^m{}_n$, $T_m{}^n$ and $T^{mn}$. Notice the distinction between the two forms of the mixed tensor, effected by leaving appropriate gaps in the set of indices. When the tensor is symmetric however this distinction disappears and we simply write either of these as $T^m_n$.
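
For a concrete feel for lowering and raising, the sketch below applies the two operations to a vector rather than to a third order tensor, which keeps the sums short. It assumes the diagonal spherical polar metric, for which the conjugate tensor has diagonal entries 1/g_nn; the function names and the sample components are illustrative.

```python
import math

def metric(r, theta):
    """Covariant metric g_{mn} for spherical polars (diagonal)."""
    return [[1.0, 0.0, 0.0],
            [0.0, r * r, 0.0],
            [0.0, 0.0, (r * math.sin(theta)) ** 2]]

def conjugate_of_diagonal(g):
    """Conjugate metric g^{mn}; valid only when g_{mn} is diagonal."""
    n = len(g)
    return [[1.0 / g[m][m] if m == k else 0.0 for k in range(n)] for m in range(n)]

def lower(g, V):
    """V_n = g_{nm} V^m."""
    n = len(V)
    return [sum(g[i][m] * V[m] for m in range(n)) for i in range(n)]

def raise_index(g_inv, W):
    """V^m = g^{mn} V_n."""
    n = len(W)
    return [sum(g_inv[m][k] * W[k] for k in range(n)) for m in range(n)]

g = metric(2.0, math.pi / 2)          # sample point r = 2, theta = pi/2 (assumed)
g_inv = conjugate_of_diagonal(g)
V_up = [1.0, 0.5, -0.25]              # sample contravariant components
V_down = lower(g, V_up)               # approximately [1.0, 2.0, -1.0]
V_back = raise_index(g_inv, V_down)   # recovers V_up
```

Lowering followed by raising returns the original components, which is the vector analogue of the chain of equalities in (35).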

Cartesian tensors

Flat space. A space or manifold is said to be flat if it is possible to find a coordinate system for which the metric tensor $g_{mn}$ is diagonal, with all diagonal elements equal to $\pm 1$; otherwise the space is said to be curved.

The familiar Euclidean space in two or three dimensions is obviously flat, the diagonal elements then being all equal to +1. We normally assume that the ordinary three-dimensional space which we inhabit is flat, likewise in the special theory of relativity that the 4-dimensional "spacetime" is flat. In the general theory of relativity however this assumption must be abandoned, and we have to deal with the consequences of spacetime being curved.

It should not be assumed however that curved spaces never arise in elementary physics or mathematics. Take for instance the surface of a sphere, where it is natural to identify position on the surface by spatial coordinates $(\theta, \phi)$; these are the second and third members of the set of three spherical polar coordinates $(r, \theta, \phi)$, the first one having been set equal to a constant, viz. the radius of the sphere. The expression for the line element on the surface of a sphere is

$$d\ell^2 = a^2\left(d\theta^2 + \sin^2\theta\,d\phi^2\right) \tag{36}$$

where a is the radius of the sphere. No coordinate transformation can be found from $(\theta, \phi)$ to new coordinates $(x^1, x^2)$ such that the line element can be re-expressed in the form

$$d\ell^2 = (dx^1)^2 \pm (dx^2)^2 \tag{37}$$

and so the space is by definition curved. Of course in this case the result is in accordance with our everyday notions regarding curvature. Geometry in a curved space is intrinsically different from that for flat spaces, e.g. parallel lines do eventually meet, and the sum of the angles in a triangle is not $180^\circ$.

Homogeneous coordinates. These are coordinates for which the metric tensor is diagonal with all diagonal elements taking the values +1. The metric expression is then

$$ds^2 = (dx^1)^2 + (dx^2)^2 + (dx^3)^2 + \ldots \tag{38}$$

Clearly such coordinates can exist only if the space in question is flat. If this condition is
satisfied, it must always be possible to find a set of homogeneous coordinates, since any
minus signs in an expression for the metric can be transformed away by re-defining
coordinates (albeit with imaginary values) with appropriate factors of i inserted.

Cartesian coordinates in the Euclidean plane or the Euclidean 3- space are obviously
homogeneous.

Orthogonal transformations. These are linear transformations between two sets of homogeneous coordinates, $x^m$ and $x'^m$, of the form

$$x'^m = A^m_n\,x^n + A^m \tag{39}$$

where the coefficients $A^m_n$ and $A^m$ are constants. Since the set $x'^m$ are homogeneous,

$$ds^2 = dx'^m\,dx'^m. \tag{40}$$

But, from (39),

$$dx'^m = A^m_n\,dx^n \tag{41}$$

and so

$$ds^2 = A^m_n\,dx^n\,A^m_p\,dx^p. \tag{42}$$

But the coordinates $x^m$ are also homogeneous, and so the RHS of (42) is required to be equal to $dx^p\,dx^p$. Hence

$$A^m_n\,A^m_p\,dx^n = dx^p \tag{43}$$

which requires

$$A^m_n\,A^m_p = \begin{cases} 1, & n = p \\ 0, & \text{otherwise.} \end{cases} \tag{44}$$
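
Condition (44) can be tested on explicit coefficient matrices. The sketch below stores the coefficients as a nested list with `A[m][n]` standing for the coefficient with upper suffix m and lower suffix n, and measures how far a matrix departs from (44). A plane rotation is used as an example that satisfies the condition and a shear as one that does not; both matrices and the function names are illustrative assumptions.

```python
import math

def rotation(angle):
    """Coefficients of a rotation of the plane through the given angle."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, -s],
            [s, c]]

def orthogonality_defect(A):
    """Largest deviation of sum over m of A[m][n] A[m][p] from the values required by (44)."""
    size = len(A)
    return max(abs(sum(A[m][n] * A[m][p] for m in range(size))
                   - (1.0 if n == p else 0.0))
               for n in range(size) for p in range(size))

def interval_squared(d):
    """ds^2 in homogeneous coordinates, as in (38)."""
    return sum(c * c for c in d)

R = rotation(0.3)
shear = [[1.0, 1.0],
         [0.0, 1.0]]

d = [0.3, -0.4]                                               # sample displacement
d_rot = [sum(R[m][n] * d[n] for n in range(2)) for m in range(2)]   # equation (41)
```

The rotation has zero defect up to rounding and leaves ds^2 unchanged, while the shear has a defect of order one and so is not an orthogonal transformation.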

Cartesian tensors. If we are dealing with a flat space, homogeneous coordinates are an
obvious preferred choice since they facilitate geometrical calculations. Any change of
coordinates will normally involve orthogonal transformation equations satisfying equation
(39). It is convenient therefore to define Cartesian tensors as quantities which transform
according to the usual tensor transformation equations when the coordinates undergo an
orthogonal transformation, i.e. as we pass from one set of homogeneous coordinates to
another.

Note carefully that orthogonal transformation equations are a subset of all possible
transformation equations. Therefore "Cartesian tensors" will not in general obey the tensor
laws when subjected to an arbitrary coordinate transformation. On the other hand any
(unrestricted) tensor automatically satisfies the definition of being a Cartesian tensor, since
the conditions for the latter are a subset of the conditions for the former. We therefore have
the seemingly paradoxical statement that "all tensors are Cartesian tensors, but not all
Cartesian tensors are tensors".

Consider now the inverse transformation equations for an orthogonal transformation. Starting from (39) in the slightly modified form

$$x'^m = A^m_p\,x^p + A^m, \tag{45}$$

we have

$$A^m_n\,x'^m = A^m_n\,A^m_p\,x^p + A^m_n\,A^m \tag{46}$$

$$\qquad\qquad = x^n + A^m_n\,A^m \tag{47}$$

using (44). So the inverse equations are

$$x^n = A^m_n\,x'^m + A'^n \tag{48}$$

where

$$A'^n = -A^m_n\,A^m. \tag{49}$$
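
The inverse equations can be verified by a round trip. In the sketch below the coefficients are stored as a nested list `A[m][n]` and the constants as a list `b[m]`; both names, the rotation angle and the sample point are illustrative assumptions. The inverse uses the same coefficients with the summation taken over the first suffix, together with a constant term equal to minus the sum over m of `A[m][n] * b[m]`, which is what rearranging (47) gives.

```python
import math

def forward(A, b, x):
    """Equation (45): x'^m = A^m_p x^p + A^m."""
    n = len(x)
    return [sum(A[m][p] * x[p] for p in range(n)) + b[m] for m in range(n)]

def inverse(A, b, x_dashed):
    """Inverse equations: x^n = A^m_n x'^m - A^m_n A^m, summed over m."""
    n = len(x_dashed)
    return [sum(A[m][k] * (x_dashed[m] - b[m]) for m in range(n)) for k in range(n)]

angle = 0.8                                   # sample rotation angle (assumed)
A = [[math.cos(angle), -math.sin(angle)],
     [math.sin(angle), math.cos(angle)]]
b = [1.0, -2.0]                               # sample constants A^m
x = [0.5, 3.0]                                # sample point

x_dashed = forward(A, b, x)
x_back = inverse(A, b, x_dashed)              # should reproduce x
```

The round trip only works because the coefficients satisfy the orthogonality condition (44); for a general linear transformation the inverse would require the inverse matrix instead.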

The whole point of this analysis is now revealed: from equations (39) and (48) we see that

$$\frac{\partial x'^m}{\partial x^n} = A^m_n, \qquad \frac{\partial x^n}{\partial x'^m} = A^m_n. \tag{50}$$

The two differential coefficients involved in these equations are therefore equal; but we see,
looking back at equations (7) and (11), that it was the presumed difference between them
which was the whole basis of the distinction between covariant and contravariant tensors.
Therefore if we restrict ourselves to Cartesian tensors, the distinction between covariant and
contravariant tensors disappears, and there is no reason to continue to differentiate between
indices used as superscripts and those used as subscripts. For convenience, subscripts are
almost invariably the preferred choice in practice.

For example, in solid state physics we may require to calculate the electrical conductivity of a metallic crystal. In an isotropic medium such as a polycrystalline material the conductivity equation $j_i = \sigma E_i$ relates the components of the current density j to the components of the electric field E, with the conductivity $\sigma$ taken to be constant. But in a single crystal the general relationship would be expressed as $j_i = \sigma_{ij} E_j$, where $\sigma_{ij}$ is the conductivity tensor and the usual summation convention applies. In most textbooks on such topics the underlying assumption that the crystal or other system under consideration is embedded in a flat space is taken for granted, and Cartesian tensors are automatically implied by the choice of a Cartesian coordinate system.
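
The difference between the isotropic and single-crystal relations can be seen with a few numbers. The sketch below uses made-up conductivity values in arbitrary units, chosen purely for illustration and not taken from any real material, and writes the summation over the repeated suffix j as an explicit sum.

```python
def current_isotropic(sigma, E):
    """j_i = sigma E_i for a scalar conductivity."""
    return [sigma * E_i for E_i in E]

def current_anisotropic(sigma_tensor, E):
    """j_i = sigma_ij E_j, with summation over the repeated suffix j."""
    n = len(E)
    return [sum(sigma_tensor[i][j] * E[j] for j in range(n)) for i in range(n)]

E = [1.0, 1.0, 0.0]                       # sample field components (assumed)

j_iso = current_isotropic(2.0, E)         # parallel to E

sigma_crystal = [[3.0, 0.0, 0.0],         # hypothetical conductivity tensor
                 [0.0, 1.0, 0.0],
                 [0.0, 0.0, 1.0]]
j_crystal = current_anisotropic(sigma_crystal, E)   # not parallel to E
```

In the isotropic case the current density is parallel to the field, whereas with the tensor the two directions generally differ.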

N C McGill
