Académique Documents
Professionnel Documents
Culture Documents
Springer
Contents
Minkowski Spacetime . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
2.2 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
2.3 Geometrical Structure of M . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
2.4 Lorentz and Poincare Groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
2.5 Poincare Algebra . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
2.5.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
2.5.2 Lie Algebra of L+ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
2.5.3 Lie Algebra of P+ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
2.5.4 Universal Enveloping Algebra and Casimir Invariants . . . . . . 85
2.6 Momentum Space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
2.6.1 Orbits and Isotropy Groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
2.6.2 Invariant Measures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99
2.6.3 Momentum Space and the Character Group . . . . . . . . . . . . . . 102
2.7 Spacetime Symmetries and Projective Representations . . . . . . . . . . . 105
2.8 Positive Energy Representations of P+ with m > 0 . . . . . . . . . . . . . . . 107
1
1
14
24
29
33
vii
viii
Contents
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 175
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 185
Preface
During the period in which the foundations of quantum mechanics were being laid
it was clear that there were two issues that demanded attention. On the one hand,
there were classical physical systems, such as the electromagnetic field, that could
not be understood from the perspective of classical mechanics and so were not directly amenable to the techniques that were evolving for the quantization of mechanical systems. On the other hand, at the time this development was taking place
the special theory of relativity was a firmly established part of theoretical physics
and yet quantum mechanics took no account of it and it was clear that a really satisfactory quantum theory must be relativistic. The problem of reconciling quantum
mechanics and special relativity is formidable and, indeed, they are essentially irreconcilable if one insists on retaining also the classical notion of a point material
particle. Naively, this is due to the fact that special relativity requires that any such
particle can be regarded as at rest in some inertial frame of reference and, in such a
frame, it would have a well-defined position (the spot where it rests) and momentum
(namely, zero) and this would be forbidden by the uncertainty principle of quantum
mechanics if, as relativity demands, all such frames of reference must be regarded
as physically equivalent.
In a proposed sequel to the manuscript [Nab5] on the Foundations of Quantum
Mechanics we will attempt to sort out, for those whose background is in mathematics rather than physics, how such a reconciliation might be carried out and how the
notion of a quantum field emerges as a result. The brief manuscript before you now
is an excerpt from this sequel that, we hope, may be of some independent interest.
It centers around the problem of building the machinery required to define what it
means to say that a quantum theory is relativistic.
Chapter 1 is a synopsis of the material on Lie groups, Lie algebras and their representations that we will require. Sections 1.1 and 1.2 contain the basic definitions
and examples, while Section 1.3 introduces the important notion of a projective representation of a Lie group. In Sections 1.4 and 1.5 we describe the so-called Mackey
machine for constructing all of the irreducible, unitary representations of certain
semi-direct products of Lie groups. There are some rather deep theorems here that
ix
Preface
we will state, but not prove. We will, however, make every eort to provide ample
references for all of the details we do not include.
In Chapter 2 we turn our attention special relativity. The first three sections motivate and describe the basic geometrical structure of Minkowski spacetime M and are
drawn largely from [Nab4]. Section 2.4 contains the construction of the 2-fold universal covering groups SL(2, C) and ISL(2, C) of the Lorentz and Poincare groups
L+ and P+ , respectively. In Section 2.5 we study the structure of the Lie algebras
of L+ and P+ (and therefore of SL(2, C) and ISL(2, C)). Here we also introduce the
universal enveloping algebra U(g) of a Lie algebra g since this is where one finds the
so-called Casimir invariants. The application of the Mackey machine to ISL(2, C)
requires some rather detailed information about the algebraic dual of Minkowski
spacetime, called momentum space, and all of this will be derived in Section 2.6.
In Section 2.7 we identify the relativistic invariance of a quantum system with the
existence of a projective representation of P+ on the Hilbert space H of the system and indicate how the problem of finding all of these can be reduced, via a deep
theorem of Bargmann, to that of finding the unitary representations of the double
cover ISL(2, C) of P+ . Finally, we pull all of this information together in Section
2.8 to enumerate those particular irreducible, unitary representations of ISL(2, C)
that, from the perspective of the physical applications we have in mind, are of interest to us, namely, those of positive mass and positive energy. We will not consider
the massless case, but will supply references.
The source of our interest in the irreducible, projective representations of the
Poincare group resides in physics. Specifically, these are precisely the objects required for a rigorous definition of the relativistic invariance of a quantum theory.
One need not understand the physical motivation in order to appreciate the mathematics, but one then knows only half of the story. For those inclined toward full
disclosure we have included Appendix A with brief discussions of some physical background material on classical and quantum mechanics, taken largely from
[Nab5]. For our purposes here the central notion is that of a symmetry or, more generally, a symmetry group of a physical system. In Sections A.2 and A.3 we will try to
describe enough of the formalism of classical Lagrangian and Hamiltonian mechanics to understand what it means to say that a classical mechanical system admits a
symmetry group and what can be inferred from this. In Section A.4 we will attempt
the same thing for quantum mechanics and this will lead us to the projective representations of P+ . The Appendix concludes with Section A.5 in which we provide a
brief introduction to the quantum mechanical phenomenon of spin.
A few editorial comments are worth making at the outset. The subtitle we have
chosen contains the word sketch and this is entirely appropriate. The rather modest
goal here is to provide something of a global picture of the ideas, both mathematical
and physical, that are involved in understanding the construction and significance
of one particular class of irreducible, unitary representations of the Poincare group
and its universal cover. We will not pretend to have oered an exhaustive treatment.
We will, however, make a concerted eort to direct those who need the full story to
appropriate sources for all of the details we have not included here.
Preface
xi
We must also confess that the manuscript assumes a fairly substantial mathematical background. We have provided brief synopses of some of the mathematical
machinery required to carry out our task, but generally this has been limited to topics
that were not discussed in some detail in [Nab5]. Naturally, it does not matter where
this background has been acquired and when it comes time to provide references we
will include some readily accessible sources in addition to [Nab5].
We will consistently employ the Einstein summation convention according to
which a repeated index, one subscript and one superscript, indicates a sum over the
range of values that the index can assume. For example, if i and j are indices that
range over 1, . . . , n, then
Xi
L ! i L
L
L
=
X i = X1 1 + + Xn n
q i
i=1
3
!
,=0
v w = 00 v0 w0 + + 03 v0 w3 + + 30 v3 w0 + + 33 v3 w3 ,
and so on.
We should also say a few words about the signature of the quadratic form of
Minkowski spacetime. There was a time when the world was evenly divided between ( + + +) and (+ ) and heartfelt arguments were put forth in favor of
each. However, there is some statistical evidence that (+ ) has won the day. If
so, this would put our reference [Nab4] on the losing side. In any case we have opted
for (+ ) here. This changes nothing essential beyond a few minus signs and
the sense of a few inequalities in the definitions. Even so, considering the number
of references we will make to [Nab4], we thought it only fair to alert the reader to
these adjustments. Forewarned is forearmed.
Chapter 1
number multiplication, the nonzero complex numbers C with complex multiplication, and the complex numbers of modulus one S 1 (also denoted T in this context)
with complex multiplication. The general linear groups GL(n, R) and GL(n, C) consisting of all nonsingular n n matrices with real and complex entries, respectively,
are also Lie groups under matrix multiplication. The product of two Lie groups is a
Lie group with the product manifold structure and the direct product group structure.
We will see more examples shortly. The following is Theorem 3.21 of [Warn].
Theorem 1.1.2. (Closed Subgroup Theorem) A closed subgroup H of a Lie group
G is an embedded submanifold of G and a Lie group with respect to the induced
manifold structure and the group operations inherited from G.
The proof of the following Corollary is a nice application of the Closed Subgroup
Theorem so we will include the argument.
Corollary 1.1.3. Let G and H be Lie groups and : G H a continuous group
homomorphism. Then is smooth.
Proof. Let : G G H be the graph map defined by
(g) = (g, (g)).
Let G and H be the projections of G H onto G and H, respectively. is continuous, injective, and maps onto the graph (G) of . Its inverse is the restriction
of G to (G) and this is also continuous. Consequently, is a homeomorphism
of G onto (G). Since is a homomorphism, (G) is a subgroup of the Lie group
G H. It is closed because it is the inverse image of the diagonal in H H under
the continuous map G H H H defined by (g, h) $ ((g), h).
According to the Closed Subgroup Theorem, (G) is an embedded submanifold
of the Lie group G H and is itself a Lie group under the group operation on G H
and the submanifold structure. Now notice that
= (H | (G) ) (G | (G) )1 .
The projections G : G H G and H : G H H are smooth and, because
(G) is an embedded submanifold of G H, the inclusion map : (G) *
G H is smooth. Since H | (G) = H , it is also smooth. Similarly, G | (G) =
G is smooth. To show that is smooth it will therefore suce to show that
the homeomorphism G | (G) is a dieomorphism. Moreover, since (G) is a Lie
group, left translation by any element is a dieomorphism so it will be enough
to show that G | (G) is a local dieomorphism near some point. Since G | (G) is
smooth this follows from Sards Theorem which asserts that G | (G) must have a
regular value.
Remark 1.1.1. If you would prefer to avoid an application of Sards Theorem there
is a direct proof of the last assertion in Lemma 9.4 of [VDB].
+
4. The unitary group U(n) consists of all nn complex matrices A that satisfy AA =
T
T
A A = idnn , where A is the conjugate transpose of A. These are precisely the
matrices which, when identified with linear transformations on Cn , preserves the
standard Hermitian inner product x, y = x1 y1 + + xn yn , that is, which satisfy
Ax, Ay = x, y for all x, y Cn . The determinant of any element of U(n) is a
complex number of modulus one. As a manifold, U(n) has dimension n2 . Note
that we have taken the Hermitian inner product to be complex-linear in the second
slot and conjugate linear in the first.
5. The special unitary group SU(n) is the subgroup of U(n) consisting of those
elements that have determinant 1. As a manifold, SU(n) has dimension n2 1.
When n = 2 the special unitary group SU(2) is closely related to the rotation
group SO(3) (see Section 2.4).
6. If n is a positive integer and p and q are non-negative integers with n = p+q, then
the semi-orthogonal group O(p, q) consists of all n n real matrices A satisfying
AT A = , where
Remark 1.1.2. Any real Lie algebra A has a Lie algebra complexification AC defined in the following way. The points of AC are ordered pairs (X1 , X2 ) of vectors in A and one defines addition and complex scalar multiplication in AC by
(X1 , X2 ) + (Y1 , Y2 ) = (X1 + Y1 , X2 + Y2 ) and (a + bi)(X1 , X2 ) = (aX1 bX2 , aX2 + bX1 ).
This gives AC the structure of a complex vector space. It is customary to write
(X1 , X2 ) as X1 + iX2 so that these operations take the same form as those of C, that
is,
(X1 + iX2 ) + (Y1 + iY2 ) = (X1 + Y1 ) + i(X2 + Y2 )
and
(a + bi)(X1 + iX2 ) = (aX1 bX2 ) + i(bX1 + aX2 ).
The bracket [ , ]C on AC is then defined by just multiplying out, that is,
"
# "
#
[X1 + iX2 , Y1 + iY2 ]C = [X1 , Y1 ] [X2 , Y2 ] + i [X1 , Y2 ] + [X2 , Y1 ] .
If A1 and A2 are two Lie algebras with brackets [ , ]1 and [ , ]2 , respectively,
then a linear map T : A1 A2 that satisfies T ([X, Y]1 ) = [T (X), T (Y)]2 for all
X, Y A1 is called a Lie algebra homomorphism. If A2 is the algebra End(V)
of endomorphisms of some complex vector space V with the commutator bracket,
then T is called a Lie algebra representation of A1 . If T : A1 A2 is a linear
isomorphism then it is called a Lie algebra isomorphism and, if such a thing exists,
we say that A1 and A2 are isomorphic. A Lie algebra isomorphism of A onto itself
is called an automorphism of A. A Lie subalgebra of a Lie algebra A is a linear
subspace B of A that is closed under the bracket [ , ] of A and therefore is a Lie
algebra in its own right with this same bracket.
Every Lie group G has associated with it a Lie algebra g that can be defined in
two equivalent ways. A vector field X on G is said to be left invariant if, for each
g G, (Lg ) X = X Lg , where Lg : G G is the left translation dieomorphism
Lg (g ) = gg for every g G and (Lg ) is its derivative. Left invariant vector fields
are necessarily smooth (Theorem 5.8.2 of [Nab2]). Every tangent vector at the identity in G gives rise, via left translation, to a unique left-invariant vector field on G.
One can think of g as the real linear space of left invariant vector fields on G with
[X, Y] taken to be the Lie bracket of the vector fields X and Y (see pages 263-264
of [Nab2] for Lie brackets). Equivalently, g can be identified with the tangent space
T e (G) at the identity e in G with the bracket of two tangent vectors x and y in T e (G)
defined by writing x = X(e) and y = Y(e) for left invariant vector fields X and Y and
setting [x, y] = [X, Y](e). In particular, the linear dimension of g is the same as the
manifold dimension of G.
Exercise 1.1.1. Work all of this out for the additive group Rn . Specifically, prove
each of the following.
1. Vector addition is a smooth map from Rn Rn to Rn so Rn is a Lie group.
k=1
k=1
so(n) = o(n).
O(n) and SO(n) are not isomorphic as Lie groups, but they have the same Lie algebras. Two Lie groups are said to be locally isomorphic if their Lie algebras are
isomorphic.
Once the Lie algebra g of a matrix Lie group G is identified with a particular
vector space of matrices it is often convenient to have in hand a more explicit description of the commutator bracket. For this one can select a basis {Xi }i=1,...,n for
g. The bracket on g is then completely determined by the commutators [Xi , X j ] for
i, j = 1, . . . , n. But each [Xi , X j ] is a linear combination of the basis elements so
there exist constants Ci jk , k = 1, . . . , n, such that
[Xi , X j ] =
n
!
Ci jk Xk .
k=1
These are called the structure constants of g relative to the chosen basis and they
determine the bracket completely.
Exercise 1.1.2. Show that
0 0 0
0 0 1
0 1 0
X1 = 0 0 1 , X2 = 0 0 0 , X3 = 1 0 0
01 0
1 0 0
0 0 0
is a basis for o(3) (and so(3)) and compute the commutators to show that
[Xi , X j ] = i jk Xk , i, j = 1, 2, 3,
where i jk is the Levi-Civita symbol (1 if i jk is an even permutation of 123, -1 if i jk
is an odd permutation of 123, and 0 otherwise).
Example 1.1.3. It is not a trivial matter to come up with Lie groups that are not
isomorphic to matrix Lie groups and all of the examples of interest to us are, in fact,
isomorphic to matrix Lie groups (although perhaps not obviously so). For each of
the following matrix Lie groups we will specify the set of matrices that constitute
its Lie algebra; the bracket is always matrix commutator.
1. The Lie algebra gl(n, R) of GL(n, R) is the linear space of all real, n n matrices.
2. The Lie algebra gl(n, C) of GL(n, C) is the real linear space of all complex, n n
matrices.
3. The Lie algebras o(n) and so(n) of O(n) and SO(n), respectively, both consist of
all real, n n, skew-symmetric matrices.
4. The Lie algebra u(n) of U(n) is the real linear space of all complex, nn matrices
T
X that are skew-Hermitian (X = X).
5. The Lie algebra su(n) of SU(n) is the real linear space of all complex, n n
T
matrices X that are skew-Hermitian (X = X) and tracefree (Trace (X) = 0).
Remark 1.1.3. In Exercise 1.2.6 (9) we will exhibit a basis for su(2) in terms of
the Pauli spin matrices and you will use it to show that su(2) is isomorphic to
so(3) so that SU(2) and SO(3) are locally isomorphic.
6. The Lie algebras o(p, q) and so(p, q) of O(p, q) and SO(p, q), respectively, both
consist of all real, n n matrices X that satisfy X T = X, where
= diag (1, . p. . , 1, 1, . q. . , 1).
When p = 1 and q = 3 this implies that o(1, 3) = so(1, 3) consists of all real
matrices of the form
0 X 0 1 X 0 2 X 0 3
X 0
1
1
0 X 2 X 3
.
X = 0 1 1
X 2 X 2 0 X 2 3
0
1
2
X 3 X 3 X 3 0
This is also the Lie algebra so+ (1, 3) of the proper, orthochronous Lorentz group
SO+ (1, 3) = L+ (see Section 2.4).
7. For Lie groups such as the Poincare group (see Section 2.4) which are given as
semi-direct products one can define the general notion of a semi-direct product of
Lie algebras and prove that the Lie algebra of a semi-direct product of Lie groups
is the semi-direct product of the Lie algebras of these groups (see pages 301-306
of [Nab5] for a brief description and an example or Section I.4 of [Knapp] for
the details). One can often avoid this abstract description of the Lie algebra by
finding an explicit representation of the Lie group semi-direct product as a matrix
Lie group; such a representation of the Poincare group is described in Section 2.4.
Example 1.1.4. An associative algebra over a field K is a vector space A over K on
which is defined a K-bilinear map B : A A A, written simply B(a, b) = ab for
all a, b A and called multiplication, that satisfies a(bc) = (ab)c for all a, b, c A.
A is said to be unital if there exists an element 1 A, called the unit, satisfying
1a = a1 = a for all a A. A subalgebra of A is a linear subspace B of A that
is closed under multiplication and is therefore an algebra in its own right with the
same operations as A. If A is unital, then B is also required to contain the unit 1.
On the other hand, a (two-sided) ideal in A is an additive subgroup J of A with the
property that x J implies ax J and xa J for all a A. If A is unital and J
contains the unit, then, in fact, J = A. If S is an arbitrary subset of A, then the ideal
generated by S is the intersection of all ideals in A that contain S .
Any associative algebra A over R or C can be given the structure of a Lie algebra
by defining on it the commutator bracket
[a, b] = ab ba.
We call this the commutator Lie algebra structure of A. In Section 2.5.4 we will discuss the problem of representing a given Lie algebra as the commutator Lie algebra
of some associative algebra.
If G is a Lie group with Lie algebra g we define the exponential map
exp : g G
from g to G as follows. Regard g as the tangent space to the identity e in G. For any
g there exists a unique homomorphism : R G satisfying (0) = e and
(0) = (see page 102 of [Warn]). We define
exp () = (1).
The terminology and notation are motivated by the fact that, for the general linear
group GL(n, R),
exp : gl(n, R) GL(n, R)
is precisely matrix exponentiation (Example 3.35 of [Warn]). The same is true of
any matrix group so in this case we will often use exp () and e interchangeably.
Exercise 1.1.3. Let G be the additive group Rn . Then the Lie algebra can be canonically identified with Rn (Exercise 1.1.1). Show that, with this identification, the
exponential map is given by
exp () = .
We will have occasion to need a more geometrical description of the rotation
group SO(3). The following is Lemma A.1 of [Nab2].
Theorem 1.1.4. Let N be an element of so(3). Then the matrix exponential etN is
in SO(3) for every t R. Conversely, if A is any element of SO(3), then there is a
unique t [0, ] and a unit vector n = (n1 , n2 , n3 ) in R3 for which
A = etN = id33 + (sin t)N + (1 cos t)N 2 ,
where N is the element of so(3) given by
10
0 n3 n2
3
0 n1 .
N = n
2 1
n n
0
11
12
The map ad is called the adjoint representation of g and adX Y = [X, Y] is the adjoint
action of X on Y. The Lie algebra homomorphism ad : g End(g) is a Lie algebra
representation of g on g.
Now let G be a Lie group, H a closed (not necessarily normal) subgroup of G,
and : G G/H the natural projection of G onto the set of left cosets of H in G.
(g0 ) = [g0 ] = g0 H
Supply G/H with the quotient topology determined by . Then it is not dicult to
show that is a continuous, open map and the left action of G on G/H defined by
(g, [g0 ]) G G/H $ g [g0 ] = [gg0 ] G/H
is continuous. The action is also transitive on G/H, that is, for any two points [g0 ]
and [g1 ] in G/H there is a g G for which [g1 ] = g [g0 ], namely, g = g1 g1
0 . For
this reason G/H is called a homogeneous space. The corresponding statements in
the dierentiable category require considerably more work and are contained in the
following result which combines Theorem 3.58 and Theorem 3.64 of [Warn].
Theorem 1.1.5. Let G be a Lie group, H a closed subgroup of G and : G G/H
the natural projection of G onto the space of left cosets of H in G. Then G/H admits
a unique dierentiable structure for which the left action (g, [g0 ]) G G/H $
g [g0 ] = [gg0 ] G/H is smooth. Relative to this dierentiable structure all of the
following are satisfied.
1. : G G/H is smooth.
2. If H is a normal subgroup of G, then G/H is a Lie group.
3. : G G/H admits smooth local sections, that is, for each point [g0 ] G/H
there exists an open neighborhood U of [g0 ] in G/H and a smooth map s : U
1 (U) G such that s = idU .
The third apart of Theorem 1.1.5 asserts that one can locally make a smooth
selection of a representative from each equivalence class in G/H. We will put Theorem 1.1.5 to good use in Section 1.4 when we describe Mackeys theory of induced
representations. The same is true of the following, which is Theorem 3.62 of [Warn].
13
14
(1.1)
Show also that if H is finite-dimensional, then all of these are equivalent to the
continuity of : G U(H) when U(H) is given the operator norm topology.
15
Hint:
For (3) (1) show
that (g)u (g0 )u 2 = 2u2 2Re (g1
0 g)u, u
**
**
2
1
* u (g0 g)u, u *.
16
, b1 .
2
2
"/
0#
1
1
Now notice that, if E A a1 +b
= I, then the Spectral Theorem gives A = a1 +b
2
2 I
and we are done. Otherwise, E A must be I on one of the intervals and 0 on the
other. Denote by [a2 , b2 ] the interval on which it is I. Applying the same argument to [a2 , b2 ] we either prove the result (at the midpoint) or we obtain an interval
[a3 , b3 ] of half the length of [a2 , b2 ] on which E A is I. Continuing inductively, we
either prove the result in a finite number of steps or we obtain a nested sequence
[a1 , b1 ] [a2 , b2 ] [a3 , b3 ] of intervals whose lengths approach zero and
for which E A ([ai , bi ]) = I for every i = 1, 2, 3, . . .. By the Cantor Intersection Theorem (Theorem C, page 73, of [Simm1]),
i=1 [ai , bi ] = {c} for some c R. Since
E A ( R [ai , bi ] ) = 0 for each i, 0 = E A (
(R
[ai , bi ]) ) = E A ( R
i=1
i=1 [ai , bi ] ) =
A
A
E (R {c}). Thus, E ({c}) = I so again we have A = cI.
+
17
Corollary 1.2.2. Let A be an Abelian Lie group. Then every irreducible, unitary
representation of A on a complex, separable Hilbert space H is 1-dimensional.
Exercise 1.2.5. Prove Corollary 1.2.2.
It is the unitary representations of the Poincare group and its universal cover
(Section 2.4) that are of most interest to us, but to describe these we will need not
only all of the machinery of the next three sections, but also an explicit description
of the irreducible, unitary representations of SU(2). We will conclude this section
with the latter.
Example 1.2.1. The special unitary group SU(2) consists of all 2 2 complex maT
T
trices U that are unitary (UU = U U = id22 ) and have determinant det (U) = 1.
Every U SU(2) can be written uniquely in the form
1
2
U=
,
where , C and ||2 + ||2 = 1 (Lemma 1.1.3 of [Nab2]). The inverse of U is
given by
1
2
1
U =
.
18
(U)
2
1 2 1
21 2 1
2
z1
z1 + z2
z1
z
=U 1 =
=
.
z2
z2
z2
z1 + z2
Remark 1.2.2. We will feel free to regard the elements of Cn as either n-tuples
or column vectors of complex numbers and will write them as Z = (z1 , . . . , zn ) or
z1
Z = ... . In any case, the standard Hermitian inner product Z, W of Z and W in
zn
Cn is z1 w1 + + zn wn . Naturally, this turns Cn into a complex Hilbert space. With
the topology determined by the corresponding norm, Cn is homeomorphic to R2n
and is continuous.
There is another obvious representation of SU(2) on C2 called the conjugation
representation which matrix multiplies by U rather than U. We will denote this so
that
1 2
1 2 1
21 2 1
2
z
z
z1
z1 + z2
(U) 1 = U 1 =
=
.
z2
z2
z2
z1 + z2
We would like to show, however, that is not really anything new since it is unitarily
equivalent to . To prove this we will use some properties of the so-called Pauli spin
matrices which we introduce in the following Exercise. We will try to give some
sense of where they come from and why they are interesting.
Exercise 1.2.6. Denote by R3 the set of all 2 2 complex, Hermitian matrices X
T
(X = X) with trace zero (Trace (X) = 0) .
1. Show that every X R3 can be uniquely written as
1
2
x3
x1 ix2
X= 1
= x 1 1 + x 2 2 + x 3 3 ,
x + ix2 x3
where x1 , x2 and x3 are real numbers and
1
2
1
2
1
2
01
0 i
1 0
1 =
, 2 =
, 3 =
10
i 0
0 1
are the so-called Pauli spin matrices. Show that these are all unitary and equal to
their own inverses.
2. Show that, with the operations of matrix addition and (real) scalar multiplication,
/
0
R3 is a 3-dimensional, real vector space and 1 , 2 , 3 is a basis. Consequently,
R3 is linearly isomorphic to R3 . Furthermore, defining an orientation on R3 by
/
0
decreeing that 1 , 2 , 3 is an oriented basis, the map X (x1 , x2 , x3 ) is an
orientation preserving isomorphism when R3 is given its usual orientation.
19
3. Show that
1 2 = i3 , 2 3 = i1 , 3 1 = i2 , 1 2 3 = iI,
where I is the 2 2 identity matrix.
4. Show that 1 , 2 , 3 satisfy the commutation relations
[1 , 2 ] = 2i3 , [2 , 3 ] = 2i1 , [3 , 1 ] = 2i2 ,
where [ , ] denotes the matrix commutator ([A, B] = AB BA).
5. Show that 1 , 2 , 3 satisfy the anticommutation relations.
[i , j ]+ = 2i j I, i, j = 1, 2, 3,
where [ , ]+ denotes the matrix anticommutator ([A, B]+ = AB + BA) and i j is
the Kronecker delta. In particular, each i squares to the identity matrix.
6. Show that, if X = x1 1 + x2 2 + x3 3 and Y = y1 1 + y2 2 + y3 3 , then
1
[X, Y]+ = (x1 y1 + x2 y2 + x3 y3 )I.
2
Conclude that, if one defines an inner product X, YR3 on R3 by
1
[X, Y]+ = X, YR3 I,
2
/
0
then 1 , 2 , 3 is an oriented, orthonormal basis for R3 and R3 is isometric to
R3 . We will refer to R3 as the spin model of R3 .
7. Regard the matrices 1 , 2 , 3 as linear operators on C2 (as a 2-dimensional,
complex vector space with its standard Hermitian inner product) and show that
each of these operators has eigenvalues 1 with normalized, orthogonal eigenvectors given as follows.
1 2
1 2
1 1
1 1
1 :
,
2 1
2 1
1 2
1 2
1 1
1 1
2 :
,
2 i
2 i
1 2 1 2
1
0
3 :
,
0
1
8. Show that, for any U SU(2),
U = 2 U1
2
and conclude that the conjugation representation and the standard representation are unitarily equivalent.
20
21
(1.2)
forms a basis for the linear space V j so the map that sends a0 z1j +a1 z1j1 z2 +a2 z1j2 z22 +
+ a j z2j onto (a0 , a1 , a2 , . . . , a j ) is an isomorphism of V j onto C j+1 . One can now
move the topology and Hilbert space structure of C j+1 back to V j by this isomorphism. In particular, the basis (1.2) is orthonormal and the representation j is unitary. Not at all so clear, but true nonetheless, is that each j is irreducible and that,
up to unitary equivalence, these are all of the irreducible, unitary representations of
SU(2). The following result combines Theorems 5.37 and 5.39 of [Fol2].
Theorem 1.2.3. Each j : SU(2) GL(V j ), j 0, is an irreducible, unitary
representation of SU(2) and every irreducible, unitary representation of SU(2) is
unitarily equivalent to one and only one j .
Remark 1.2.3. 0 : SU(2) GL(C) is the trivial representation that sends every
U SU(2) to the identity idC on V0 = C.
Generally we will find it more convenient to simply identify
V j = C j+1
via the isomorphism described above and regard j as acting on the coecients
(a0 , a1 , a2 , . . . , a j ). Doing this in the notation we have employed up to this point is
rather cumbersome so we will introduce a new way of writing all of this that is
more convenient and also more in line with what one is likely to see in the physics
literature. We will describe this in the following example..
Example 1.2.2. A typical element of V1 is a homogeneous polynomial f (z1 , z2 ) =
a0 z1 + a1 z2 of degree 1 with complex coecients. We now prefer to write this in the
form
f (z1 , z2 ) =
2
!
A=1
A zA ,
22
U 1 U 2 1
.
= U = 1
2
U 2U 2
T
= 1 (U 1 z1 + U 1 z2 ) + 2 (U 2 z1 + U 2 z2 )
1
= (U 1 1 + U 2 2 )z1 + (U 1 1 + U 2 2 )z2
= A zA ,
where the transformed coordinates A are given by
1 2
1 12 1
U 1 U 1 2 1
2 .
= 2
2
2
U 1U 2
U 1 U 1 2 1
=
U 2 1 U 2 2 2
2
of SU(2) on C2 and it will simplify the notation if we deal with this equivalent
representation instead. We would now like to write this as
A = U A B B , A = 1, 2,
where B is summed over B = 1, 2.
Now notice that a typical element of V2 is a homogeneous polynomial a0 z21 +
a1 z1 z2 + a2 z22 of degree 2 with complex coecients and that this can be written in
the form
2
!
A1 ,A2 =1
A1 A2 z A1 z A2 ,
23
(1.3)
= U 1 U 2 U 1 U 2 ,
2
2
2
2
21
U 1U 2
U 1 U 2 21
22
22
where 21 = 12 and means the matrix tensor product. Now generalize.
24
Remark 1.2.4. It is common to denote the spinor representations D(s) , where s = j/2
is in {0, 12 , 1, 32 . . .} and refer to s as the spin of the representation.
Remark 1.2.5. One can think of an SU(2)-spinor intuitively in a way that is entirely
analogous to the old fashioned way of thinking about a tensor. One has assigned to
every orthonormal basis for C2 a set of 2 j complex components A1 A j , A1 , . . . , A j =
1, 2, totally symmetric in A1 . . . A j . If two such bases are related by U = (U A B )A,B=1,2
in SU(2), then the corresponding components are related by the transformation law
(1.3). It is not unreasonable to ask if such things are of any use to physics. Now, one
is familiar with a great many physical quantities that are represented mathematically
by tensors of various types (mass, velocity, momentum, elastic stress and so on) and
these tensors are simply the carriers of various representations of SO(3). But notice
that when j is even, that is, when the representation D( j/2) of SU(2) has integral
spin, D( j/2) (U) = D( j/2) (U) for every U SU(2). We will see in Section 2.4 that
SU(2) is the universal double cover of the rotation group SO(3) and will conclude
from this that integral spin representations of SU(2) descend to representations of
SO(3) and so their carriers can be identified with tensors. Half-integral spin representations of SU(2) do not descend to representations of SO(3) so their carriers are
not tensors and represent something new. That there are, indeed, physical quantities
that require such spinors for their mathematical description and that these quantities have unexpected and counterintuitive properties is the topic of Appendix B of
[Nab4].
Remark 1.2.6. If U is assumed to be only in SL(2, C) rather than the subgroup
SU(2), then (1.3) still determines an irreducible, unitary representation of SL(2, C)
and the carriers are called the components of a (contravariant, undotted) SL(2, C)spinor. In this case, however, these do not exhaust all of the irreducible, unitary
representations of SL(2, C). The representations of SL(2, C) are discussed in much
more detail in Chapter 3 and Appendix B of [Nab4].
is the ray in H containing v. The projectivization of H is the set of all such rays and
is denoted
The map
/
0
P(H) = [v] : v H {0} .
25
P : H {0} P(H)
P (v) = [v]
is a surjection and we provide P(H) with the quotient topology it determines. Stated
otherwise, P(H) is the quotient space of H {0} by the equivalence relation
defined by v1 v2 if and only if v2 = v1 for some C. The topology on P(H) is
Hausdor and P is an open mapping. There are two other useful ways of viewing
P(H).
/
0
1. Let S (H) = e H : eH = 1 be the unit sphere in H with its relative topology.
Define an equivalence relation on S (H) by e1 e2 if and only if e2 = e1 for
some C with || = 1. Then P(H) is homeomorphic to the quotient space
S (H)/ of S (H) by . In particular, one can think of P(H) as the space of unit
rays [e] = {e : C, || = 1} in H.
2. Let End(H) be the Banach space of all bounded linear operators on H with
the operator norm. For each v H {0}, let v End(H) be the orthogonal
projection of H onto [v]. Then v $ v is a continuous map of H{0} into End(H)
which depends only on [v] and therefore induces a continuous map of P(H) into
End(H). If the image of this map is given the relative topology from End(H),
then this is a homeomorphism of P(H) onto the image so we can identify
/
0
P(H) = v End(H) : v H {0} .
At this point we know only that P(H) is a Hausdor topological space, but it is,
in fact, a topological Hilbert manifold and, if H is infinite-dimensional, it is locally
homeomorphic to H. One sees this in the following way. Let e be a unit vector in
H and e its orthogonal complement in H. Then e is a closed linear subspace of
H and consequently it is a Hilbert space in its own right so P(e ) is defined. The
inclusion e * H induces a continuous embedding of P(e ) into P(H) and we will
identify P(e ) with its image. Thus, Ue = P(H) P(e ) is an open neighborhood
of P (e) in P(H) which we claim is homeomorphic to e (and this is isometrically
isomorphic to H if H is infinite-dimensional). Each element of Ue is a ray [v] for
which e, vH " 0 and we can define ([v]) e by
([v]) =
v e, vH e
.
e, vH
Note that ([v]) clearly depends only on [v] and is, indeed, in e .
Exercise 1.3.1. Show that is bijective with inverse given by
1 (w) = [e + w]
for every w e .
Both and 1 are continuous so is a homeomorphism. Since the unit vector e
was arbitrary the result follows.
26
Next we need to introduce the automorphism group Aut(P(H)) of P(H). Motivated by the physical interpretation in quantum mechanics (see page 235 of [Nab5])
we define, for any v, w H {0}, the transition probability ([v], [w]) from state [v]
to state [w] by
([v], [w]) = ([v], [w])P(H) =
| v, wH |2
.
v2H w2H
Notice that the right-hand side is independent of the representatives v and w chosen for the rays. Let H1 and H2 be two complex, separable Hilbert spaces. A map
T : P(H1 ) P(H2 ) is an isomorphism of their projectivizations if it is a homeomorphism and preserves transition probabilities in the sense that ([v], [w])P(H1 ) =
(T ([v]), T ([w]))P(H2 ) for all v, w H1 {0}. An automorphism of P(H) is an isomorphism of P(H) onto itself. The set of all such is denoted Aut(P(H)) and is a
group under composition. We will supply Aut(P(H)) with a topology shortly, but
first we need to describe an important result due to Wigner [Wig1]. For this we recall
that a map U : H1 H2 is anti-unitary if it is additive (U(v + w) = Uv + Uw for all
v, w H1 ), anti-linear (U(v) = U(v) for all C and all v H1 ), and satisfies
Uv, Uw2 = v, w1 for all v, w H1 . A map U that is either unitary or anti-unitary
induces an isomorphism T U on the projectivizations via T U ([v]) = [U(v)]. Wigners
Theorem asserts that every isomorphism T : P(H1 ) P(H2 ) arises in this way
from a map from H1 to H2 that is either unitary or anti-unitary. In fact, the result
is more general in that it does not require the topological assumptions. For a proof
of the following result one can consult [Barg]; [VDB] contains a shorter proof, but
one that assumes continuity.
Remark 1.3.1. Wigner defined a symmetry of the quantum system whose Hilbert
space is H to be a bijection T : P(H) P(H) that preserves transition probabilities. Note that no continuity assumption is made. The following result implies, in
particular, that every symmetry of a quantum system arises from an operator on H
that is either unitary or anti-unitary. One gets continuity (and much more) for free.
We will return to the discussion of symmetries and their physical significance in
Section 2.7.
Theorem 1.3.1. (Wigners Theorem on Symmetries) Let H1 and H2 be complex,
separable Hilbert spaces and T : P(H1 ) P(H2 ) a mapping of P(H1 ) into P(H2 )
satisfying
(T ([v]), T ([w]))P(H2 ) = ([v], [w])P(H1 )
for all [v], [w] P(H1 ). Then there exists a mapping U : H1 H2 satisfying
27
3. either
a. U(v) = U(v) and U(v), U(w)H2 = v, wH1 , or
b. U(v) = U(v) and U(v), U(w)H2 = v, wH1
for all C and all v, w H1 .
Furthermore, if H1 and H2 are of dimension at least 2 and T ([e1 ]) = T ([e2 ]), where
e1 and e2 are unit vectors in H1 , then there is a unique such mapping U : H1 H2
for which U(e1 ) = U(e2 ).
In particular, every automorphism T of P(H) arises from an operator U on H that
is either unitary or anti-unitary via T ([v]) = [Uv].
We denote by U(H) the group of unitary operators on H with the strong operator
topology. The set U (H) of anti-unitary operators, also with the strong operator
topology, is not a group, but the product (composition) of two anti-unitary operators
is unitary so the disjoint union U(H) + U (H) is a group under composition. With
the strong operator topology, U(H) + U (H) is a Hausdor topological group and
U(H) is a closed and open subgroup (the component containing the identity operator
I). We identify the group S 1 of complex numbers of modulus one with a subgroup
of U(H) + U (H) via the map z $ zI for z S 1 . The following result gives an
explicit description of Aut(P(H)) and is the tool we need to provide Aut(P(H))
with a natural topology.
Proposition 1.3.2. Let H be a complex, separable Hilbert space of dimension at
least two. Then the map
Q : U(H) + U (H) Aut(P(H))
defined by
Q(U)([v]) = U([v]) = C(Uv)
is a surjective group homomorphism with kernel S 1 . Consequently, Aut(P(H)) is
isomorphic as a group to the quotient [U(H) + U (H)]/S 1 .
Proof. Q is a homomorphism because
Q(U1 U2 )([v]) = [U1 U2 )(v)] = [U1 (U2 v)] = Q(U1 )([U2 v])
= Q(U1 )(Q(U2 )([v])) = Q(U1 ) Q(U2 )([v]).
Q is surjective by Wigners Theorem 1.3.1. The kernel of Q contains S 1 because,
for every z S 1 ,
Q(zI)([v]) = [(zI)(v)] = [zv] = [v].
Finally, suppose that U is in the kernel of Q. Then Q(U) = idP(H) . Now select an
arbitrary unit vector e in H. Then [Ue] = Q(U)([e]) = [e]. Since U is either unitary
28
+
Identifying Aut(P(H)) with [U(H) + U (H)]/S 1 we provide it with the quotient
topology determined by Q and one can show that it thereby acquires the structure of
a Hausdor topological group.
Now let G be an arbitrary Hausdor topological group. A projective representation of G on the complex, separable Hilbert space H is a continuous group homomorphism : G Aut(P(H)) of G into the automorphism group Aut(P(H)).
Two projective representation j : G Aut(P(H j )), j = 1, 2, of G are equivalent if there is an isomorphism T : P(H1 ) P(H2 ) of P(H1 ) onto P(H2 )
such that 2 (g) T = T 1 (g) for every g G. Notice that any representation
: G U(H) + U (H) of G into U(H) + U (H) gives rise to projective representation of G by composing with Q
= Q .
It is not the case, however, that every projective representation arises in this way. We
will say that : G Aut(P(H)) lifts if there exists a continuous homomorphism
: G U(H) + U (H) for which = Q ;
is then called a lift of . As with
any lifting problem for continuous maps the existence of such lifts is a topological
issue. For connected, simply connected Lie groups G the obstruction is the second
Lie algebra cohomology H 2 (g; R).
Remark 1.3.2. Lie algebra cohomology can be introduced in a variety of ways.
A very thorough discussion together with a number of applications to physics is
available in [deAI]. We will not pursue this here since our interest in the general
result is limited to the fact that it implies a theorem of Bargmann (Theorem 2.7.1)
that we will state, but not prove in Section 2.7. Nevertheless, we will, for the record,
formulate the result precisely. The following is Theorem 4, Section 2, of [Simms]
and Corollary 3.12 of [VDB].
Theorem 1.3.3. Let G be a connected, simply connected Lie group whose Lie algebra g satisfies H 2 (g; R) = 0 and let H be a complex, separable Hilbert space. Then
every projective representation : G Aut(P(H)) of G on P(H) lifts to a unitary
representation : G U(H) of G on H.
29
Notice that, under the circumstances described in the Theorem, the lift actually
maps into the unitary subgroup U(H) of U(H) + U (H).
P is the bundle space or total space, X is the base space, H is the structure group
and is the projection of the principal H-bundle : P X. It follows from local triviality that, for each x X, the fiber 1 (x) above x is a submanifold of P
dieomorphic to H. One thinks of P as a family of copies of H parametrized by
the points of X and glued together topologically in such a way as to achieve local
triviality. For a given X and H there are generally many ways to do this gluing and
it is sometimes possible to classify them (see Section 6.4 of [Nab3]). If, in the local
triviality condition, one can take U to be all of X, then the principal H-bundle is said
to be trivial. One can show that any principal H-bundle over any Rn is necessarily
trivial. If U is an open subset of X, then a smooth map s : U 1 (U) P for
30
(1.4)
31
by ([p, v]) = (p). This is well-defined because (ph) = (p), surjective because
is surjective, and continuous because its composition with q is continuous. For
any x X the fiber of above x is just 1
(x) = {[p, v] : v H}, where p is
any fixed point in 1 (x). Each of these fibers can be supplied with the structure of
a complex, separable Hilbert space isomorphic to H by defining [p, v1 ] + [p, v2 ] =
[p, v1 + v2 ], a[p, v] = [p, av] and [p, v1 ], [p, v2 ] = v1 , v2 H . All of this is easy
to check. It takes just a bit more work to show that : P H X satisfies a
(topological) local triviality condition, that is, that for each x0 X there is an open
1
neighborhood U of x0 in X and a homeomorphism : 1
(U) U H of (U)
onto U H of the form ([p, v]) = ( ([p, v]), ([p, v]) ) = ( (p), ([p, v]) ), where
the restriction |1
of to each fiber of is a linear homeomorphism. To see
(x)
where comes from begin with a local trivialization : 1 (U) U H of the
principal bundle : P X. Let s : U 1 (U) be the natural associated section
1
defined by s(x) = 1 (x, e). Now define 1 : U H 1
(U) by (x, v) =
[s(x), v]. The we are looking for is the inverse of this.
With the structure we have just described : P H X is an example of a
C 0 -Hilbert bundle over X. A (global) section of : P H X is a continuous
map s : X P H with the property that s = idX . One can show (see
Section 6.8 of [Nab2] and Remark 1.4.1 below) that these sections are in one-to-one
correspondence with continuous maps f : P H that are equivariant with respect
to the given actions of H on P and H, that is, that satisfy
f (p h) = h1 f (p) = (h)1 ( f (p)).
For the time being we will identify continuous sections of : P H X with
these continuous, equivariant, H-valued maps on P.
Remark 1.4.1. In Section 2.8 we will need to undo this identification and convert
the equivariant maps to sections so we will note for the record that the section s f :
X P H corresponding to the equivariant map f : P H is given by
s f (x) = [p, f (p)],
where p is any point in 1 (x).
With this review behind us we can return to the problem of describing induced
representations. Thus, we let G be a Lie group, H a closed subgroup of G, and
: H U(H) a strongly continuous, irreducible, unitary representation of H on
the complex, separable Hilbert space H. We would like to construct from this data
a strongly continuous, irreducible, unitary representation of G on some complex,
separable Hilbert space H . We now know that : G G/H is a principal Hbundle over G/H in which the right action of H on G is just right multiplication by
elements of H. This, together with : H U(H), then gives rise to an associated
C 0 -Hilbert bundle : G H G/H. The Hilbert space H will be the space
of L2 -sections of : G H G/H. Naturally, this requires that we be able to
integrate over G/H. Now G, being a locally compact group, has a left-invariant Haar
32
Notice that, since is unitary, f (gh)H = f (g)H for all g G and all h H
so f (g)H depends only on [g]. We can therefore define a real-valued function on
G/H that we will denote f ([g]) by
f ([g]) = f (g)H ,
where g is any element of [g]. Integrating with respect to the G-invariant measure
on G/H we define
f =
+6
G/H
f ([g])2 d([g])
, 12
This is a norm on C0E (G, H) and arises from the inner product
6
f1 , f2 =
f1 ([g]), f2 ([g])d([g]),
G/H
where f1 ([g]), f2 ([g]) = f1 (g), f2 (g)H ; this also depends only on [g]. Since the
elements of C0E (G, H) are continuous, it is not complete with respect to this norm so
we take H to be its Hilbert space completion. The elements of this completion can
be identified with Borel measurable functions f : G H, modulo equality almost
everywhere with respect to Haar measure on G, that satisfy
1. f (gh) = (h)1 ( f (g)) for all h H and almost all g G, and
2.
G/H
We can now fulfill our stated objective. Thus, we let G be a Lie group, H a
closed subgroup of G, and : H U(H) a strongly continuous, irreducible,
unitary representation of H on the complex, separable Hilbert space H. Then the
representation of G induced by H and is denoted
IndGH () : G U(H )
and acts by left translation on the elements of H , that is,
88
33
9 9
IndGH ()(g) f (g ) = f (g1 g ).
(n, h)
= (h1 n1 , h1 ).
Exercise 1.5.2. Verify the group axioms and show that the maps
n $ (n, 1H ) : N N ! H
34
and
h $ (1N , h) : H N ! H
are embeddings so that we can (and will) identify N and H with subgroups of N! H.
Notice also that, when is the trivial homomorphism that sends everything to the
identity, the semi-direct product reduces to the usual direct product N H of the
groups N and H.
Remark 1.5.1. When the action of H on N is clear from the context we will often
omit the subscript and write N ! H simply as N ! H.
Now notice that
1
(1N , h)(n, 1H )(1N , h)1 = (1N (h n), h1H )(h1 11
N ,h )
= (h n, h)(1N , h1 )
= ((h n)1N , hh1 )
= (h n, 1H )
= ((h)(n), 1H ).
If we identify N and H with subgroups of G = N ! H in the manner described in
the previous exercise this simply says that the action of H on N is by conjugation in
G, that is,
h n = hnh1 .
Exercise 1.5.3. Identify N and H with subgroups of G = N ! H and show that
1. N is a normal subgroup of G,
2. NH = G, and
3. N H = {1G }.
Show also that the last two conditions imply that every element g of G can be written
uniquely as g = nh with n N and h H.
Turning matters around let us suppose that G is an arbitrary group and N and H
are subgroups of G satisfying the conditions
1. N is a normal subgroup of G,
2. NH = G, and
3. N H = {1G }.
Then G is said to be the internal semi-direct product of N and H. One can then
define an action of H on N by conjugation, that is, define : H Aut(N) by
(h)(n) = hnh1 .
35
36
1
x 1 2
x2
x
3 =
1
x
1
1
R 1
R2
Ra
= 3 1
01
R 1
0
R1 2
R2 2
R3 2
0
R1 3
R2 3
R3 3
0
a1
a2
,
a3
1
R1 a1
0 1
21
2 1
2
R2 a2
R1 R2 R1 a2 + a1
=
0 1
0
1
so we can identify ISO(3) with G and its action on R3 with matrix multiplication.
G is a matrix Lie group of dimension 6. Its Lie algebra can be identified with
the set of 4 4 real matrices that arise as velocity vectors to curves in G through
the identity with matrix commutator as bracket. We find a basis for this Lie algebra
(otherwise called a set of generators) by noting that if
1
2
id33 ta
a (t) =
,
0 1
then
a (0)
0a
=
00
and if
N (t) =
then
N (0) =
2
etN 0
,
0 1
2
N0
.
0 0
(see Theorem A.2.2 for N). Taking a = (1, 0, 0), (0, 1, 0), (0, 0, 1) and n =
(1, 0, 0), (0, 1, 0), (0, 0, 1) (again, see Theorem A.2.2 for n ) we obtain a set of six
generators for the Lie algebra iso(3) of ISO(3) that we will write as follows.
0
0
N1 =
0
0
0 0
0 1
1 0
0 0
0
0
0
0
, N2 =
0
1
0
0
0 1 0
0 1 0 0
1 0 0 0
0 0 0
, N3 =
0 0 0
0 0 0 0
000
0 0 00
0
0
P1 =
0
0
0
0
0
0
0
0
0
0
1
0
0
0
, P2 =
0
0
0
0
37
0
0
0
0
0
0
0
0
0
0
0
1
, P3 =
0
0
0
0
0
0
0
0
0
0
0
0
1
0
N1 , N2 , and N3 are called the generators of rotations, while P1 , P2 , and P3 are called
the generators of translations.
We will write [A, B] = AB BA for the matrix commutator. Using i jk for the
Levi-Civita symbol (1 if i j k is an even permutation of 1 2 3, 1 if i j k is an odd permutation of 1 2 3 and 0 otherwise) we record the following commutation relations
for these generators, all of which can be verified by simply computing the matrix
products.
[Ni , N j ] = i jk Nk , i, j = 1, 2, 3
[Pi , P j ] = 0, i, j = 1, 2, 3
[Ni , P j ] = i jk Pk , i, j = 1, 2, 3
The following result on semi-direct products of Lie groups is proved in Chapter
IV, Section XV, of [Chev].
Theorem 1.5.1. Let N and H be Lie groups. Then, with the compact-open topology,
Aut(N) is also a Lie group and so any continuous group homomorphism : H
Aut(N) is smooth. The corresponding semi-direct product N ! H is also a Lie group
.
Our final objective in this section is to apply the construction of induced representations (Section 1.4) to certain semi-direct products of Lie groups in order to
describe the so-called Mackey machine for manufacturing all of the irreducible, unitary representations of such groups. Our primary goal is to apply this machine to the
Poincare group and its universal double cover in Section 2.8.
We will consider two Lie groups N and H, where N is assumed Abelian, and
a continuous (and therefore smooth) homomorphism : H Aut(N) of H into
Aut(N). As usual, we will write (h)(n) = h n. Then
G = N ! H
is also a Lie group. It will be convenient to identify both N and H with closed
subgroups of G so that N is a normal subgroup, G = NH, N H = {e}, and the
action of H on N is by conjugation
h n = hnh1 .
An important role in the construction will be played by the irreducible, unitary
representations of N. Since N is Abelian these are all 1-dimensional (Corollary
38
1.2.2) and are described by the characters of N. We will pause for a moment to
provide some background information.
Remark 1.5.2. In our present circumstances N is an Abelian Lie group, but the discussion that follows requires only that N be a locally compact, Hausdor, Abelian
topological group. A character of N is a continuous homomorphism : N S 1
from N to the group of complex numbers of modulus one. Each of these determines a
unitary representation U of N on C defined by U (n)(z) = (n)z. The set of all char Under pointwise multiplication ((1 2 )(n) = 1 (n)2 (n)) N
acters of N is denoted N.
is an Abelian group called the character group, or dual group of N. Regard N as a
subset of the space C 0 (N, S 1 ) of continuous maps from N to S 1 with the compactopen topology and provide it with the subspace topology. N thereby becomes a
second countable, locally compact, Hausdor, Abelian topological group. The local compactness is not at all obvious (see page 89 of [Fol2]). If N1 , . . . , Nk are locally compact, Abelian groups, then so is N1 Nk and the character group of
N1 Nk is isomorphic, as a topological group, to N1 Nk (see Proposition
4.6 of [Fol2]). An isomorphism from N1 Nk to the dual group of N1 Nk
is given by (1 , . . . , k ) $ , where
(n1 , . . . , nk ) = 1 (n1 ) k (nk ).
Example 1.5.2. We will first find all of the characters of the additive group R and
is isomorphic to R. Thus, we let : R S 1 be a continuous group
show that R
homomorphism so that (x1 + x2 ) = (x1 )(x2 ) and (0) = 1. Then there exists an
> 0 such that ([, ]) is contained in Re(z) > 0. Since p [ 2 , 2 ] p
[ 2 , 2 ] there is a unique p [ 2 , 2 ] such that
() = eip .
Now, () = ( 2 + 2 ) = ( 2 )2 and therefore
+ ,
2
= eip(/2)
because eip(/2) does not have positive real part. Iterating gives
+ ,
n
n = eip(/2 ) , n = 0, 1, 2, . . .
2
Thus, for any k Z,
k
2n
+k ,
n
= eip(k/2 ) .
2n
39
(x) = eipx x R.
there exists a unique p in R such that
Exercise 1.5.5. Show that, for each R,
ipx
(x) = e for every x R. Hint: Repeat the argument above for any with
0 < < .
defined by (p) = eipx is a group isomorConsequently, the map : R R
phism. All that remains is to show that it is also a homeomorphism and therefore
an isomorphism of topological groups. To show that is continuous it is enough to
prove continuity at the identity p = 0 in R. A basic open neighborhood of (0) = 1
can be described as follows. Let r be a posiin the compact-open topology on R
tive real number and n a positive integer. Define Ur = {z C : |1 z| < r} and
Kn = [n, n] R. Then
: (Kn ) Ur } = {eipx : |1 eipx | < r |x| n}
U(Kn , Ur ) = { R
4
+ px ,
5
1 (U(Kn , Ur )) = p R : 4 sin2
< r2 |x| n
2
" #
so p is in 1 (U(Kn , Ur )) if and only if |p| < 2n arcsin 2r and this is an open neighborhood of 0 R. This proves the continuity of .
R is continuous.
Exercise 1.5.6. Show that 1 : R
We conclude that, as topological groups,
! R.
R
via the map
p R $ eipx R.
Applying the result on products quoted above we find that
:k ! R
k R
=R
k ! Rk
R
++pk xk )
40
(h )(n) = (h1 n) n N.
Now fix some 0 N and consider the orbit
O0 = H 0 = {h 0 : h H}
Also let
of 0 under the H-action on N.
H0 = {h H : h 0 = 0 }
be the isotropy subgroup of 0 with respect to this H-action. This is a closed subgroup of H and therefore also a Lie group. Physicists refer to H0 as the little group
at 0 . Notice that if 0 is another point in O0 with, say, 0 = h0 0 , then the orbit of 0
coincides with the orbit of 0 and the isotropy subgroups H0 and H0 are conjugate
(H0 = h0 H0 h1
0 ) and therefore isomorphic as Lie groups. In particular, any unitary
representation of H0 is unitarily equivalent to the unitary representation of H0
1
the
defined by (h0 hh1
for h H0 . On a fixed orbit in N,
0 ) = (h0 )(h)(h0 )
little groups all have the same unitary representations, up to unitary equivalence.
Remark 1.5.3. There is a technical assumption we will need in order to get the full
force of Mackeys theorem on irreducible, unitary representations of G = N ! H.
the orbit O0
We will say that the semi-direct product G is regular if, for each 0 N,
meaning that for any O0 there is an open neighborhood
is locally closed in N,
V of in N such that O0 V is closed in V. Certainly this is the case if all of the
as they will be for the examples of most interest to us.
orbits are closed in N,
Now, suppose we are given a strongly continuous, irreducible, unitary representation of the little group H0 on some separable, complex Hilbert space H. Assuming that H/H0 admits an H-invariant measure (as it will in our examples) we can
follow the procedure described in Section 1.4 to produce an induced representation
IndHH () : H U(H )
0
For each orbit O0 of H in N and each strongly continuous, irreducible, unitary representation of the little group H0 on some separable, complex Hilbert space H,
41
Mackey proves that LO0 , is a strongly continuous, irreducible, unitary representation of G = N ! H on H . Indeed, much more is true. The following result of
Mackey is quite deep and we will simply refer to Theorem 6.24 of [Vara] for the
details.
Theorem 1.5.2. (Mackeys Theorem) LO0 , is a strongly continuous, irreducible,
unitary representation of G = N ! H on H . If G is a regular semi-direct product,
then every strongly continuous, irreducible, unitary representation of G is unitarily
equivalent to some LO0 , . Furthermore, LO , is unitarily equivalent to LO0 , if
0
and only if O0 = O0 and is unitarily equivalent to .
Here then is the Mackey machine for computing all of the irreducible, unitary
representations of the regular semi-direct product G = N ! H of two Lie groups N
and H when N is Abelian. Identify N and H with subgroups of G so that G = NH
and the action of H on N is by conjugation.
1. Select an orbit O of the H-action on the character group N of N and an arbitrary
point 0 in O so that O = O0 .
2. Select an H-invariant measure on H/H0 , where H0 is the isotropy subgroup
of 0 in H.
Note: In general, such a measure need not exist and it is not really necessary for
the operation of the Mackey machine, but for the examples of interest to us we
will explicitly construct them.
3. Select a strongly continuous, irreducible, unitary representation : H0 U(H)
of the isotropy subgroup H0 of 0 on some complex, separable Hilbert space H.
4. Construct the C 0 -Hilbert bundle associated with H, H0 , and as follows: :
H H/H0 is an H0 -principal bundle, where the right action of H0 on H is
right multiplication. This, together with the representation : H0 U(H)
determines an associated C 0 -vector bundle
: H H H/H0
over H/H0 with H-fibers.
5. The L2 -sections of the vector bundle : H H H/H0 with respect to
the measure form a Hilbert space H that can be identified with the space of
Borel measurable functions f : H H, modulo equality almost everywhere
with respect to Haar measure on H, satisfying
42
H/H0
9 9
IndHH ()(h) f (h ) = f (h1 h ).
0
Chapter 2
Minkowski Spacetime
2.1 Introduction
Quantum Field Theory (QFT) arose from attempts to reconcile quantum mechanics
with the special theory of relativity. In this chapter we will attempt to provide the
background in relativity required to understand the challenges that such a reconciliation must confront and how the formalism of QFT proposes to deal with them. We
will discuss only those aspects of special relativity that are directly relevant to this
objective. Our primary source for this material and our primary reference for a more
comprehensive introduction to special relativity is [Nab4]. In particular, Section 2.2
is essentially the Introduction to [Nab4].
2.2 Motivation
Minkowski spacetime is generally regarded as the appropriate arena within which
to formulate those laws of physics that do not refer specifically to gravitational phenomena. We would like to spend a moment here at the outset briefly examining
some of the circumstances that give rise to this belief.
We shall adopt the point of view that the basic problem of science in general is
the description of events that occur in the physical universe and the analysis of relationships between these events. We use the term event, however, in the idealized
sense of a point-event, that is, a physical occurrence that has no spatial extension
and no duration in time. One might picture, for example, an instantaneous collision
or explosion, or an instant in the history of some material point particle or photon.
In this way the existence of a point particle or photon can be represented by a continuous sequence of events called its worldline. We begin then with an abstract set
M whose elements we call events. We will provide M with a mathematical structure that reflects certain simple facts of human experience as well as some rather
nontrivial results of experimental physics.
43
44
2 Minkowski Spacetime
2.2 Motivation
45
Remark 2.2.1. I didnt make this up. The experiment was first performed by J.C.
Hafele and R.E. Keating in 1971 (see [HK]).
To avoid this diculty we will ask our admissible observers to build their clocks
at the origins of their coordinate systems, transport them to the desired locations,
set them down and return to the master clock at the origin. We assume that each
observer has stationed an assistant at the location of each transported clock. Now
our observer must communicate with each assistant, telling him the time at which
his clock should be set in order that it be synchronized with the clock at the origin. As a means of communication we choose a signal which seems, among all the
possible choices, to be least susceptible to annoying fluctuations in reliability, that
is, light signals. To persuade the reader that this is an appropriate choice we will
record some of the experimentally documented properties of light signals. First,
however, a little experiment. From his location at the origin O an observer O emits
a light signal at the instant his clock reads t0 . The signal is reflected back to him
at a point P and arrives again at O at the instant t1 . Assuming there is no delay at
P when the signal is bounced back, O will calculate the speed of the signal to be
distance(O, P)/ 12 (t1 t0 ), where distance(O, P) is computed from the Cartesian coordinates of P in (x1 , x2 , x3 ). This technique for measuring the speed of light we
call the Fizeau procedure in honor of the gentleman who first carried it out with
care.
Remark 2.2.2. Notice that we must bounce the signal back to O since we do not yet
have a clock at P that is synchronized with that at O.
For each admissible observer the speed of light in vacuo as determined by the
Fizeau procedure is independent of when the experiment is performed, the arrangement of the apparatus (that is, the choice of P), the frequency (energy) of the signal
and, moreover, has the same numerical value c (approximately 3.0 108 meters per
second) for all such observers.
Here we have the conclusions of numerous experiments performed over the
years, most notably those first performed by Michelson-Morley and KennedyThorndike (see Exercises 33 and 34 of [TW] for a discussion of these experiments).
The results may seem odd. Why is a photon so unlike an electron (or a baseball)
whose speed certainly will not have the same numerical value for two observers in
relative motion? Nevertheless, they are incontestable facts of nature and we must
deal with them. We will exploit these rather remarkable properties of light signals
immediately by asking all of our observers to multiply each of their time readings
by the constant c and thereby measure time in units of distance (light travel time).
For example, one meter of time is the amount of time required by a light signal to
travel one meter in vacuo. With these units, all speeds are dimensionless and c = 1.
. . . will be denoted x0 ( = ct ), x0 ( = ct ), . . . .
Such time readings for observers O, O,
Now we provide each of our admissible observers with a system of synchronized
clocks in the following way. At each point P of his spatial coordinate system, place
46
2 Minkowski Spacetime
a clock identical to that at the origin. At some time x0 at O emit a spherical electromagnetic wave (photons in all directions). As the wavefront encounters P set the
clock placed there at time x0 + distance(O, P) and set it ticking, thus synchronized
with the clock at the origin.
. . . has established a frame of reference
At this point each of our observers O, O,
0 1 2 3
0 1 2 3
S(x ) = S(x , x , x , x ), S( x ) = S( x , x , x , x ), . . .. A useful intuitive visualization of such a reference frame is a lattice work of spatial coordinate lines with, at
each lattice point, a clock and an assistant whose task it is to record locations and
times for events occurring in his immediate vicinity; the data can later be collected
for analysis by the observer.
(2.1)
(2.2)
(2.3)
2.2 Motivation
47
3. Linear transformations
x = x , = 0, 1, 2, 3,
(2.4)
where the matrix = ( ),=0,1,2,3 satisfies T = , where T means transpose, = ( ),=0,1,2,3 is the matrix
and 0 0 1.
1
0
0
0
0 0 0
1 0 0
,
0 1 0
0 0 1
Remark 2.2.3. Notice that it is not even assumed at the outset that F is continuous
(much less ane). A proof of Zeemans Theorem is available either in [Z] or in
Section 1.6 of [Nab4]. We should point out that this result was actually proved in
1950 by Alexandrov [Alex], but the paper was in Russian and, sadly, was not widely
known in the West.
Since two frames of reference related by a mapping of type (2) dier only by
a rather trivial and unnecessary change of scale, we shall banish them from further consideration. In some circumstances one can adopt a similar attitude toward
mappings of type (1). The constants a , = 0, 1, 2, 3, can be regarded as the Scoordinates of Ss spacetime origin and we may request that all of our observers cooperate to the extent that they select a common event to act as origin, in which case
a = 0, = 0, 1, 2, 3. This is particularly useful when one is interested in purely geometrical questions. On the other hand, in classical mechanics translations in space
and time are important symmetries with important conservation laws (momentum
and energy) so, when the issue is dynamics, translations will play a fundamental
role (see Sections A.2 and A.3).
The (linear) coordinate transformations of type (3), which we will soon christen orthochronous Lorentz transformations, contain all of the novel kinematic
features of special relativity (time dilation, length contraction, etc.). We will
find that these are precisely the linear maps that leave invariant the quadratic form
(x0 )2 (x1 )2 (x2 )2 (x3 )2 (analogous to orthogonal transformations of R3 , which
leave invariant the usual squared length x2 + y2 + z2 ) and preserve time orientation
in the sense described in our Causality Assumption. We will see that any such
has determinant 1. Those with det = 1 are called proper and the collection
of all proper, orthochronous Lorentz transformations is denoted L+ . We will show
that L+ forms a group under matrix multiplication (that is, composition). The group
generated by L+ and the translations of R4 in (1) is called the Poincare group and is
denoted P+ .
The mathematical structure that appears to be emerging for M is that of a 4dimensional real vector space with a distinguished quadratic form and a group of
48
2 Minkowski Spacetime
49
e , e M =
0, if "
=
1, if = = 0
1, if = = 1, 2, 3
, M is called the Lorentz inner product or Minkowski inner product on M and any
such basis is said to be M-orthonormal, or simply orthonormal.
Remark 2.3.1. Unless some confusion is likely to arise we will tend to write this
and any other inner product that is likely to arise simply as , .
The quadratic form corresponding to , is the map Q : M R defined by
Q(x) = x, x
for all x M. The points in M are called events.
Remark 2.3.2. As is customary in linear algebra we will have no qualms about
blurring the distinction between a point in M and a vector in M since the context
invariably makes clear which geometrical picture we have in mind. If one has moral
objections to this the appropriate course is to define Minkowski spacetime as an
ane space containing the points and distinguish it from the corresponding vector
space of displacement vectors determined by two points (a tip and a tail).
An x M is said to be spacelike, timelike, or null if Q(x) is < 0, > 0, or = 0,
respectively. We introduce a 4 4 matrix defined by
= ( ),=0,1,2,3
1
0
=
0
0
0 0 0
1 0 0
.
0 1 0
0 0 1
The inverse of is the same matrix, but we will denote its entries by .
1 = ( ),=0,1,2,3
1 0
0 1
=
0 0
0 0
0 0
0 0
1 0
0 1
50
2 Minkowski Spacetime
C+T (x0 ) = {x M : x x0 T + }
CT (x0 ) = {x M : x x0 T }.
Similarly, the null cone CN (x0 ), future null cone C+N (x0 ), and past null cone CN (x0 )
at x0 are given by
51
CN (x0 ) = {x M : x x0 is null}
There is no notion of time orientation for spacelike vectors and the set of spacelike
vectors is not a cone so, at each x0 M, one simply defines
E(x0 ) = {x M : x x0 is spacelike}
and calls it elsewhere.
With the choice of T + we have provided M with what is called a time orientation.
Independently of this we will also select some orientation for the vector space M,
that is, some equivalence class of ordered bases for M. Henceforth we will consider
only orthonormal bases {e0 , e1 , e2 , e3 } for M that are in the chosen orientation class
and for which e0 is timelike and future directed. Such a basis is called an admissible
basis for M. The coordinates (x0 , x1 , x2 , x3 ) of an event x in M relative to such a
basis are identified with the spatial (x1 , x2 , x3 ) and time (x0 ) coordinates of the event
provided by some admissible observer (see Section 2.2). The choice of such a basis
identifies the vector space M with the vector space R4 and transfers the Lorentz
inner product to R4 . To emphasize the signature of the Lorentz inner product we
will write this copy of R4 as R1,3 .
M ! R1,3
Although the inner product is dierent, the topology and dierentiable structure of
R1,3 are taken to be the usual ones of R4 .
Remark 2.3.3. It is not unreasonable to argue that the Euclidean topology for R1,3
does not make a great deal of physical sense since the Euclidean inner product has no
invariant physical interpretation for all admissible observers. Alternative topologies
have been proposed and one of them is discussed in some detail in Appendix A of
[Nab4]. Even if one acquiesces to the choice of the Euclidean topology, the dierentiable structure is not determined. Deep results of Michael Freedman on topological
4-manifolds and Simon Donaldson on smooth 4-manifolds combine to show that
R4 admits (many) non-dieomorphic dierentiable structures. This cannot occur
for any Rn of dimension n " 4.
The linear subspace spanned by e0 is called the time axis of the corresponding
admissible observer and its orthogonal complement
(e0 ) = {x M : e0 , x = 0} = Span {e1 , e2 , e3 }
is the observers space. We orient (e0 ) by decreeing that {e1 , e2 , e3 } is an oriented
basis. We will feel free to view Span {e0 } and Span {e1 , e2 , e3 } either in M or in R1,3 .
If {e }=0,1,2,3 and {e }=0,1,2,3 are two orthonormal bases for M, then there exists
a unique linear transformation L : M M such that Le = e and this satisfies
52
2 Minkowski Spacetime
Lx, Ly = x, y
for all x, y M. Such a linear transformation is called an M-orthogonal transformation, or simply an orthogonal transformation. If x M and if we write x = x e and
x = x e , then (x ) and ( x ) are interpreted as the spacetime coordinates of the event
x in the two frames of reference corresponding to {e }=0,1,2,3 and {e }=0,1,2,3 . We
are interested in the coordinate transformation relating these two sets of coordinates.
For this we write
e = e , = 0, 1, 2, 3
for some real numbers , , = 0, 1, 2, 3. Then the orthogonality conditions
e , e = can be written
= , , = 0, 1, 2, 3,
(2.5)
= , , = 0, 1, 2, 3.
(2.6)
or, equivalently,
0
0
1
= ( ) = 2 0
0
3 0
0 1
1 1
2 1
3 1
0 2
1 2
2 2
3 2
0 3
1
3
.
2 3
3 3
Then the orthogonality conditions (2.5) or, equivalently, (2.6) can be written
T = ,
(2.7)
where T means transpose. The coordinate transformation from unhatted to hatted coordinates is just the matrix product
0 0
x 0
x1 1 0
2 = 2
x 0
x3
3 0
0 1
1 1
2 1
3 1
0 2
1 2
2 2
3 2
0 3 x0
1 3 x1
2 3 x2
3 3 x3
x = x , = 0, 1, 2, 3.
(2.8)
Any real, 4 4 matrix satisfying (2.7) is called a (general) Lorentz transformation. Any such satisfies det = 1 and either 0 0 1 or 0 0 1. To ensure
that the basis {e }=0,1,2,3 has the proper orientation we will consider only Lorentz
transformations that are proper, that is, satisfy
53
det = 1.
To ensure that e 0 is future directed we assume also that is orthochronous, that is,
satisfies
0 0 1.
Remark 2.3.4. Orthochronous Lorentz transformations actually preserve the time
orientation (future directed or past directed) of all timelike and nonzero null vectors
(see Theorem 1.3.3 of [Nab4]).
The set of all proper, orthochronous Lorentz transformations is denoted L+ .
Exercise 2.3.1. Show that L+ is a group under matrix multiplication. Hint: Show
that any (general) Lorentz transformation has an inverse given by
1 = T .
We will denote the entries of the matrix 1 by , where labels the column
and labels the row. Thus,
0 0 0 0 0
0 1 2 3 0 1 0 2 0 3 0
1 1 1 1 0 1 2 3
1
1
1 = 0 2 1 2 2 2 3 2 = 0 1 1 1
2
3
2
2
0 3 1 3 2 3 3 3 0 2 1 2
3 3 2 3 3 3
0 1 2 3
1 0
0 R1
1
R =
0 R2 1
0 R3 1
0
R1 2
R2 2
R3 2
0
1
R 3
,
R2 3
3
R3
54
2 Minkowski Spacetime
where (Ra b )a,b=1,2,3 is an element of the rotation group SO(3). The coordinate transformation associated with R corresponds physically to a rotation of the spatial coordinate axes within a given frame of reference. R is called the rotation subgroup of
L+ . The following is Lemma 1.3.4 of [Nab4].
Theorem 2.3.4. Let = ( ),=0,1,2,3 be a proper, orthochronous Lorentz transformation. Then the following are equivalent.
1.
2.
3.
4.
is a rotation in L+ .
0 1 = 0 2 = 0 3 = 0
1 0 = 2 0 = 3 0 = 0
0 0 = 1
Exercise 2.3.4. Let and be real numbers with > 0 and 2 2 = 1. Show that
,
is in L+ .
=
0
0
0 0
0 0
0 1 0
001
Taking = cosh and = sinh for some real number > 0 in Exercse 2.3.4
one obtains what is called a boost in the x1 -direction.
cosh sinh 0 0
sinh cosh 0 0
.
1 () =
0
1 0
0
0
0
0 1
The physical interpretation of the corresponding coordinate transformation is discussed in some detail on pages 21-27 of [Nab4]. It describes the relationship between the spacetime coordinates in two admissible frames of reference S and S
for which the spatial coordinate axes initially coincide and remain parallel, but for
which those of S move in the positive direction along the common x1 , x1 -axis with
speed = tanh . One can define boosts in the x2 - and x3 -directions in an entirely
analogous way. Their matrices are
cosh
0
2 () =
sinh
0
0 sinh 0
cosh
0
1
0 0
, 3 () =
0 cosh 0
0
0
0
1
sinh
0
1
0
0
0 sinh
0 0
.
1 0
0 cosh
55
Remark 2.3.5. These boosts along the spatial coordinate axes of a given frame of
reference are generally called special Lorentz transformations and we have written
them in what is called hyperbolic form. They are more traditionally written in terms
of the relative velocity = tanh of the two frames and = (1 2 )1/2 in which
case cosh = and sinh = .
Boosts in the x1 -direction and rotations suce to describe all of the elements of
the proper, orthochronous Lorentz group. The following is Theorem 1.3.5 of [Nab4].
Theorem 2.3.5. Let be a proper, orthochronous Lorentz transformation. Then
there exists a real number and two rotations R1 and R2 in R such that
= R1 1 ()R2 .
The Theorem suggests that the boosts 1 () contain a great deal of the kinematic
information contained in L+ and this is, indeed, the case. All of the well-known
kinematic eects of special relativity such as the relativity of simultaneity, time dilation, length contraction, the relativistic addition of velocities formula, and the
quite inappropriately named twin paradox are easily derived from the properties of
the coordinate transformation corresponding to 1 (). This, however, is not really
our business here so, for most of this, we will simply refer to the rather detailed
discussions in [Nab4] (particularly pages 29-42). We will, however, need some additional information about timelike vectors and curves that is closely related these
phenomena and we will conclude this section with a brief synopsis of what we require.
56
2 Minkowski Spacetime
Theorem 2.3.7. (Reversed Triangle Inequality) If v and w are timelike vectors with
the same time orientation (that is, v, w > 0), then v + w is timelike and
(v + w) (v) + (w).
Equality holds if and only if v and w are parallel.
Theorem 2.3.7 extends to any finite sum of timelike vectors all of which have the
same time orientation (Corollary 1.4.4 of [Nab4]).
Now suppose I is an interval in R and : I M is a smooth curve in M. Then
is said to be spacelike, timelike, or null, respectively, if its velocity vector (t),
identified with a vector in M, is spacelike, timelike or null, respectively, for each
t I.
Remark 2.3.6. Note that if I has endpoints, then smooth means that extends to
an open interval on which it is C .
A smooth timelike or null curve : I M in M is future directed (respectively,
past directed) if (t) is future directed (respectively, past directed) for every t I. A
future directed timelike curve is also called a timelike worldline, or the worldline of
a material particle and is interpreted physically as the set of all events in the history
of some material particle.
If : [a, b] M is a timelike worldline joining (a) and (b), then we define
the proper time length L() of by
L() =
1/2
(t), (t)
dt =
( (t)) dt.
The physical interpretation of L() is a bit more subtle than one might expect and
depends on additional physical input called the Clock Hypothesis (see pges 4850 of [Nab4]). The generally accepted interpretation is that L() is the time lapse
between the events (a) and (b) as measured by an ideal standard clock carried
along by the particle whose worldline is . This is always less that or equal to the
proper time separation of (a) and (b); the following result combines Theorem
1.4.6 and Theorem 1.4.8 of [Nab4].
Theorem 2.3.8. (Twin Paradox) Let : [a, b] M be a timelike worldline from
(a) to (b). Then the displacement vector (b)(a) is timelike and future directed
and
L() ((b) (a)).
Equality holds if and only if is a parametrization of the straight line joining (a)
and (b).
57
Exercise 2.3.5. Do a little outside reading and decide for yourself why we choose to
call Theorem 2.3.8 the Twin Paradox. Also decide for yourself if there is anything
paradoxical about it.
We now define, for any timelike worldline : [a, b] M, what is for most
purposes its most convenient parametrization. The proper time function = (t) on
[a, b] is defined by
6 t
6 t
= (t) =
( (u)) du =
(u), (u)1/2 du
a
d
dt
for every t in [a, b]. Since is timelike, is smooth and positive so the inverse of
= (t) exists and is smooth with a positive derivative. We can therefore parametrize
by and we will abuse the notation a bit and write this parametrization simply
(). Physically, we are parametrizing the timelike worldline by time readings actually recorded along the worldline, assuming the time is set to zero at (a). Relative
to any admissible basis we write
() = x ()e .
The velocity vector
() =
dx
e
d
d 2 x
e
d2
is its 4-acceleration.
Exercise 2.3.6. Prove each of the following.
1. (), () = 1 for all in [0, L()].
2. (), () = 0 for all in [0, L()].
Thus, the 4-velocity is a unit timelike vector and the 4-acceleration is spacelike at
each point.
58
2 Minkowski Spacetime
59
1
a0
(a, ) = a1
a2
a3
0
0 0
1 0
2 0
3 0
0
0 1
1 1
2 1
3 1
0
0 2
1 2
2 2
3 2
0
0
2
3 1
10
1 3 =
a
2 3
3
3
Then
1
x0 1 2
1
X = x1 =
.
x
x2
x3
1
10
(a, )X =
a
21 2 1
2
1
1
=
x
a + x
Exercise 2.4.1. Show that is a Lie group isomorphism of P+ onto its image in
GL(5, R).
General topological considerations imply that, because the fundamental groups
of L+ and P+ are Z2 , each of these groups has a universal double covering group.
However, we will need explicit constructions of these so we will build them and
explain as we go along what universal double covering group means (also see
page 13). The construction depends on a rather remarkable reformulation of both
Minkowski spacetime and the Lorentz group which we now describe.
We begin by considering the real vector space H2 of 2 2, complex matrices X
T
that are Hermitian (X = X). This is precisely the set of matrices that can be written
in the form
1 0
2
x + x3 x1 ix2
X= 1
= x0 0 + x1 1 + x2 2 + x3 3 = x ,
x + ix2 x0 x3
where x0 , x1 , x2 , and x3 are real numbers, 0 is the 2 2 identity matrix and 1 , 2 ,
and 3 are the Pauli spin matrices (see Exercise 1.2.6). Notice that
1 0
2
x + x3 x1 ix2
det X = det 1
= (x0 )2 (x1 )2 (x2 )2 (x3 )2 .
x + ix2 x0 x3
Define an inner product on H2 by polarization of this quadratic form, that is,
60
2 Minkowski Spacetime
X, YH2 =
1
[det(X + Y) det(X Y)] = x0 y0 x1 y1 x2 y2 x3 y3 .
4
1
Trace ( X), = 0, 1, 2, 3,
2
We conclude that this map is an isomorphism of R1,3 onto H2 that sends the Lorentz
inner product to the H2 -inner product so that we are free to fully identify R1,3 and
H2 .
Next we consider the Lie group SL(2, C) of 2 2 complex matrices with determinant one. For every A SL(2, C) we define a mapping MA : H2 H2 by
T
MA (X) = AXA
for every X H2 .
Exercise 2.4.2. Show that MA (X) is in H2 and det MA (X) = det X for every A
SL(2, C) and every X H2 .
But then MA (X) can be uniquely written in the form
1 0
2
x + x3 x1 i x2
MA (X) = 1
= x0 0 + x1 1 + x2 2 + x3 3 = x ,
x + i x2 x0 x3
where x0 , x1 , x2 , and x3 are real numbers. Thus,
( x0 )2 ( x1 )2 ( x2 )2 ( x3 )2 = (x0 )2 (x1 )2 (x2 )2 (x3 )2 .
Consequently, the mapping x = (x )=0,1,2,3 $ x = ( x )=0,1,2,3 defined by
1 0
2
1 0
2
x + x3 x1 i x2
x + x3 x1 ix2 T
=A 1
A ,
x1 + i x2 x0 x3
x + ix2 x0 x3
which is clearly linear for each fixed A SL(2, C), preserves the quadratic form
x x and therefore, by polarization, preserves the Lorentz inner product x y .
The mapping is therefore a general Lorentz transformation which we will denote
A . Notice that A = A for every A SL(2, C). One can write out the matrix A
explicitly in terms of the entries in A and we will do so in a moment, but for many
purposes it is more useful to note that
(A ) =
To see this we compute
"
1
T#
Trace A A , , = 0, 1, 2, 3.
2
61
"
"
1
1
T#
T#
Trace A A x = Trace A(x )A
2
2
"
1
T#
= Trace AXA
2
"
#
1
= Trace MA (X)
2
= x .
" T#
Notice that (A )0 0 = 12 Trace AA which is one-half the sum of the squared moduli
of the entries in A. This is positive so A is orthochronous. To see that A is proper
we proceed as follows. In Exercise 2.6.3 you will show that SL(2, C) is dieomorphic to R3 SU(2).
SL(2, C) ! R3 SU(2)
Since SU(2) is homeomorphic to the 3-sphere S 3 (Theorem 1.1.4 of [Nab2]) and
S 3 is connected and simply connected (pages 118-119 of [Nab2]), it follows that
SL(2, C) is connected and simply connected (see Theorem 2.4.10 of [Nab2]). Now,
being Lorentz transformations, every A has det A = 1. But det A is a continuous
function of the entries in A so it must be constant on the connected space SL(2, C).
When A is the 2 2 identity matrix, A is the 4 4 identity matrix and this has
determinant 1. Consequently, det A = 1 for all A SL(2, C). We conclude that
A L+ for every A SL(2, C). Thus, we have a mapping
: SL(2, C) L+
(A) = A
of SL(2, C) to L+ .
Next we would like to show that is a group homomorphism, that is,
(BA) = (B)(A)
for all A, B SL(2, C). What we want to show then is that BA = B A and for this
it will be enough to show that, for any x = (x )=0,1,2,3 R1,3 ,
(BA ) x = (B ) (A ) x .
First note that, as we showed above,
x = (A ) x =
and similarly
(B ) x =
"
"
#
1
1
T#
Trace AXA = Trace MA (X)
2
2
"
1
# = 1 Trace " (MB MA )(X)#.
Trace MB (X)
2
2
62
2 Minkowski Spacetime
Finally,
"
#
1
Trace (BA)X(BA)T
2
"
1
T
T#
= Trace B(AXA )B
2
"
#
1
= Trace (MB MA )(X)
2
= (B ) x
(BA ) x =
= (B ) (A ) x
as required.
Next we will show that the map is surjective and precisely two-to-one. For this
it will be convenient to have in hand an explicit representation of A in terms of the
entries
1
2
ab
A=
cd
of A. Arriving at this is routine, albeit tedious. One can either write out the
T
matrix product AXA and read o the coecients of the x or use (A ) =
"
T#
1
2 Trace A A . In either case the result is
2
|a| + |b|2 + |c|2 + |d|2
1 ab + ab + cd + cd
2 i(ab ab + cd cd)
2
|a| |b|2 + |c|2 |d|2
ac + ac + bd + bd
ad + ad + bc + bc
i(ad + bc ad bc)
ac + ac bd bd
i(ad bc ad + bc) ab + ab cd cd
ad + ad bc bc i(ab + cd ab cd)
We have already seen that (A) = (A) and would now like to show that is
precisely two-to-one. Since is a group homomorphism we need only show that the
kernel of is {I}, where I is the 2 2 identity matrix.
Exercise 2.4.3. Equate the explicit matrix representation for A to the 4 4 identity
matrix and show that A = I.
Exercise 2.4.4. Let
A1 () =
Then A1 () is in SL(2, C). Show that (A1 ()) is the element of L+ representing a
boost in the x1 -direction.
(A1 ()) = 1 ()
If one has still not wearied of this laborious arithmetic one can check that, for
any t [0, ] and any unit vector n = (n1 , n2 , n3 ) in R3 , the matrix exponential
63
1
ei(t/2)(n 1 +n
2 +n3 3 )
(2.9)
1 0 0 0
0
,
R =
0 R(n, t)
0
where R(n, t) is the rotation in SO(3) described in Theorem A.2.2. Since every rotation in SO(3) is of this form we can conclude from this that maps the SU(2)
subgroup of SL(2, C) onto the rotation subgroup R of L+ . Then we appeal to Theorem 2.3.5 which asserts that any L+ can be written as = R1 1 ()R2 , where
R1 and R2 are rotations in L+ and to the fact that is a homomorphism to conclude
that maps SL(2, C) onto L+ .
Exercise 2.4.5. Prove each of the following directly from the explicit representation
for A in terms of the entries for A.
1
0
cos (t/2) i sin (t/2)
t ( 2i 1 )
( e
)=
=
i sin (t/2) cos (t/2)
0
0
1
0 0
1 0
0 cos t
0 sin t
sin t
cos t
0 0
1 0
0 cos t sin t 0
i
cos (t/2) sin(t/2)
( et ( 2 2 ) ) =
=
sin (t/2) cos (t/2)
1 0
0 0
0 sin t cos t 0
0 0
1 it/2
2 1 0
0 cos t sin t 0
i
e
0
( et ( 2 3 ) ) =
=
0 eit/2
0 sin t cos t 0
0 0
0 1
1
There is one final consequence we would like to draw from the fact that :
SL(2, C) L+ is a surjective homomorphism with discrete kernel Z2 = {I}.
From the explicit matrix expression for (A) it is clear that is continuous. By
Corollary 1.1.3, it is smooth. Since SL(2, C) and L+ are both connected we can
apply Theorem 1.1.7 to conclude that is a smooth covering map. Since SL(2, C)
is simply connected, it is, in fact, the universal covering group of L+ . Because is
two-to-one, SL(2, C) is called the universal double cover of L+ . SU(2) is also simply
connected so it is the universal double cover of the rotation group R ! SO(3). The
map itself is referred to either as the covering map or, on occasion, the spinor map.
All of this extends at once to the Poincare group which, we recall, is the semidirect product
64
2 Minkowski Spacetime
P+ = R1,3 ! L+ .
determined by the natural action of L+ on R1,3 . Now define inhomogeneous SL(2, C),
denoted ISL(2, C), to be the semi-direct product
ISL(2, C) = R1,3 ! SL(2, C),
where the action of SL(2, C) on R1,3 is determined by the covering map :
SL(2, C) L+ as follows.
A a = (A)a = A a
for all A SL(2, C) and all a R1,3 .
Exercise 2.4.6. Show that the map idR1,3 : ISL(2, C) P+ defined by
(idR1,3 )(a, A) = (a, (A))
for all (a, A) ISL(2, C) is a smooth homomorphism of the semi-direct products
with kernel equal to idISL(2,C) . Conclude that ISL(2, C) is the universal double
cover of P+ .
Exercise 2.4.7. Show that the kernel of idR1,3 : ISL(2, C) P+ is precisely the
center of ISL(2, C).
65
SL(2, C) so these have isomorphic Lie algebras and, similarly, P+ and ISL(2, C)
have isomorphic Lie algebras.
and that these are precisely the real matrices of the form
0 X 0 1 X 0 2 X 0 3
X 0
1
1
0 X 2 X 3
.
X = 0 1 1
X 2 X 2 0 X 2 3
0
1
2
X 3 X 3 X 3 0
From the explicit description of the elements of the Lie algebra o(1, 3) in this
Exercise we can read o a basis for o(1, 3).
0
0
M1 =
0
0
0
0
0
0
0
1
N1 =
0
0
0 0
0 0
0 0
0 0
, M2 =
0 1
0 0
1 0
0 1
1
0
0
0
0
0
0
0
0
0 0
0 0
0
, N2 =
0
1 0
0
00
0 0
0 0 0
0 0 1
0 1
, M3 =
0 0
0 1 0
00
00 0
1
0
0
0
0
0 0
0 0
0
, N3 =
0
0 0
0
10
0
0
0
0
0
0
0
0
j, k = 1, 2, 3
(2.10)
[N j , Nk ] = jkl Ml , j, k = 1, 2, 3
(2.11)
[M j , Nk ] = jkl Nl ,
(2.12)
j, k = 1, 2, 3
66
2 Minkowski Spacetime
Comparing the first of these with Exercise 1.1.2 we find that {M1 , M2 , M3 } generate
a Lie algebra isomorphic to the Lie algebra so(3) of the rotation group. In particular,
since the exponential map on so(3) is surjective (Theorem A.2.2) any rotation in L+
can be written in the form
e
Mj
= e
M1 +2 M2 +3 M3
M2
M1
and e
1
0
=
0
0
M3
0 0
0
1 0
0
0 cos 1 sin 1
0 sin 1 cos 1
Remark 2.5.1. Except for the 0th row of zeros and 0th column of zeros, M1 , M2 ,
and M3 are just the basis vectors X1 , X2 , and X3 for so(3) introduced in Exercise
1.1.2. These are, of course, technically dierent objects, but it is often convenient
j
to simply identify M j and X j for j = 1, 2, 3 so that e M j is either a rotation in L+
or the same rotation in SO(3) depending on the context. We will even take this one
step further and notice that these, in turn, can, by Exercise 1.2.6 (10), be identified
j
with the basis vectors 2i j , j = 1, 2, 3, of su(2) in which case e M j is an element
of SU(2) which corresponds to a rotation via the double covering.
i
i
i
M1 , M2 , M3 X1 , X2 , X3 1 , 2 , 3
2
2
2
Exercise 2.5.3. Show that
N2
N1
and e
cosh 1 sinh 1 0 0
sinh 1 cosh 1 0 0
=
0
1 0
0
0
0
0 1
N3
67
is not a boost, but is the composition of a boost and a rotation (called a Wigner
rotation); there is an elementary discussion of this in [ODV].
An arbitrary element of the Lie algebra of L+ is of the form j M j + j N j for real
numbers j , j , j = 1, 2, 3. Consequently, every
e
M j + j N j
(2.13)
(L ),=0,1,2,3
0 N1 N2 N3
N 0 M M
3
2
.
= 1
N2 M3 0 M1
N3 M2 M1 0
L23
Notice that this can be written
1, if = 2, = 3
=
1, if = 3, = 2
0, otherwise.
L23 = 3 2 2 3 .
Exercise 2.5.4. Check a few more cases to persuade yourself that, for all , =
0, 1, 2, 3 and all , = 0, 1, 2, 3,
L = .
68
2 Minkowski Spacetime
Then lower the index , that is, define L = L , and show that
L =
for all , , , = 0, 1, 2, 3.
We can now write all of the commutation relations (2.10), (2.11), and (2.12) as a
single relation. For each fixed , , , = 0, 1, 2, 3 we have
[L , L ] = L + L + L L .
(2.14)
= e2X
There are advantages to viewing what we have just done from the complexified
perspective (see Remark 1.1.2). For this we note that the complexification of the Lie
algebra of L+ contains, in particular, the matrices
J j = iM j , j = 1, 2, 3,
and
K j = iN j , j = 1, 2, 3.
In terms of these the commutation relations (2.10), (2.11), and (2.12) take the form
[J j , Jk ] = i jkl Jl ,
j, k = 1, 2, 3
(2.15)
[K j , Kk ] = i jkl Jl , j, k = 1, 2, 3
(2.16)
69
[J j , Kk ] = i jkl Kl ,
j, k = 1, 2, 3.
(2.17)
Note: Unless some confusion is likely to arise we will adopt the usual custom of suppressing the subscript C in the notation [ , ]C for the bracket of the complexification
(Remark 1.1.2).
Notice that J1 , J2 , J3 , K1 , K2 , and K3 can still be regarded as generators of the
Lorentz transformations by simply including a factor of i in the argument of the
exponential map, that is, every element of L+ can be written in the form
ei(
J j + j K j )
(2.18)
(M ),=0,1,2,3
0 K1 K2 K3
K 0 J J
3
2
= 1
K2 J3 0 J1
K3 J2 J1 0
"
#
[M , M ] = i M M M + M .
(2.19)
= e 2 X
Another useful set of complex generators for L+ is obtained in the following way.
Define
Sj=
1
(J j + iK j ), j = 1, 2, 3
2
Tj =
1
(J j iK j ), j = 1, 2, 3.
2
and
Exercise 2.5.7. Show that, in terms of these, the commutation relations (2.15),
(2.16), and (2.17) decouple as follows.
[S j , S k ] = i jkl S l , j, k = 1, 2, 3
(2.20)
70
2 Minkowski Spacetime
[T j , T k ] = i jkl T l , j, k = 1, 2, 3
(2.21)
[S j , T k ] = 0, j, k = 1, 2, 3
(2.22)
1
a0
a1
2
a
a3
0
0 0
1 0
2 0
3 0
0
0 1
1 1
2 1
3 1
0
0 2
1 2
2 2
3 2
0
0
2
3 1
10
3 =
a
2 3
3 3
0 0
0 Nj
71
for j = 1, 2, 3 and we will persist in our abuse of the notation by writing these
simply as M j and N j , respectively. They satisfy the commutation relations (2.10),
(2.11), and (2.12). The matrices
0
1
O0 = 0
0
0
0
O2 = 0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0 0 0 0 0
0 0 0 0 0
0
0 , O1 = 1 0 0 0 0
0 0 0 0 0
0
0
00000
0
0 0 0 0 0
0 0 0 0 0
0
0 , O3 = 0 0 0 0 0
0 0 0 0 0
0
0
10000
are generators for the Lie algebra of the translation subgroup R1,3 P+ and they
satisfy the commutation relations
[O , O ] = 0, , = 0, 1, 2, 3.
Inserting a factor of i into each of the generators M j and N j , j = 1, 2, 3, we obtain
the complex generators of rotations and boosts
1
2
0 0
Jj =
, j = 1, 2, 3
0 iM j
and
Kj =
2
0 0
, j = 1, 2, 3
0 iN j
and thereby the matrix (M ),=0,1,2,3 satisfying the commutation relations (2.19).
For the O , = 0, 1, 2, 3, we introduce a factor of i and define
P = iO , = 0, 1, 2, 3.
(2.23)
These satisfy
[P , P ] = 0, , = 0, 1, 2, 3.
All that remains is to compute the brackets [M , P ] and we claim that these are
given by
[M , P ] = i ( P P ), , , = 0, 1, 2, 3.
Exercise 2.5.8. Check this for = 1, = 0, = 0 and as many other cases as your
conscience requires.
72
2 Minkowski Spacetime
(2.24)
(2.25)
(2.26)
The complex generators J j and K j of rotations and boosts, respectively, are given
by
J j = jkl Mkl , j = 1, 2, 3,
and
K j = M j0 , j = 1, 2, 3,
so the commutation relations (2.24), (2.25), and (2.26) can be written in terms of
these by making specific choices for the subscripts in M . For example,
[J1 , J2 ] = [M23 , M31 ] = i(33 M21 ) = i(M12 ) = iJ3 .
Continuing in this way one obtains the following set of commutation relations for
the Poincare algebra p.
[J j , Jk ] = i jkl Jl , j, k = 1, 2, 3
(2.27)
[J j , Kk ] = i jkl Kl , j, k = 1, 2, 3
(2.28)
[K j , Kk ] = i jkl Jl , j, k = 1, 2, 3
(2.29)
[J j , Pk ] = i jkl Pl , j, k = 1, 2, 3
(2.30)
[K j , Pk ] = iP0 jk , j, k = 1, 2, 3
(2.31)
[J j , P0 ] = [P j , P0 ] = [P0 , P0 ] = 0, j = 1, 2, 3
(2.32)
73
[K j , P0 ] = iP j , j = 1, 2, 3
(2.33)
In physics it is generally the commutation relations of p that play the most prominent role rather than any particular realization of them as operators. We have already
seen one concrete representation of these relations as 5 5 matrices, that is, as operators on R5 . We will conclude this section with another realization of p, this time as
operators on an infinite-dimensional vector space. We will find that these two manifestations of the Poincare algebra will suggest a link between the abstract bracket
structure of p and the physics of relativistic systems, both classical and quantum.
Before getting started, however, we should point out that in much of what follows we will often treat P and M as if they were the components of objects that
lived in R1,3 rather than as elements of the Lie algebra p. We will, for example,
raise indices with to define P = P and M = M and, in the next
section, form products such as P P . The motivation for this is as follows. We saw
in Example 1.1.5 that the Poincare group P+ , and therefore its subgroup L+ , acts on
P+ on the right by conjugation and that this induces a right action of L+ on the Lie
algebra p. In particular, L+ P+ acts on each matrix P by 1 P . We will
ask you to show now that this has the same eect as transforming the P as if they
were the components of a 4-vector in R1,3 .
Exercise 2.5.9. Compute the indicated matrix products and show that
1 P = P , = 0, 1, 2, 3.
(2.34)
On the other hand, one thinks of (2.35) below as saying that the generators M
transform as a second rank 4-tensor under L+ .
Exercise 2.5.10. Show that
1 M = M , , = 0, 1, 2, 3.
(2.35)
Remark 2.5.2. One often sees similar terminology used in the physics literature, but
in a dierent context so we should explain. First recall (Section 1.1) that the action
of a matrix Lie group G on its Lie algebra g by conjugation is called the adjoint
action of G and is denoted Ad : G Aut(g). The derivative of Ad at the identity
e G determines an action ad = Ade : g Der(g) of g on g. The value of ad
at X g is denoted adX : g g and is given by adX Y = [X, Y] for every Y g.
This is called the adjoint action of g. Intuitively, g (the tangent space at e G) is
thought of as an infinitesimal version of G. An element of so(3), for example,
is an infinitesimal rotation. From this point of view the adjoint action of g is an
infinitesimal version of the adjoint action of G.
Consider, for example, the adjoint action of L+ P+ on p. Exercise 2.5.9 asserts
that under the action Ad1 of the Lorentz group on p, the generators P transform
74
2 Minkowski Spacetime
like a 4-vector so that, even though it lives in p, one can treat P = (P0 , P1 , P2 , P3 ) as
if it were a vector in R1,3 . Now think of the elements of the Lie algebra o(1, 3) p of
L+ as infinitesimal Lorentz transformations. The adjoint action of o(1, 3), given by
bracket, is then thought of as the action of infinitesimal Lorentz transformations on
p and one can ask how the elements of p transform under this action. For example,
lets consider the generators K1 , K2 , K3 in p and ask how these transform under
infinitesimal rotations. The generators of the rotations are J1 , J2 , and J3 so we are
interested in the action of J j on Kk , that is,
ad J j Kk = [J j , Kk ].
But, according to (2.28),
ad J j Kk = i jkl Kl
so (K1 , K2 , K3 ) transforms like a vector under infinitesimal rotations. Physicists are
inclined to omit the word infinitesimal and to write
K = (K1 , K2 , K3 )
as a reminder that K behaves in some ways like a vector in R3 .
Exercise 2.5.11. Check that the commutation relations (2.24), (2.25), and (2.26) are
the same with all of the indices raised, that is,
[P , P ] = 0
"
#
[M , M ] = i M M M + M
[M , P ] = i ( P P )
(2.36)
(2.37)
(2.38)
75
Thus,
P0 = i0 and P j = i j , j = 1, 2, 3
M jk = i(xk j x j k ), j, k = 1, 2, 3 and M 0 j = M j0 = i(x0 j + x j 0 ), j = 1, 2, 3.
We claim that, when the bracket is taken to be the operator commutator, these operators satisfy the commutation relations defining the Poincare algebra, that is, (2.36),
(2.37), and (2.38). The first of these is clear from the equality of mixed second
order partial derivatives. We will check just two of the remaining cases to see how
things go and then leave the rest to you. Specifically, we will first verify (2.37) when
= 0, = 1, = 0, and = 2. Notice that, in this case, the right-hand side of (2.37)
evaluated at C (R1,3 ; C) reduces to
i (00 M 12 ) = x2 1 x1 2
since all of the remaining factors are zero. To see that the left-hand side of (2.37)
is the same we just compute as follows.
[M 01 , M 02 ] = [i (x0 1 + x1 0 )][i (x0 2 + x2 0 )]
[i (x0 2 + x2 0 )][i (x0 1 + x1 0 )]
= x0 x0 1 2 x0 x2 1 0 x1 0 (x0 2 ) x1 x2 0 0
+ x0 x0 2 1 + x0 x1 2 0 + x2 0 (x0 1 ) + x2 x1 0 0
= x 2 1 x 1 2
as required.
Next we will check (2.38) when = 1, = 2, and = 1. Evaluated at , the
right-hand side contains only one nonzero term, namely,
i (11 P2 ) = i(i2 ) = 2 .
For the left-hand side we compute
[M 12 , P1 ] = M 12 (P1 ) P1 (M 12 )
(2.39)
76
2 Minkowski Spacetime
from Appendix A. In the quantum arena, however, one should keep in mind that
C (R1,3 ; C) consists of smooth objects and therefore is not a Hilbert space so a more
precise analogy will require self-adjoint extensions of the operators to something
that is a Hilbert space. At the moment we are looking only for formal similarities
that we can use as motivation later on. To facilitate the comparisons we will also
adopt units in which " = 1.
We begin by restricting our attention to some spatial cross section of R1,3 corresponding to a fixed time x0 in a fixed admissible frame of reference. This is a copy
of R3 . The operators
P j = i
, j = 1, 2, 3,
x j
on C (R3 ; C) are then just the restrictions of the quantum mechanical momentum
operators (Example A.4.1 and specifically (A.25)) on L2 (R3 ) in the given frame of
reference. This would seem to suggest that the generators P1 , P2 and P3 have something to do with linear momentum. The suggestion is strengthened when we recall
that these are the generators of the spatial translations in the R1,3 subgroup of P+
and that, in classical mechanics, spatial translation symmetry implies conservation
of linear momentum (Example A.2.1).
In the relativistic context, however, the three spatial components of non-relativistic
momentum have no physical significance. Rather, they appear as the non-relativistic
approximations to the spatial components of the momentum 4-vector whose time
component is the total relativistic energy (see Remark 2.6.1). In classical mechanics conservation of total energy follows from time-translation symmetry (Example A.2.1) and P0 is the generator of time translations in R1,3 . Moreover, in nonrelativistic quantum mechanics the total energy is given by the classical Hamiltonian
operator H on L2 (R3 ) and this, according to the Schrodinger equation (see (A.16)),
is related to the time evolution of the wave function by
i
d
((t)) = H((t)),
dt
where the time evolution (t) of the wave function is regarded as a curve in L2 (R3 )
and the t-derivative is the tangent vector to this curve in L2 (R3 ). One can regard the
wave function as defined on R1,3 and, under certain circumstances, one can identify the L2 (R3 )-derivative dtd ((t)) with the time partial derivative of in the given
frame of reference (see Remark 6.2.14 of [Nab5] for more on this). The Schrodinger
equation is then written
i
= H
x0
x0
77
with the total energy operator H. Now, the Schrodinger equation is not relativistically invariant so it has no real status on R1,3 . However, the Relativity Principle
asserts that all admissible frames are physically equivalent and this suggests that the
Schrodinger equation should describe the non-relativistic limit of the time evolution
in any admissible frame of reference. This, in turn, suggests that P0 is the appropriate relativistic analogue of the 0th -component of the 4-momentum in quantum
theory.
The preceding arguments were informal and, perhaps, not entirely persuasive,
but we are simply trying to motivate associating the relativistic 4-momentum to
the abstract generators (P0 , P1 , P2 , P3 ) of the Poincare algebra p. Precisely how this
rather vague association is made use of in practice will be addressed at the end of
this section. In any case, we will proceed now to attempt something similar for the
remaining generators M , , = 0, 1, 2, 3. Only six of these are independent so we
will look just at the following generators.
M 23 = i(x3 2 x2 3 ), M 31 = i(x1 3 x3 1 ), M 12 = i(x2 1 x1 2 )
and
M 01 = i(x0 1 + x1 0 ), M 02 = i(x0 2 + x2 0 ), M 03 = i(x0 3 + x3 0 ).
The appropriate interpretations for M 23 , M 31 , and M 12 seem clear since these are
just the operators representing (orbital) angular momentum in quantum mechanics
(Example A.4.1 with " = 1). Except for a factor of i they also bear a striking
resemblance to the infinitesimal generators (A.3), (A.4), and (A.5) of angular momentum in classical mechanics. This suggests that (J 1 , J 2 , J 3 ) = (M 23 , M 31 , M 12 )
should be associated with the components of angular momentum.
The operators describing M 01 , M 02 , and M 03 are less familiar. These correspond
to the generators of boosts in P+ and are the operators associated with what is called
relativistic angular momentum. This is a topic that generally does not find its way
into most introductions to special relativity (in particular, it will not be found in
[Nab4]) and any discussion of it here would take us rather far afield. For those who
are interested in pursuing this we can suggest Section 7.8 of [Gold] for the physicists perspective and Chapter VII of [Synge] for a geometrical treatment in much
the same spirit as [Nab4]. We will take this rather subtle physics for granted and
simply associate (K 1 , K 2 , K 3 ) = (M 01 , M 02 , M 03 ) with relativistic angular momentum.
Exercise 2.5.13. Regard
M 23 = i(x3 2 x2 3 ), M 31 = i(x1 3 x3 1 ), M 12 = i(x2 1 x1 2 )
as operators on L2 (R3 ) and let be a smooth element of L2 (R3 ).
1. Prove the following commutation relations for these operators.
78
2 Minkowski Spacetime
[M 23 , M 31 ] = i M 12 , [M 31 , M 12 ] = i M 23 , [M 12 , M 23 ] = i M 31
2. Suppose depends only on distance to the origin, that is, (x1 , x2 , x3 ) = (|x|).
Show that M 23 = M 31 = M 12 = 0 so that these operators depend only on
the angular coordinates in R3 . This suggests rewriting the operators in spherical
coordinates.
3. Introduce spherical coordinates , and in R3 by
x1 = sin cos , x2 = sin sin , x3 = cos ,
where 0 < and 0 < 2. Show that
M 23 = i (sin +cot cos ), M 31 = i (cos cot sin ), M 12 = i .
Notice that none of these depend on .
4. Define M2 = (M 23 )2 + (M 31 )2 + (M 12 )2 on the smooth elements of L2 (R3 ) and
show that
+ 1
,
1
2
M2 =
(sin ) +
sin
sin2
Note: Physicists generally write the operators M 23 , M 31 , M 12 and M2 as L x , Ly , Lz
and L2 , respectively.
5. Compare this with the usual expression for the Laplacian in spherical coordinates on R3 to show that
=
1
1
(2 ) 2 M2
2
79
There are a number of ways to approach such problems. One can simply solve the
dierential equations. Another approach via raising and lowering operators is entirely analogous to the procedure carried out for the harmonic oscillator in Section
5.3 of [Nab5]. One or the other, or both, of these will be discussed in some detail
in essentially any basic quantum mechanics book, although perhaps not at a level
of rigor that will satisfy a mathematician (see, for example, Chapter 14 of [Bohm]
or Sections 3.5 and 3.6 of [Sak]). A rigorous proof of the result we need, but will
simply quote can be found in Chapter II, Section 7, of [Prug].
Because M2 commutes with each of M 23 , M 31 , and M 12 (Exercise 2.5.13 (6))
one can actually find simultaneous eigenfunctions for M2 and any one of the angular momentum components, but not more than one since the M i j do not commute
with each other (Exercise 2.5.13 (1)). Physicists interpret this to mean that one can
simultaneously measure M2 and any single one of the angular momentum components (see pages 248-252 of [Nab5] for more on the notion of simultaneous measurability, which is more subtle than one might expect). We will, in fact, describe an
orthonormal basis for L2 (R3 ) consisting of eigenfunctions for both M2 and M 12 . In
particular, it will follow from this that the complete spectrum of each of these operators (possible measured values of the associated observables) consists precisely
of the corresponding eigenvalues). To find such an orthonormal basis we begin by
noting that, if the 2-sphere S 2 and the ray (0, ) are given the measures sin d d
and 42 d, respectively, then the product measure is just the Lebesgue measure on
R3 . Moreover, there is a unique unitary map : L2 (S 2 ) L2 ((0, )) L2 (R3 )
of the Hilbert space tensor product L2 (S 2 ) L2 ((0, )) onto L2 (R3 ) satisfying
( f g)((, ), ) = f (, )g() for all f L2 (S 2 ), g L2 ((0, )), (, ) S 2 and
(0, ) (Chapter II, Theorem 6.9, of [Prug]). Consequently, if { f1 , f2 , . . .} is an
orthonormal basis for L2 (S 2 ) and {g1 , g2 , . . .} is an orthonormal basis for L2 ((0, )),
then { f j gk : j, k = 1, 2, . . .} is an orthonormal basis for L2 (S 2 )L2 ((0, )) (Chapter
II, Theorem 6.10, of [Prug]).
We will briefly describe how to obtain such an orthonormal basis of simultaneous
eigenfunctions for M2 and M 12 . The eigenvalue problems we are interested in are
1
1
(sin ) +
2 =
sin
sin2
(2.40)
i =
(2.41)
and
Y(,
)
+
Y(,
)
= 0.
sin
sin2
Consequently, if one finds solutions to the eigenvalue problem
80
2 Minkowski Spacetime
1
1
(sin Y(, )) +
2 Y(, ) + Y(, ) = 0,
sin
sin2
(2.42)
then R() can be chosen arbitrarily. As it happens, (2.42) is an old and wellunderstood problem in partial dierential equations and mathematical physics. To
describe its solutions we must introduce some equally old and well-known functions. For each
l = 0, 1, 2, . . .
and each
m = l, l + 1, . . . , l,
we define a function Ylm (, ) as follows.
Ylm (, ) = (1)m
- 2l + 1 (l |m|)! .1/2
4 (l + |m|)!
(2.43)
where Plm (u) are the associated Legendre functions of order m defined by the Rodrigues formula
Plm (u) =
(1 u2 )m/2 dl+m 2
(u 1)l , m = 0, 1, 2, . . . .
2l l!
dul+m
The functions Ylm (, ), which are clearly smooth on S 2 , are called spherical harmonics and our interest in them arises from the following result (which is Theorem
7.1, Chapter II, of [Prug]).
Theorem 2.5.1. For each l = 0, 1, 2, . . . and each m = l, l + 1, . . . , l, the spherical
harmonic Ylm (, ) satisfies
1
1
(sin Ylm (, )) +
2 Ylm (, ) + l(l + 1)Ylm (, ) = 0.
(2.44)
sin
sin2
/
0
Furthermore, the functions Ylm (, ) : l = 0, 1, 2, . . . , m = l, l + 1, . . . , l form
an orthonormal basis for L2 (S 2 ).
81
82
2 Minkowski Spacetime
83
d
tX **
dt (e ) t=0
= limt0
(etX ) 1H
t
*
d
(etX )
(etX ) **t=0 = lim
t0
dt
t
and its domain is the set of all H for which this limit in H exists.
Next we must appeal to a rather deep theorem of Nelson [Nel1] which asserts
that there exists a dense linear subspace D of H with D D(d(X)) X g that
is invariant under each d(X)
84
2 Minkowski Spacetime
d(X)(D ) D X g
and on which each d(X) is essentially skew-adjoint. The route Nelson took toward this result is sketched on pages 292-295 of [Nab5] and an application to the
Heisenberg algebra is described on pages 295-300 of [Nab5]. As usual we will use
the same symbol d(X) for the unique skew-adjoint extension of d(X). With D
we can define the Lie algebra of operators W(D ) on H consisting of all skewsymmetric operators W on H for which D D(W), W(D ) D , and W is
essentially skew-adjoint on D . The map
d : g W(D )
then satisfies
d(X + Y) = d(X) + d(Y) X, Y g and D ,
d(aX) = ad(X) X g a R and D ,
and
d([X, Y]g ) = [d(X), d(Y)] X, Y g and D .
d is called a realization of g by skew-adjoint operators on H. For each X g,
id(X) is self-adjoint on H. We will describe a concrete example at the end of
Section 2.8.
Remark 2.5.5. It is not uncommon to see d referred to as a representation of the
Lie algebra g. The terminology can be misleading since the set of operators d(X)
need not form a Lie algebra.
Now suppose that H is the Hilbert space associated to some quantum system
and G is a symmetry group of that system represented by : G U(H) (see
Remark A.4.2). Then, for each X g, id(X) is a self-adjoint operator on H and
therefore corresponds to some observable for the quantum system. If {X1 , . . . , Xn }
is a basis for g and if each of the self-adjoint operators id(X j ), j = 1, . . . , n, has
been associated with some physical quantity, then we will say that the symmetry
group G has been assigned a physical interpretation.
Return now to the case in which G = P+ is the Poincare group (or its universal
cover ISL(2, C) which has the same Lie algebra p). The existence of a unitary representation : P+ U(H) expresses a form of relativistic invariance of the quantum
system whose Hilbert space is H (this is discussed in more detail in Section 2.7).
The Lie algebra p has ten generators P and M = M , , = 0, 1, 2, 3, " ,
which we have already associated with physical quantities. Specifically, in a fixed
admissible frame of reference, P1 , P2 , and P3 are associated with the linear momentum in the x1 , x2 and x3 directions, P0 with the energy, M 23 , M 31 and M 12 with the
85
86
2 Minkowski Spacetime
:gA
is another Lie algebra homomorphism, then there exists a unique algebra homomorphism
:UA
such that (1U ) = 1A and
= .
It is not at all obvious, but the Poincare-Birkho-Witt Theorem (Theorem 3.8 of
[Knapp]) implies that : g U is injective so we can, and will, identify g as
a Lie algebra with its image (g) in U. In particular, the bracket on g, however it
was initially defined, can be viewed as the commutator bracket assoociated with the
multiplication on U.
As usual, defining an object by a universal property does not guarantee that it exists, but it does guarantee that, if it exists, it must be unique. Thus, any two universal
enveloping algebras for g must be isomorphic as unital, associative algebras and we
are justified in denoting it U(g) and calling it the universal enveloping algebra of g.
To settle the question of existence one must construct from g a unital, associative
algebra and a Lie algebra homomorphism from g into it that satisfies the universal
property proposed in the definition. For this one begins with the tensor algebra
T(g) =
T k (g)
k=0
T k (g) = g g
is the kth -tensor power of g (there is a review of the tensor algebra in Appendix A
of [Knapp]). This is a (graded) associative algebra with multiplication given by the
tensor product and unit element 1 T 0 (g) = C. We let J(g) denote the ideal in
T(g) generated by all elements of the form
X Y Y X [X, Y]
for all X and Y in g = T 1 (g). The quotient T(g)/J(g) is then a (graded) unital, associative algebra. Products in T(g)/J(g) are generally written without the multiplication
sign and the inclusion of g in T(g) as a vector subspace induces a map denoted
: g T(g)/J(g) that satisfies
([X, Y]) = (X)(Y) (Y)(X)
87
for all X, Y g. The universal properties of the tensor algebra then imply that
T(g)/J(g) and : g T(g)/J(g) satisfy the defining properties of the universal
enveloping algebra U(g) (Proposition 3.3 of [Knapp]) and this establishes the existence of U(g). In particular, is injective and therefore embeds g in T(g)/J(g) as a
Lie subalgebra when T(g)/J(g) is given its commutator Lie algebra structure.
Since embeds g in U(g) as a Lie algebra one generally suppresses all mention
of it and simply regards g as a Lie subalgebra of U(g). Since g then lives inside
the associative algebra U(g) we can form products of the elements of g and these
products will also live in U(g). In particular, if {X1 , . . . , Xn } is a basis for g, then the
monomials
Xij11 Xijkk
(2.45)
(2.46)
are all in U(g). According to Theorem 3.8 of [Knapp] these monomials actually
form a basis for U(g).
Exercise 2.5.18. Show that Lie algebra representations of g are in one-to-one correspondence with algebra representations of U(g). More precisely, prove each of the
following.
1. Every Lie algebra representation : g End(V) of g on a complex vector space
V extends to a unique algebra representation : U(g) End(V) of U(g) on V.
2. Every algebra representation : U(g) End(V) of U(g) on a complex vector
space V is the extension of a unique Lie algebra representation of g on V.
This exercise applies, in particular, to the adjoint representation ad of g on g
which therefore extends to an algebra representation of U(g) on g. We will denote
this extension
ad : U(g) End(g)
and to call it the adjoint representation of U(g).
Remark 2.5.7. Although we will not make use of it we mention that there is another,
more analytic description of the universal enveloping algebra for the Lie algebra of
a Lie group G. One can identify this Lie algebra with the left-invariant vector fields
on G. Now, a vector field can be thought of as a derivation on the smooth function
defined on G, that is, as first order dierential operator. Composing such operators generates left-invariant dierential operators of higher order on G. Proposition
1.9, Chapter II, of [Helg] proves that the associative algebra generated by the leftinvariant vector fields on G and the identity map is isomorphic to U(g).
88
2 Minkowski Spacetime
Now we will turn to the special case of the Poincare algebra p. A basis for p is
given by
X1 = P0 , X2 = P1 , X3 = P2 , X4 = P3 ,
X5 = J1 = M23 , X6 = J2 = M31 , X7 = J3 = M12 ,
X8 = K1 = M10 , X9 = K2 = M20 , X10 = K3 = M30 ,
so that the monomials (2.45) in these generators span U(p). We are particularly
interested in two specific elements of U(p). The first is denoted P2 and is defined as
follows. For each = 0, 1, 2, 3, define P by P = P . Then
P2 = P P = P20 P21 P22 P23 .
(2.47)
The second is denoted W2 and is defined as follows. Begin by defining the components W U(p) of what is called the Pauli-Lubanski vector by
W =
1
M P ,
2
(2.48)
1
1
1
1
0 M P = 0123 M 12 P3 + 0213 M 21 P3 + 0321 M 32 P1
2
2
2
2
1
1
1
+ 0231 M 23 P1 + 0312 M 31 P2 + 0132 M 13 P2
2
2
2
1
1
1
= (1)M 12 P3 + (1)(M 12 )P3 + (1)(M 23 )P1
2
2
2
1
1
1
23 1
31 2
+ (1)M P + (1)M P + (1)(M 31 )P2
2
2
2
and so
W 0 = W0 = M 12 P3 + M 23 P1 + M 31 P2 .
Notice that
W =
where
M P ,
2
89
= = .
Exercise 2.5.19. Prove each of the following.
1.
2.
3.
4.
W0
W1
W2
W3
= W0 = J1 P1 + J2 P2 + J3 P3
= W1 = J1 P0 + K2 P3 K3 P2
= W2 = J2 P0 + K3 P1 K1 P3
= W3 = J3 P0 + K1 P2 K2 P1
(2.49)
(2.50)
We will discuss the significance of P2 and W2 shortly, but first we will need a few
useful relations (the brackets below refer to the commutator bracket in U(p)).
W P = 0
(2.51)
[W , P ] = 0 , = 0, 1, 2, 3
(2.52)
[W , M ] = i (W W ) , , = 0, 1, 2, 3
(2.53)
[W , W ] = i W P , = 0, 1, 2, 3
(2.54)
1
1
M P P = M ( P P ).
2
2
90
2 Minkowski Spacetime
[W , P ] =
1
1
1
[M P , P ] = M [P , P ] + [M , P ]P .
2
2
2
The first term is clearly zero since [P , P ] = 0 for all , = 0, 1, 2, 3. To see that
the second term is also zero we first compute
[M , P ] = [M , P ] = i ( P P ) = i ( P P ).
Consequently,
1
1
i
P P i P P
2
2
1
1
= i
P P i
P P
2
2
= i
P P
[W , P ] =
which is zero because is skew-symmetric in and , whereas P P is symmetric in and . We will leave (2.53) and (2.54) for you.
Exercise 2.5.21. Prove each of the following.
1. [W , M ] = i (W W )
2. [W , W ] = i W P
Although a bit more labor-intensive, similar arguments will establish
1
W2 = M M P2 + M M P P .
2
(2.55)
The elements P2 and W2 of U(p) are called Casimir invariants of p. The reason
we care about them is that they commute with all of the generators of p in U(p), that
is,
[P , P2 ] = 0 and [M , P2 ] = 0, , = 0, 1, 2, 3
(2.56)
[P , W2 ] = 0 and [M , W2 ] = 0, , = 0, 1, 2, 3.
(2.57)
and
91
[M , P2 ] = [M , P P ] = [M , P ]P + P [M , P ]
"
#
"
#
= i( P P ) P + P i( P P )
"
#
= i P P P P + P P P P
"
#
= i P P P P + P P P P
"
#
= i [P , P ] + [P , P ]
= 0.
92
2 Minkowski Spacetime
93
P1,3 = (R1,3 )
The dual basis for P1,3 will be written {e0 , e1 , e2 , e3 } and we will designate the points
of P1,3 by p = p e , or simply by their coordinates (p0 , p1 , p2 , p3 ) = (p0 , p).
The natural L+ -action on R1,3 gives rise to an L+ -action on P1,3 , called the contragredient action, and defined as follows. If p : R1.3 R is a linear functional in
P1,3 , then p is defined by
( p)(x) = p(1 x)
for every L+ . In coordinates relative to {e0 , e1 , e2 , e3 } this is given by
p = (1 )T p.
Recalling from Exercise 2.3.3 that (1 )T is in L+ whenever is in L+ we conclude
that all of the following subsets of P1,3 are invariant under this L+ -action. For any
m > 0 define
Xm+ = {p P1,3 : p20 p21 p22 p23 = m2 , p0 > 0},
Xm = {p P1,3 : p20 p21 p22 p23 = m2 , p0 < 0},
Xm+ and Xm for m > 0 are called the positive and negative mass hyperboloids, respectively. With the exception of a few remarks now and then we will concentrate
almost exclusively on Xm+ .
Remark 2.6.1. We should say a word about the terminology. Xm+ is called the positive
mass hyperboloid and the reason for the hyperboloid designation is no doubt clear.
However, the mass m is assumed positive for both Xm+ and Xm so positive must refer
to something else. To see what this might be we proceed as follows. Suppose ()
is a timelike worldline parametrized by proper time and m is a positive real number.
Then the pair (, m) is identified with a material particle whose worldline is and
whose mass is m. Since the 4-velocity () is a unit timelike vector at each point,
p = p() = m ()
satisfies
p, p = m2 .
94
2 Minkowski Spacetime
where v is the ordinary velocity vector of the particle in the given frame. Now, since
(x0 ) is in (1, 1) for each x0 we can write a convergent binomial series expansion
for .
= (1 2 )1/2 = 1 +
1 2 3 4
+ +
2
8
If we write v = (dx1 /dx0 , dx2 /dx0 , dx3 /dx0 ) = (v1 , v2 , v3 ) this gives
1
3
pi = mvi + mvi 2 + mvi 4 + , i = 1, 2, 3,
2
8
and
1
3
p0 = m + m 2 + m 4 +
2
8
The term mvi in the expression for pi is the ith -component of the Newtonian momentum in the given frame and the remaining terms are the relativistic corrections. On
the other hand, the appearance of the term 12 m 2 corresponding to the Newtonian
kinetic energy in the expression for p0 leads us to refer to p0 as the total relativistic
energy of the particle and denote it
1
3
E = p0 = m = m + m 2 + m 4 +
2
8
In particular, Xm+ is the surface in P1,3 containing the 4-momenta of particles with
mass m > 0 and positive energy. This would lead one to identify Xm with the surface in P1,3 containing the 4-momenta of particles with negative energy. Classically
one would simply ignore Xm as being unphysical. In quantum theory the situation
95
is not so simple, but the physical interpretation of particles with negative energy
involves subtleties that we will have no need to get into here and this accounts for
our particular interest in Xm+ .
We would like to point out one other consequence of the identification of E and
p0 . We will call p = (p1 , p2 , p3 ) the relativistic 3-momentum of the particle in the
given frame of reference and will write p 2 for p21 + p22 + p23 . Then
E 2 = p 2 + m2 .
(2.58)
This is called the Einstein energy-momentum relation. When one uses traditional
units of time rather than x0 = ct (2.58) becomes
E 2 = p 2 c2 + m2 c4 .
(2.59)
96
2 Minkowski Spacetime
Exercise 2.6.2. Prove the same result for Ym by taking = p0 /m and = (p20 +m2 )/m
and looking at the image of (0, m, 0, 0) under , .
Since every point of P1,3 is on exactly one of Xm , Ym , X0 or X00 these are, in fact, all
of the orbits of L+ in P1,3 .
Next we will need to determine the isotropy subgroup HP0 of P0 = (m, 0, 0, 0)
Xm+ under the L+ -action. Thus, we are looking for all of the L+ for which
P0 = P0 , that is, for which (1 )T P0 = P0 . If = ( ), =0,1,2,3 , then
0
0
1
1 T
( ) = = 2 0
0
3 0
0 1
1 1
2 1
3 1
0 2
1 2
2 2
3 2
which is equal to
0 3
1
3
2 3
3 3
0
0 m
1 m
0
2 0 m
3
0 m
m
0
0
0
1
0
0
0
and this is the case if and only if (1 )T is in the rotation subgroup R of L+ (Theorem
2.3.4). Consequently,
HP = R ! SO(3).
The same argument shows that, for m > 0, the isotropy subgroup of (m, 0, 0, 0) in
Xm is R.
Remark 2.6.2. The isotropy subgroups for Ym (m > 0) can be obtained in a similar
fashion, while those of X0 are somewhat more involved. Since we will require only
the Xm+ case we will simply refer those interested in seeing the results for Ym and X0
to pages 340-341 of [Vara].
97
We have shown that L+ acts transitively on the left on Xm+ with isotropy subgroup
R at (m, 0, 0, 0). Consequently, the homogeneous manifold L+ /R is dieomorphic
to Xm+ (Theorem 1.1.6) which, in turn, is dieomorphic to R3 via the projection
+ : Xm+ R3
(2.60)
98
2 Minkowski Spacetime
We would like to make a smooth selection (p) of a representative of this [A] for
p Xm+ . Since the smooth principal SU(2)-bundle SL(2, C) SL(2, C)/SU(2) !
R3 is trivial it has smooth global sections. Choose such a section
u : SL(2, C)/SU(2) SL(2, C)
and define
: Xm+ SL(2, C)
by
(p) = (u 1
P0 )(p).
(2.61)
(2.62)
U [A] = [UA]
99
A = PV,
where P is positive definite Hermitian and V is in SU(2). Notice that det P = 1 so
P is in SL(2, C) and, moreover,
[A] = A SU(2) = (PV) SU(2) = P SU(2) = [P].
Now define u : SL(2, C)/SU(2) SL(2, C) by
u([A]) = u([P]) = P.
Next we observe that
[UA] = (UA) SU(2) = (UPV) SU(2) = (UP) SU(2) = (UPU 1 ) SU(2) = [UPU 1 ]
Consequently, u([UA]) = u([UPU 1 ]) = P , where P V is the polar decomposition
of UPU 1 . But since P is positive definite Hermitian it can be written in the form
P = eB , where B is Hermitian, and therefore
UPU 1 = UeB U 1 = eU BU
100
2 Minkowski Spacetime
m (B) =
+ (B)
d3 p
,
2p
where
p =
A
A
m2 + p 2 = m2 + p21 + p22 + p23
1
2+ ((p ,p))
The remainder of the proof is simplified if we recall (Theorem 2.3.5) that every
L+ can be written as the composition of two rotations and a boost so it will be
enough to prove m ((B)) = m (B) for boosts and rotations separately. Furthermore,
all of the boosts are treated in exactly the same way so it will suce to consider
cosh
0
=
0
sinh
0
1
0
0
0 sinh
0 0
.
1 0
0 cosh
101
Consequently,
g =
det (g ) =
1
0
0
1
p1 sinh p2 sinh
p
p
.
p3 sinh +p cosh
0
0
which is positive because is orthochronous. Substituting all of this into the Change
of Variables Formula (Theorem 2.6.1) gives
6
6
+ ((p ,p)) 3
d3 p
1
m ((B)) =
=
d p
p
+ ((B)) 2p
+ (B) 2+ ((p ,p))
6
d3 p
=
+ (B) 2p
= m (B)
as required.
Exercise 2.6.5. Complete the proof by showing that m ((B)) = m (B) when is
in the rotation subgroup of L+ .
Remark 2.6.5. The Hilbert space L2 (Xm+ , m ) plays a particularly prominent role in
the construction of a rigorous model of the so-called Wightman axioms for scalar
quantum field theory. We will return to this somewhat later (also see Section X.7 of
[RS2]).
Since the action of SL(2, C) on Xm+ is defined in terms of the L+ -action via the
double covering : SL(2, C) L+ , the measure m is also invariant under the
102
2 Minkowski Spacetime
Furthermore, (2.63) holds in the stronger sense that, if either side is defined (even if
it is not finite), then the other side is defined and they agree.
Exercise 2.6.6. Let 1
P0 be the dieomorphism defined by (2.60). Show that
= (1
P0 ) (m )
is an SL(2, C)-invariant measure on SL(2, C)/SU(2).
103
: SL(2, C) L+
and SL(2, C) acts on R1,3 by defining
: SL(2, C) GL(R1,3 )
by
(A)(x) = A x = (A)x,
where A SL(2, C) and (A)x is the matrix product of (A) L+ with the (column) vector x R1,3 . With this action we define inhomogeneous SL(2, C), denoted
ISL(2, C), to be the semi-direct product of R1,3 (thought of as the translation group)
and SL(2, C), that is,
ISL(2, C) = R1,3 ! SL(2, C).
As for P+ one generally omits the explicit reference to in the notation and writes
ISL(2, C) = R1,3 ! SL(2, C).
The action of SL(2, C) on R1,3 induces a contragredient action
: SL(2, C) GL( (R1,3 ) )
of SL(2, C) on the vector space dual P1,3 = (R1,3 ) of R1,3 defined by
[ (A)() ](x) = [A ](x) = ( (A1 )(x) ),
where A SL(2, C), : R1,3 R is in P1,3 = (R1,3 ) , and x R1,3 . In terms of
components relative to {e0 , e1 , e2 , e3 }
(A)(p) = A p = (A1 )T p,
where T means transpose and (A1 )T p is the matrix product of (A1 )T and the
(column) vector p.
1,3 of the translation group R1,3 (see
Next we consider the character group R
Remark 1.5.2). This has nothing to do with the signature of the inner product so
1,3 is the same as R
4 . The action of SL(2, C) on R1,3 also induces an action
R
1,3
104
2 Minkowski Spacetime
ei(p0 x
1,3
R
1,3 .
is a group isomorphism of the additive group R4 onto the character group R
1,3 can
Remark 2.6.7. It will be convenient later on to note that the characters in R
equally well be written uniquely as
ei(p0 x
p1 x1 p2 x2 p3 x3 )
= (A) 1 .
1,3 can be identified with the vector space dual
With this we conclude that R
1,3
(R ) of R , that is, with momentum space P1,3 , in such a way that the induced
1,3
105
ISL(2, C) Aut(P(H))
106
2 Minkowski Spacetime
Here the downward vertical arrow is the restriction to U(H) of the quotient map
Q : U(H) + U (H) Aut(P(H)) described in Proposition 1.3.2 so
= Q .
107
Exercise 2.7.1. Show that induces in the sense of Theorem 2.7.2. Hint: Follow
the construction in the proof of Theorem 2.7.2
Our problem then is to enumerate the irreducible, unitary representations of
ISL(2, C) and for this we can apply the Mackey machine described in Section
1.5. In the next section we will describe that part of enumeration that is of interest
to us here (the so-called positive energy representations with mass m > 0).
where : SL(2, C) L+ is the covering map (Section 2.4). It will therefore suce
to find the orbits of this action on P1,3 . Since (A1 )T is in L+ , these are the same as
the L+ -orbits in momentum space and these were described in Section 2.4. Notice
that these are all closed subsets of P1,3 so ISL(2, C) is a regular semi-direct product.
For our purposes we will be interested only in the mass hyperboloids Xm+ , m > 0.
Remark 2.8.1. We have chosen to restrict our attention to Xm+ for reasons that are
in the physics, not the mathematics (see Remark 2.6.1). We will eventually describe
the physical interpretation of the results we derive here for Xm+ , but for the remaining
orbits there are subtleties such as negative energy states that we prefer not to become
entangled in at the moment. For those who would like to see the full story we refer
to [Simms] and [Vara].
We have shown in Section 2.4 that, for any point p Xm+ , the isotropy subgroup
of p for the L+ -action on Xm+ is isomorphic to the rotation subgroup R ! SO(3) of
L+ . Consequently, the isotropy subgroup for the SL(2, C)-action is the pre-image
under of R and this is isomorphic to SU(2) (Section 2.4).
B
1,3
Remark 2.8.2. Recall that every point p in Xm+ gives rise to a character p R
defined by
p = (p0 , p1 , p2 , p3 ) Xm+ $ p (x) = ei(p0 x
p1 x1 p2 x2 p3 x3 )
108
2 Minkowski Spacetime
For the remainder of the discussion we will fix the base point P0 = (m, 0, 0, 0)
Xm+ and the corresponding character
0 = eimx
(2.64)
B
1,3 so that by SU(2) we mean the subgroup of SL(2, C) that leaves fixed in
in R
0
+
B
1,3
R and leaves P0 fixed in Xm .
We have also shown in Section 2.4 that Xm+ admits a measure m that is invariant
under the L+ -action and therefore also under the SL(2, C)-action. Recall that it is
defined in the following way. For any Borel set B Xm+ , the projection + (B) is a
Borel subset of R3 and we define
m (B) =
+ (B)
d3 p
=
2p
+ (B)
d3 p
,
C
2 m2 + p2
109
( j/2)
associated to it by D( j/2) is also trivial. We have seen that the Hilbert space HD
of sections of this vector bundle that are L2 with respect to the invariant measure
on SL(2, C)/SU(2) can be identified with equivalence classes (modulo equality up
to sets of measure zero) of functions f : SL(2, C) C(D( j/2) ) that are measurable
with respect to Haar measure on SL(2, C) and satisfy each of the following.
1. f (AU) = D( j/2) (U 1 )( f (A)) for all U SU(2) and for almost all A SL(2, C).
2.
SL(2,C)/SU(2)
For the moment we will think of HD as the space of such functions and then
will switch back to sections when the need arises.
Exercise 2.8.1. Show that
6
6
2
f ([A] d([A]) =
SL(2,C)/SU(2)
Xm+
( j/2)
is defined by
--
. .
SL(2,C)
IndSU(2)
(D( j/2) )(A) f (A ) = f (A1 A ).
(2.65)
When the context makes the intention clear we will try to relieve some of the
notational clutter by writing the action of A in SL(2, C) on f corresponding to
SL(2,C)
IndSU(2)
(D( j/2) ) the way we write every other group action, that is, we will write
(2.65) simply as
(A f )(A ) = f (A1 A ).
(2.66)
( j/2)
110
2 Minkowski Spacetime
B
1,3 corresponding to the base
where 0 is the character in the SL(2, C)-orbit of R
+
point P0 = (m, 0, 0, 0) in Xm (see (2.64)). Note that as A varies over all of SL(2, C),
A 0 varies over this entire orbit.
Finally, we put the two factors together and define, for every (x, A) ISL(2, C)
the unitary operator
SL(2,C)
U(x, A) = U(x) IndSU(2)
(D( j/2) )(A).
Thus,
[U(x, A) f ](A ) = [(A 0 )(x)](A f )(A ) = [(A 0 )(x)] f (A1 A ).
(2.67)
Technically, this completes the application of the Mackey machine to ISL(2, C), but
we will need to work a bit to put (2.67) into a usable form.
Notice that, since A 0 is a character, (A 0 )(x) is a complex number of modulus
1 for each A SL(2, C) and each x R1,3 . We will now write out this phase factor
explicitly. For this we recall that (A 0 )(x) = 0 ((A )1 x) = 0 ((A )1 x).
Exercise 2.8.2. Denote (A ) by L+ and show that the 0th -component of
(A )1 x is
0 0 x0 1 0 x1 2 0 x2 3 0 x3 .
Conclude that
0
(A 0 )(x) = eim( 0 x
1 0 x1 2 0 x2 3 0 x3 )
0 0
x p1 x1 p2 x2 p3 x3 )
= eip x = eip,x ,
where p is just m times the 0th -column of (A ). With this (2.67) becomes
[U(x, A) f ](A ) = eip,x f (A1 A ).
(2.68)
Next we will think a bit more about the second factor f (A1 A ) and return to the
( j/2)
view of HD as sections of a vector bundle. Recall that f : SL(2, C) C(D( j/2) )
is a map from SL(2, C) to C(D( j/2) ) that is equivariant with respect to the SU(2)actions, that is, satisfies
f (AU) = D( j/2) (U 1 )( f (A))
111
for all U SU(2) and for almost all A SL(2, C). As such, it determines a section
s f : SL(2, C)/SU(2) SL(2, C) D( j/2) C(D( j/2) )
of the vector bundle
D( j/2) : SL(2, C) D( j/2) C(D( j/2) ) SL(2, C)/SU(2)
given by
s f ([A ]) = [A , f (A )]
for all [A ] SL(2, C)/SU(2) and any A [A ] (see Remark 1.4.1). Moreover, any
section s of this vector bundle is s f for one and only one such equivariant map f .
Notice that both the base SL(2, C)/SU(2) and the vector bundle space SL(2, C)D( j/2)
C(D( j/2) ) admit continuous SL(2, C)-actions given by
A [B] = [AB]
and
A [B, v] = [AB, v],
respectively, and that these commute with the projection D( j/2) . Now, we define an
( j/2)
( j/2)
SL(2, C)-action on the sections in HD as follows. For any section s HD and
any A SL(2, C) we take A s to be the section defined by
(A s)([A ]) = A (s(A1 [A ])) = A s([A1 A ])
for all [A ] SL(2, C)/SU(2). The motivation for the definition is as follows. If s f
is the section corresponding to f , then A s is the section corresponding to f LA ,
where LA is the dieomorphism of SL(2, C) onto itself defined by LA (A ) = A1 A ,
that is,
(A s f )([A ]) = s f LA ([A ]) = [A , f (A1 A )]
for all A SL(2, C) (compare (2.68)).
Since is an invariant measure on SL(2, C)/SU(2) this action of SL(2, C) on the
( j/2)
( j/2)
sections in HD determines a unitary representation of SL(2, C) on HD and
one can show that this representation is strongly continuous. Note that we can write
this in terms of the induced action of SL(2, C) on the corresponding equivariant
function f as
(A s f )([A ]) = [A , (A f )(A )].
(2.69)
112
2 Minkowski Spacetime
(2.70)
where we again point out that p = p(A ) is m times the 0th column of (A ). This is
looking more like the expression we want for U(x, A), but there is still work to do.
The next thing to observe about the vector bundle D( j/2) : SL(2, C) D( j/2)
C(D( j/2) ) SL(2, C)/SU(2) is that, since SL(2, C)/SU(2) ! R3 , it is trivial
so that any section s of it can be identified with a function on SL(2, C)/SU(2)
whose value at any [A] is a point in the fiber 1
([A]) above [A]. Furthermore,
D( j/2)
each of these fibers can be identified with the finite-dimensional Hilbert space
C(D( j/2) ). Such an identification is obtained in the following way. The principal
bundle : SL(2, C) SL(2, C)/SU(2) is trivial for the same reason so we
can select some global section u : SL(2, C)/SU(2) SL(2, C). Then, for each
[A] SL(2, C)/SU(2), u([A]) SL(2, C) so there is a unique ([A]) C(D( j/2) ) for
which
s([A]) = [u([A]), ([A])].
Specifically, if s = s f corresponds to the equivariant map f : SL(2, C) C(D( j/2) ),
then = f = f u. Thus, given u, we can identify the section s with the function
: SL(2, C)/SU(2) C(D( j/2) ). However, this identification of s with a function
from SL(2, C)/SU(2) to C(D( j/2) ) is not unique since it depends on the choice of
the section u. Indeed, if u : SL(2, C)/SU(2) SL(2, C) is another global section,
then for each [A] SL(2, C)/SU(2) we can write u ([A]) = u([A])U([A]) for some
U([A]) SU(2) and then, by definition of the equivalence relation defining the
points of SL(2, C) D( j/2) C(D( j/2) ), [u([A]), U([A]) ([A])] = [u([A]), ([A])] so
s([A]) = [u([A]), U([A]) ([A])].
In other words, is determined only up to the action of SU(2) given by the representation D( j/2) .
Next we will need a few computational formulas. First we note that, when s = s f ,
f can be recovered from . To see this we write
f (A ) = f (u([A ]) u([A ])1 A )
D!!!!!!!EF!!!!!!!G
=D
so
( j/2) "
SU(2)
#
(A ) u([A ]) f (u([A ]))
1
"
#
f (A ) = D( j/2) (A )1 u([A ]) ([A ]).
(2.71)
Remark 2.8.3. Notice that u([A ])1 A is in SU(2) because its projection into
SL(2, C)/SU(2) is the identity.
113
=D
( j/2)
SU(2)
+
,
+
,
(A )1 u([A ]) D( j/2) u([A ])1 Au(A1 [A ]) (A1 [A ])
(2.72)
We have not yet defined an action of SL(2, C) on , but (2.71) suggests that we
define A in such a way that
"
#
(A f )(A ) = D( j/2) (A )1 u([A ]) (A )([A ])
Thus,
+
,
(A )([A ]) = D( j/2) u([A ])1 Au(A1 [A ]) (A1 [A ]).
(2.73)
+
,"
#
f (A1 A ) = D( j/2) (A )1 u([A ]) (A )([A ])
"
# "
#
= (A )1 u([A ]) (A )([A ]) .
114
2 Minkowski Spacetime
+
,
[U(x, A)]([A ]) = eip,x D( j/2) u([A ])1 Au(A1 [A ]) (A1 [A ]).
(2.74)
is the smooth section of the principal SU(2)-bundle P0 : SL(2, C) Xm+ corresponding to u, and
+
( j/2)
m = 1
)
P0 : Xm C(D
115
for every A SL(2, C). Then (Am )(p) = (A)([A ]), where p = P0 ([A ]) = A P0 .
Furthermore, u([A ]) = u(1
P0 (p)) = (p).
Exercise 2.8.4. Show that P0 (A1 [A ]) = A1 p.
However, SL(2, C) acts on Xm+ by Lorentz transformations via the double covering so, if we write A L+ for the image (A) of A, then what the last Exercise
shows is that
1
A1 [A ] = 1
P0 (A p).
p)
m (1
A
A p).
(2.75)
p)
m (1
A
A p).
(2.76)
This is the form in which one most often sees the representation U(x, A) written in
the physics literature.
Remark 2.8.4. Just as a reminder and to anticipate some of the physical terminology
that we will discuss more fully as we proceed lets record the interpretation of the
various objects in (2.76). (x, A) is an element of the double cover R1,3 ! SL(2, C)
of the Poincare group P+ so x is interpreted as a translation on Minkowski spacetime and A is an element of SL(2, C) acting on Minkowski spacetime by Lorentz
transformations via the double covering : SL(2, C) L+ . U(x, A) is a unitary operator on L2 (Xm+ , m , C(D( j/2) )), where Xm+ P1,3 is the mass hyperboloid,
m is an SL(2, C)-invariant measure on Xm+ , and C(D( j/2) ) is the finite-dimensional
Hilbert space of carriers of the spin j/2 representation of SU(2). We will refer to
L2 (Xm+ , m , C(D( j/2) )) as the 1-particle space. The elements of L2 (Xm+ , m , C(D( j/2) ))
will be interpreted as wave functions for a free material particle with 4-momentum p
satisfying p, p = m2 so that m is interpreted as the mass of the particle. These wave
functions take values in C(D( j/2) ) which is C only when j = 0. The significance of
the additional components when j > 0 will be explained in due course (Section ??).
The point p = A (m, 0, 0, 0) varies over all of Xm+ as A varies over SL(2, C) and
(p) is a mapping of Xm+ to SL(2, C) with the property that (p) (m, 0, 0, 0) = p
for each p Xm+ . Physically, (p) therefore corresponds to a Lorentz transformation from a frame in which the particles 4-momentum is (p0 , p1 , p2 , p3 ) to a frame
in which the 4-momentum is (m, 0, 0, 0), that is, to the rest frame of the particle.
1
+
1,3
A = (A) and 1
A p is the contragredient action of A on Xm P , that is,
1 1 T
T
T
A p = [(A ) ] p = A p = (A) p. U itself is called the (Wigner) representation
116
2 Minkowski Spacetime
of mass m and spin j/2. What the term spin has to do with the corresponding term
in physics will be discussed in Section A.5.
Notice that, when x = 0 (no translation), (2.76) reduces to
+
,
[U(0, A)m ](p) = D( j/2) (p)1 A (1
p)
m (1
A
A p)
(2.77)
(2.78)
The simplest case of (2.76) occurs when j = 0 since C(D(0) ) = C and D(0) is
the trivial representation, that is, D(0) (U) = idC for every U SU(2) (see Remark
1.2.3). In this case,
[U(x, A)m ](p) = eip,x m (1
A p)
(2.79)
for all (x, A) ISL(2, C). Most of our attention later on will focus on this spin zero
case. We will refer to a m satisfying (2.79) as classical, relativistic scalar field on
Xm+ . and our ultimate goal is to quantize the physical system it describes.
We make one more general observation about (2.76). The carriers of the repj
resentation D( j/2) can be identified with 2 j -tuples ( A1 A j )A1 ,...,A j =1,2 in C2 that are
invariant under all permutations of A1 A j (see Example 1.2.2). The dimension of
j
this subspace of C2 is j + 1 so each m has j + 1 components which we write as
A A j
m1
(p), A1 , . . . , A j = 1, 2,
A A
(2.80)
where we sum over B1 , . . . , B j = 1, 2. We will refer to a m with component functions that transform under the ISL(2, C)-action according to (2.80) as a (contravariant) spinor field of rank j on Xm+ .
All of our eorts in this section have been directed toward the irreducible, unitary representation of the double cover ISL(2, C) of the Poincare group P+ . We have
seen in Section 2.7 how these determine the irreducible projective representations of
P+ and that these are the objects that express relativistic invariance for quantum systems. We should note that we have, in fact, also determined the irreducible, unitary
117
(P 1 m )(p) = p1 m (p)
on L2 (Xm+ , m , C(D( j/2) )). The domain of P 1 is the set of all m in L2 (Xm+ , m , C(D( j/2) ))
for which p1 m (p) is also in L2 (Xm+ , m , C(D( j/2) )). Similarly,
[U((0, 0, t, 0), I)m ](p) = eitp2 m (p),
[U((0, 0, 0, t), I)m ](p) = eitp3 m (p),
118
2 Minkowski Spacetime
p)
m (1
(2.81)
A
A p)
Here : Xm+ SL(2, C) is any function with the property that (p) (m, 0, 0, 0) = p
for each p Xm+ . In Remark 2.6.3 we showed that one can choose an that is
equivariant with respect to the natural actions of SU(2) on Xm+ and SL(2, C) and we
will now assume that we have made such a choice. In particular, if X is any element
of the Lie algebra of the isotropy subgroup SU(2) of (m, 0, 0, 0) and t R, then
(etX p) = etX (p) etX .
Exercise 2.8.6. Show that, with such a choice for , (2.81) gives
119
(2.82)
m ](p) = lim
[iA
*
9*
d8
d
m (etX p)**t=0 i D( j/2) (etX ) **t=0 m (p).
dt
dt
(2.83)
120
2 Minkowski Spacetime
Exercise 2.8.7. Let : SL(2, C) L+ be the covering map. Prove each of the
following.
0
1 0 0
0 1 0
0
= ( e(it/2)1 )
eitM23 = etM1 =
0 0 cos t sin t
0 0 sin t cos t
0 0
1 0
0 cos t sin t 0
= ( e(it/2)2 )
eitM31 = etM2 =
1 0
0 0
0 sin t cos t 0
0 0
1 0
0 cos t sin t 0
= ( e(it/2)3 )
eitM12 = etM3 =
0 sin t cos t 0
0 0
0 1
and
*
d
[O 12 m ](p) = i m (eitM12 p)**t=0
dt
[S 12 m ](p) = i
Thus,
d 8 ( j/2) itM12 9 **
D (e
) *t=0 m (p).
dt
(2.84)
(2.85)
. **
d[O 12 m ](p) = i
m (p0 , p1 cos t p2 sin t, p1 sin t + p2 cos t, p3 ) ***
dt
t=0
+
m ,
m
= i p2
p1
p1
p2
,
i p2
p1
p1
p2
on L2 (Xm+ , m .C(D( j/2) )). We will refer to the physical quantity represented by O 12 as
the orbital angular momentum about the x3 -axis corresponding to the representation
U and the section .
121
d 8 (0/2) itM12 9 **
D
(e
) *t=0 = 0 (s = 0)
dt
[S 12 m ](p) =
=
(s = )
2
(p)
0 12 2m (p)
2
2
m
Exercise 2.8.8. Show that, when s = 1,
d 8 (2/2) itM12 9 **
D
(e
) *t=0
dt
1
0
=
0
0
0
0
0
0
0 0
0 0
(s = 1)
0 0
0 1
so that the eigenvalues are {1, 0, 1}. Hint: See Exercise 1.2.8.
8
9*
Continuing in this way one finds that if s = 2j , then i dtd D( j/2) (eitM12 ) **t=0 has
eigenvalues
4
j
j
j
j5
, + 1, . . . , + j =
2
2
2
2
so that the spin of the representation is the largest eigenvalue of S 12 . We will refer
to S 12 as the spin angular momentum about the x3 -axis corresponding to the rep 12 = O 12 + S 12 is the total angular momentum
resentation U and the section . M
about the x3 -axis.
122
2 Minkowski Spacetime
Remark 2.8.5. An element m of the 1-particle space L2 (Xm+ , m , C(D( j/2) )) is interpreted as the wave function of a free material particle of mass m and spin s = 2j ,
where mass and spin here refer to the terms as they are used in physics to denote
certain physically measured observables. One would like to understand the rationale
behind identifying these physical observables with the mathematical mass and spin
parameters that we have introduced here. In the case of the mass we have seen above
that this rationale is essentially just the relativistic relation between the Minkowski
norm of a particles 4-momentum and its mass.
p20 p21 p22 p23 = m2
The corresponding rationale for spin will, of course, have to wait until we know
what the physicists mean by spin and so must be deferred to Section A.5. One
particular aspect of this rationale is worth briefly anticipating here however. We have
identified the spin of a representation with the largest eigenvalue of the spin angular
momentum about the x3 -axis and one might reasonably wonder what is so special
about the x3 -axis. The answer is quite simple, that is, nothing at all. Physically, the
spin of, say, an electron about the x3 -axis is thought of as the third component
of a spin vector, but this is a rather peculiar vector that could only exist in the
quantum world in that its projection onto every axis is either 21 or 12 ( "2 if " is
not taken to be 1). One can see that this is at least consistent with the mathematical
notion of spin that we have introduced. For example, if one repeats the arguments
we have just given in the s = 12 case with M12 replaced by the generator M23 of
rotations about the x1 -axis one finds that
1
2*
1
2
d 8 (1/2) itM23 9 **
d cos (t/2) i sin (t/2) **
1
0 12
** =
D
(e
) *t=0 = i
(s = )
1
2 0
dt
dt i sin (t/2) cos (t/2) t=0
2
(Exercise 2.4.5) and this has eigenvalues 12 . Consequently, S 12 and S 23 have the
same eigenvalues and, in particular, the same largest eigenvalue. The spin about the
x1 -axis is the same as the spin about the x3 -axis.
Exercise 2.8.9. Prove the same thing for S 31 .
One can generalize to show that, when the representation of SU(2) is D( j/2) , then
s = 2j is the largest eigenvalue of all of the operators S 12 , S 23 , and S 31 so that, in
this sense at least, our spin parameter has the property required of the physicists
spin observable.
Appendix A
Physical Background
A.1 Introduction
Although this volume was planned as a sequel to [Nab5] we would not want this
earlier work to be a sine qua non for this one. Toward this end we will try to remove
some potential obstacles by providing brief synopses of a number of topics in classical and quantum mechanics that we must depend upon here and that are discussed
in greater detail in [Nab5].
124
A Physical Background
q i (x, v x ) = v x [qi ]. i = 1, . . . , n.
We will, on occasion, write q i (x, v x ) simply as q i (v x ).
Any smooth real-valued function L : T M R on the state space T M is called a
Lagrangian on M. Such a function can be described locally in natural coordinates.
We adopt the usual custom of writing such a local coordinate representation as
L(q1 , . . . , qn , q 1 , . . . , q n ) = L(q, q).
Ca,b
([t0 , t1 ], M) has a unique lift to a smooth curve
: [t0 , t1 ] T M
in the tangent bundle defined by
(t)
= ((t), (t)),
where (t)
denotes the velocity (tangent) vector to at t. The action functional
associated with the Lagrangian L is the real-valued function
S L : Ca,b
([t0 , t1 ], M) R
defined by
S L () =
t0
t1
L((t))
dt =
t1
t0
L((t), (t))
dt.
: [t0 , t1 ] (, ) M
for some > 0 such that
(t, 0) = (t), t0 t t1
125
is an element of Ca,b
([t0 , t1 ], M). Then S L ( s ) is a smooth real-valued function of
*
d
S L ( s ) ** s=0 = 0
ds
for every variation of . In this case we will call a stationary curve, or a critical
curve for the action S L . According to the Principle of Stationary Action, also called
the Principle of Least Action, the actual time evolution of the state of the system
will take place along the lift of a stationary curve for S L . One should understand,
of course, that this is not a mathematical theorem, but rather a physical assumption
that happens to be well born out by experience. We will oer some motivation in
Example A.2.1.
For curves that lie in some coordinate neighborhood in M one can write down
explicit equations that are necessary conditions for to be a stationary point of S L .
Let Ca,b
([t0 , t1 ], M) be a curve whose image lies in a coordinate neighborhood
with coordinate functions q1 , . . . , qn . The lift of is written in natural coordinates
as (t)
= (q1 (t) . . . , qn (t), q 1 (t), . . . , q n (t)), where qi (t) is a notational shorthand for
i
q ((t)) and similarly q i (t) means q i ((t)).
(t),
(t)
= 0, 1 i n.
qi
dt q i
(A.1)
L
d + L ,
= 0, 1 i n.
qi dt q i
(A.2)
These are the famous Euler-Lagrange equations which one often sees written simply as
The Euler-Lagrange equations are satisfied along any stationary curve for S L .
Notice that one can draw a rather remarkable conclusion just from the form in
which the Euler-Lagrange equations are written, namely, that if the Lagrangian L
L
happens not to depend on one of the coordinates, say qi , then q
i = 0 everywhere
L
and (A.1) implies that, along any stationary curve, q i is constant. In more colloquial
terms, Lq i is conserved as the system evolves.
L
qi
= 0 =
L
q i
L
q i
126
A Physical Background
127
The content of our next result is that infinitesimal symmetries of L give rise to
conserved quantities.
Theorem A.2.1. (Noethers Theorem) Let L : T M R be a Lagrangian on a
smooth manifold M and suppose X is an infinitesimal symmetry of L. Let q1 , . . . , qn
be any local coordinate system for M with corresponding natural coordinates
q1 , . . . , qn , q 1 , . . . , q n . Write X = X 1 q1 + + X n qn = X i qi . Then
Xi
L
= X i pi = X 1 p1 + + X n pn
q i
*
d tN
(e x) **t=0 .
dt
Each of these in turn gives rise, via Noethers Theorem, to a conserved quantity. The
conservation laws come from the Lie algebra of the symmetry group.
We will now try to illustrate all of these ideas with a concrete example. Many
more such examples are available in Section 2.2 of [Nab5].
Example A.2.1. (Momentum, Angular Momentum, and Energy) For our configuration space we begin with M = Rn and choose global standard Cartesian coordinates
(q1 , . . . , qn ) on Rn . The state space is then T Rn = Rn Rn and the corresponding
natural coordinates are q1 , . . . , qn , q 1 , . . . , q n . Letting V(q1 , . . . , qn ) denote an arbitrary smooth, real-valued function on Rn and m a positive constant, we take our
Lagrangian to be
128
A Physical Background
n
1 !
L(q , . . . , q , q , . . . , q ) = m (q i )2 V(q1 , . . . , qn ).
2 i=1
1
L
V
= i , i = 1, . . . , n
qi
q
and
L
= mq i , i = 1, . . . , n.
q i
Thus, (A.2) becomes
V
d
(mq i ) = 0, i = 1, . . . , n,
qi dt
that is,
m
d2 qi
V
= i , i = 1, . . . , n
2
q
dt
and these are just the components of Newtons Second Law for a conservative force
V/qi , i = 1, . . . , n, with potential V. This is, of course, one of the primary motivations behind the Principle of Stationary Action.
Notice that if the potential V(q1 , . . . , qn ) happens not to depend on, say, the ith coordinate qi , then, since pi = Lq i = mq i , we conclude that pi = mq i is constant along the trajectory of the particle. Now, mq i is what physicists call the ith component of the particles (linear) momentum. Thus, if the potential V is independent of the ith -coordinate, then the ith -component of momentum is conserved during
the motion. The conjugate momentum is really momentum in this case. In particular, for a particle that is not subject to any forces (a free particle), the potential
V(q1 , . . . , qn ) is constant so all of the momentum components are conserved. One
can phrase this in the following way. If the potential V(q1 , . . . , qn ) and therefore
H
the Lagrangian L(q1 , . . . , qn , q 1 , . . . , q n ) = 12 m ni=1 (q i )2 V(q1 , . . . , qn ) is invariant
under translations in Rn , then momentum is conserved.
Spatial Translation Symmetry implies Conservation of (Linear) Momentum
We conclude by looking at a somewhat less obvious example of this sort of conservation law.
We will specialize to the case in which n = 3 and the potential is spherically
symmetric, that is, V depends only on q = ( (q1 )2 + (q2 )2 + (q3 )2 )1/2 .
129
V(q1 , q2 , q3 ) = V(q)
Thus, we can write the Lagrangian as
L(q, q)
=
1
mq
2 V(q).
2
Now we will find a symmetry group of L. Recall that the rotation group SO(3)
consists of all 3 3 real matrices A that are orthogonal (AT A = AAT = id33 ) and
have determinant one (det A = 1). SO(3) acts on R3 by matrix multiplication
A (q) = Aq,
where Aq means matrix multiplication with q R3 thought of as a column vector.
Since A is invertible, A is a dieomorphism of R3 onto R3 . Moreover, since A
is linear, its derivative at each point is the same linear map (multiplication by A)
once the tangent spaces are canonically identified with R3 . Thus, the induced map
on state space
T A : T R3 = R3 R3 T R3 = R3 R3
is given by
(T A )(q, q)
= (A (q), (A )q (q))
= (Aq, Aq).
130
A Physical Background
0 n3 n2
3
0 n1 .
N = n
2 1
n n
0
Geometrically, one thinks of A = etN as the rotation of R3 through t radians about
an axis along n in a sense determined by the right-hand rule from the direction of n .
Now fix an n and the corresponding N in so(3). For any q R3 ,
t etN q
is a curve in R3 passing through q at t = 0 with velocity vector
d tN **
(e q) *t=0 = Nq.
dt
d tN **
(e q) *t=0 = Nq.
dt
0 1 0
N = 1 0 0
0 0 0
so
2
q
XN (q) = Nq = q1 .
(A.3)
131
Taking n to be (1, 0, 0) and (0, 1, 0) one obtains, in the same way, the vector fields
X23 = q2 q3 q3 q2
(A.4)
X31 = q3 q1 q1 q3 .
(A.5)
and
L
1 L
2 L
3 L
= X12
+ X12
+ X12
= q2 (mq 1 ) + q1 (mq 2 ) = m(q1 q 2 q2 q 1 ).
1
2
k
q
q
q 3
q
For X23 and X31 one obtains m(q2 q 3 q3 q 2 ) and m(q3 q 1 q1 q 3 ), respectively. Thus,
along any stationary curve,
m [q1 (t)q 2 (t) q2 (t)q 1 (t)]
m [q3 (t)q 1 (t) q1 (t)q 3 (t)]
m [q2 (t)q 3 (t) q3 (t)q 2 (t)]
are all constant. Notice that these are precisely the components of the cross product
r(t) [m r (t)]
of the position and momentum vectors of the particle and this is what physicists call
its (orbital) angular momentum (with respect to the origin). Thus, angular momentum is constant for motion in a spherically symmetric potential in R3 .
Rotational Symmetry implies Conservation of Angular Momentum
Notice also that the constancy of the angular momentum vector along the trajectory
of the particle implies that the motion takes place entirely in a 2-dimensional plane
in R3 , namely, the plane with this normal vector.
We will conclude with an example of a slightly dierent sort. We have defined a
Lagrangian to be a function on the tangent bundle T M, but it is sometimes convenient to allow it to depend explicitly on t as well, that is, to define a Lagrangian to
132
A Physical Background
be a smooth map L : R T M R. Then, for any path in the domain of a coordinate neighborhood on M, one would write L = L(t, (t), (t)),
t0 t t1 , in natural
coordinates. The action associated with this path is defined, as before, to be the integral of L(t, (t), (t))
over [t0 , t1 ]. Stationary points for the action are also defined in
precisely the same way and one can check that the additional t-dependence has no
eect at all on the form of the Euler-Lagrange equations, that is, stationary curves
satisfy
# d + L "
#,
L "
t,
(t),
(t)
t,
(t),
(t)
= 0, 1 k n.
dt q k
qk
(A.6)
Write the stationary curve in local coordinates as (t) = (q1 (t), . . . , qn (t)) and the
Lagrangian evaluated on the lift of this curve as
L(t, q1 (t), . . . , qn (t), q 1 (t), . . . , q n (t)).
Computing
dL
dt
along the stationary curve. If it so happens that L does not depend explicitly on t,
then L
i L is conserved along the stationary curve. Notice that
t = 0 so E L = pi q
H
for the particle Lagrangian L(q1 , . . . , qn , q 1 , . . . , q n ) = 12 m ni=1 (q i )2 V(q1 , . . . , qn )
E L = pi q i L =
1 ! i2
m (q ) + V(q1 , . . . , qn )
2 i=1
(A.7)
which is the total energy (kinetic plus potential). We will phrase this in the following
way.
Time Translation Symmetry implies Conservation of Energy
Here then is a synopsis of what we have concluded about symmetries and conservation laws for classical mechanical systems.
Symmetry
Conservation Law
133
134
A Physical Background
gral curve of XH through this initial state. Consequently, the evolution of the state
(q1 (t), . . . , qn (t), p1 (t), . . . , pn (t)) with time satisfies Hamiltons equations
q i =
H
H
and p i = i , i = 1, . . . , n.
pi
q
The Hamiltonian flow, that is, the flow of the Hamiltonian vector field XH , therefore
contains all of the information about the evolution of the state of the system.
Remark A.3.2. Depending on the nature of H, Hamiltons equations may have solutions defined only on some finite interval of t values so the Hamiltonian flow may
be only a local flow.
Example A.3.1. (Particle Motion in Rn ) The standard example models a single particle of mass m > 0 moving in R3 under the influence of some conservative force
F = V. The configuration space is M = R3 and we denote by (q1 , q2 , q3 ) its
standard Cartesian coordinates. Physically, M is identified with the space of possible positions of the particle. Since R3 is contractible, its cotangent bundle can be
identified with T M = R3 R3 = R6 . The Hamiltonian is to be the total energy of
the particle, that is, the sum of its kinetic and potential energies, and we would like
to describe this in canonical coordinates (q1 , q2 , q3 , p1 , p2 , p3 ).
Remark A.3.3. In Example A.2.1 we saw that the dierence of the kinetic and potential energies is the Lagrangian L for this system and that the momentum conjugate
to qi is L/q i = mq i . In light of Remark A.3.1 we have pi = mq i so the particles
1
kinetic energy can be written 2m
p2 .
The particles potential energy due to the influence of the conservative force V(q)
is just V(q). The Hamiltonian H(q, p) is the sum of these.
H(q, p) =
1
1 2
p2 + V(q) =
(p + p22 + p23 ) + V(q1 , q2 , q3 ).
2m
2m 1
1
V
pi qi i pi .
m
q
1
V
pi and p i = i , i = 1, 2, 3.
m
q
135
Notice that substituting the first of these into the second gives Newtons Second Law
mq i =
V
, i = 1, 2, 3.
qi
1 2 m2 2
p +
q
2m
2
for the classical harmonic oscillator of mass m and natural frequency (see Chapter
1 of [Nab5]) . The Hamiltonian vector field is
XH (q, p) =
1
p q m2 q p
m
1
p and p = m2 q.
m
136
A Physical Background
f g
f g
qi pi pi qi
and that the Lie bracket of two symplectic gradient vector fields X f and Xg is the
symplectic gradient of the Poisson bracket of f and g, that is,
[X f , Xg ] = X{g, f } f, g C (T M; R).
The Poisson bracket provides C (T M; R) with the structure of a Lie algebra since
{ , } : C (T M; R) C (T M; R) C (T M; R)
is R-bilinear, skew-symmetric
{g, f } = { f, g} f, g C (T M; R),
and satisfies the Jacobi Identity
{ f, {g, h}} + {h, { f, g}} + {g, {h, f }} = 0 f, g, h C (T M; R).
Moreover, C (T M; R) is a Poisson algebra since the Poisson bracket satisfies the
Leibniz Rule
{ f, gh} = { f, g}h + g{ f, h} f, g, h C (T M; R).
In terms of the Poisson bracket Hamiltons equations take the form
137
138
A Physical Background
139
S ( (x1 , y1 ), (x2 , y2 ) ) = x1 , y2 y1 , x2
defines such a bilinear form. Any such S determines a (constant) symplectic form
on the manifold V by defining, at each p V,
p (v p , w p ) = S (v, w).
It would seem then that one can produce a wide variety of symplectic forms on
V by simply making various choices for S . These hopes are dashed, however, by
a theorem in linear algebra according to which these are, up to a change of linear
coordinates, all the same. The following is Proposition 1.3(ii) of [Bern].
Theorem A.3.1. (Linear Darboux Theorem) Let V be a 2n-dimensional real vector space, S : V V R a nondegenerate, skew-symmetric, bilinear form
on V, and the corresponding symplectic form on V. Then there exists a basis
{e1 , . . . , en , en+1 , . . . , e2n } for V with corresponding linear coordinates
(q1 , . . . , qn , p1 , . . . , pn ) such that
= dqi d pi = d(pi dqi ).
It is remarkable that this result extends locally to arbitrary symplectic manifolds.
For a very detailed proof of the following theorem of Darboux see Section 2.2 of
[Bern].
Theorem A.3.2. (Darboux Theorem) Let (P, ) be a symplectic manifold of dimension 2n. Then, at each point in P, there exists a local coordinate neighborhood U
with coordinates (q1 , . . . , qn , p1 , . . . , pn ) such that, on U,
= dqi d pi = d(pi dqi ).
Remark A.3.5. The essential content of the Darboux Theorem is that all symplectic manifolds of the same dimension are locally the same. This contrasts rather
markedly with Riemannian manifolds of the same dimension, for which the local
structure is determined by curvature. This gives symplectic geometry and topology
quite a dierent flavor from classical dierential geometry and topology (see, for
example, [MS]).
Example A.3.4. Let P be any orientable smooth surface (that is, orientable 2dimensional manifold) and a volume (area) form on P. Then is a 2-form and it
is closed because every 2-form on a 2-manifold is closed. It is also nondegenerate.
To see this we assume that V is a vector field on P for which V = 0 and will show
that V(p) = 0 at each p P. Since determines an orientation for P there exists, on
a neighborhood U of p, a chart with coordinates (x, y) for which ( x , y ) > 0 (see
Theorem 4.3.1 of [Nab3]). Then (y , x ) < 0 and ( x , x ) = (y , y ) = 0 on U.
140
A Physical Background
H
H
and p i = i , i = 1, . . . , n
pi
q
and the rate of change of any classical observable f C (P; R) along these integral
curves is locally given by
X H [ f ] = L XH f =
141
H f
H f
i
.
i
pi q
q pi
More generally, we define, for any f, g C (P; R), the Poisson bracket of f and g
determined by by
{ f, g} = (X f , Xg )
and find that, locally in canonical coordinates,
{ f, g} =
f g
f g
.
qi pi pi qi
Exercise A.3.2. Let (P, ) be an arbitrary symplectic manifold and let (q, p) =
(q1 , . . . , qn , p1 , . . . , pn ) be canonical coordinates on some open neighborhood U in
P. Let U denote the restriction of to U, that is, the pullback of to U by the
inclusion map U * P. Show that (U, U ) is a symplectic manifold and that the
canonical coordinate functions q1 , . . . , qn , p1 , . . . , pn C (U; R) satisfy the classical canonical commutation relations
{qi , q j } = {pi , p j } = 0 and {qi , p j } = ij , i, j = 1, . . . , n,
(A.8)
142
A Physical Background
(A.9)
143
symplectic vector field. That the converse is generally not true follows from the next
exercise.
Exercise A.3.3. Let X be a smooth vector field on (P, ). Prove each of the following.
1. X is symplectic if and only if the 1-form X is closed.
2. X is Hamiltonian if and only if the 1-form X is exact.
When the first de Rham cohomology group of P is trivial (for example, when P =
R2n ), every closed 1-form is exact so a vector field X on (P, ) is symplectic if and
only if it is Hamiltonian. More generally, since P is locally dieomorphic to R2n
any symplectic vector field on P is locally Hamiltonian.
A symmetry of the Hamiltonian system ((P, ), H) is a symplectic dieomorphism F : P P of P onto itself that preserves the Hamiltonian H in the sense
that
F H = H,
that is,
H F = H.
Thus, a symmetry is a dieomorphism of phase space that preserves both the symplectic form and the Hamiltonian. For the infinitesimal version we proceed as in the
Lagrangian case (see page 126). A smooth, complete vector field X on P is said to
be an infinitesimal symmetry of the Hamiltonian system ((P, ), H) if each t , t R,
in the 1-parameter group of dieomorphisms of X is a symmetry of ((P, ), H); if X
is not complete then one modifies the definition exactly as in the Lagrangian case
(page 126).
Infinitesimal symmetries generally arise from group actions of the following
type. Let G be a (matrix) Lie group with Lie algebra g and suppose : G P P,
(g, x) = g (x) = g x, is a smooth left action of G on P. If each of the dieomorphisms g : P P, g G, is a symmetry of the Hamiltonian system ((P, ), H),
then we refer to G as a symmetry group of ((P, ), H). In this case each nonzero
element of g gives rise to an infinitesimal symmetry X defined at each x P by
X (x) =
*
d t
(e x) **t=0 .
dt
144
A Physical Background
: P g .
Then for every x P, (x) is a real-valued linear map on g
(x) : g R
so that ((x))() is a real number for every g. We want to fix and and regard
this as a real-valued function on P, that is, we define
, : P R
by
, (x) = (x)().
Note: The notation is meant to suggest the natural pairing of g and g so that one
should think of , as being defined by
, (x) = (x), .
The map , is smooth so it has a symplectic gradient X, . We will say that is
a moment map, also called a momentum map, for the Hamiltonian system ((P, ), H)
with respect to the symmetry group G if
X, = X
for every g. The Hamiltonian version of Noethers Theorem A.2.1 asserts that
if is a moment map for the Hamiltonian system ((P, ), H) with respect to the
symmetry group G, then , is conserved for every g.
Theorem A.3.3. (Noethers Theorem: Hamiltonian Version) Let ((P, ), H) be a
Hamiltonian system, G a symmetry group of ((P, ), H) with Lie algebra g, and
: P g a moment map for ((P, ), H) with respect to G. Then, for every g,
the function
, : P R
defined by
, (x) = (x)()
for every x P is constant along the integral curves of the Hamiltonian vector field
XH . In particular, each , Poisson commutes with the Hamiltonian H.
/
0
, , H = 0
145
Proof. Let (t) be an integral curve of XH . Then, by definition, (t) = XH ((t)) for
each t. We compute, for each fixed g,
9
d8
, ((t)) = d, (t) ((t))
dt
= d, (t) (XH ((t)))
"
#
= X, (t) (XH ((t)))
= dH(t) (X ((t)))
+d
* ,
= dH(t)
(e s (t)) ** s=0
ds
*
#*
d"
=
H(e s (t)) ***
ds
s=0
# **
d"
=
H((t)) * s=0
ds
=0
as required.
This result does not address the issue of actually finding moment maps or the
question of their existence. As it happens a symmetry group for a Hamiltonian system need not have a moment map, although there are broad classes of such problems
for which the existence of moment maps can be proved; for an introduction to this
see, for example, Chapter 4 of [AM]. We will conclude this section with just one
simple example that should at least clarify the origin of the terminology.
Example A.3.5. We construct a Hamiltonian system ((P, ), H) as follows. Let P =
T R3 = R3 R3 be the tangent bundle of R3 with its canonical symplectic form
= dqi d pi = dq1 d p1 + dq2 d p2 + dq3 d p3 .
For the Hamiltonian we can take any smooth map H : T R3 R that satisfies
H(q + a, p) = H(q, p) for all (q, p) T R3 and any a R3 . One possibility is the
kinetic energy Hamiltonian
H(q, p) =
p2
,
2m
where m is a positive constant. Next we need to find a symmetry group for this
Hamiltonian system. Take G to be the additive group R3 . We think of G as the
translation group of R3 , that is, we define a left action : G P P of G on P by
146
A Physical Background
*
*
*
d t
d
d
*
e (q, p) **t=0 = (q + et , p) **t=0 = (q + t, p) **t=0 = i i **(q,p) ,
dt
dt
dt
q
X = i
.
qi
To find a moment map for this G-action on ((P, ), H) we need a smooth map
: P g for which X, = X . But X, is characterized by X, = d, so
what we need is
X = d, .
Exercise A.3.4. Show that X = X (dqi d pi ) is the 1-form
X = i d p i = 1 d p 1 + 2 d p 2 + 3 p 3 .
According to this exercise X will be equal to d, if we define in such a
way that , (q, p) = i pi = 1 p1 + 2 p2 + 3 p3 for each , that is,
((q, p))() = i pi
and this defines the moment map : P g . Notice that if we identify g with R3
via the coecients of the linear functional on g, then
147
(q, p) = p
so that, in this case, the moment map, or momentum map, really is momentum and
we conclude, as we did in Section A.2, that spatial translation symmetry implies the
conservation of momentum.
What we have tried to do in this section is merely suggest that the mathematical
structure of classical Hamiltonian mechanics is but a special case of the much more
general and very elegant subject of symplectic geometry. We have made no attempt
to capitalize on this by exploiting ideas from symplectic geometry to shed light on
mechanics; this is an enormous subject and there are many superb introductions to
it available (see, for example, [AM], [AMR], [Arn2] and [GS1]).
148
A Physical Background
Postulate QM1
To every quantum system is associated a separable, complex Hilbert space H. The
states of the system are represented by vectors H with = 1 and, for any
c C with |c| = 1, and c represent the same state.
In 1900 Max Planck [Planck] appended to time-honored concepts in classical
physics what we would today call an ad hoc quantization condition in order to
solve the classically perplexing problem of the equilibrium distribution of electromagnetic energy in a black box (this is described in some detail in Section 4.3
of [Nab5]). For the next quarter of a century physicists devised ever more ingenious such appendages to classical physics in order to solve the equally perplexing problem of atomic structure. These eorts had some limited success, but were
not directed by any underlying theoretical or mathematical model of the quantum
world. This changed in 1925-26 when Heisenberg [Heis1] and Schrodinger [Schro1]
each proposed such models. On the surface these looked quite dierent. Heisenbergs formalism eventually came to be known as matrix mechanics since it was expressed in terms of infinite arrays of complex numbers (see Section 7.1 of [Nab5]).
Schrodingers approach, called wave mechanics, represented the state of a quantum
system by a complex-valued wave function that was assumed to satisfy a certain
partial dierential equation, known today as the Schrodinger equation. Eventually it
came to be understood that these two are both physically and mathematically equivalent (see [Cas]). Naively, this equivalence can be understood in the following way.
Schrodingers wave function is most naturally regarded as an element of L2 (M),
where M is the configuration space of the classical mechanical system whose quantum counterpart is under consideration (for example, the classical 2-body problem
if one is interested in the hydrogen atom). But an orthonormal basis for L2 gives
rise to an isometric isomorphism onto l2 so anything one might want to say about
complex-valued square integrable functions can equally well be said in terms of infinite arrays of complex numbers. Indeed, all separable, infinite-dimensional, complex Hilbert spaces are isometrically isomorphic so, when von Neumann set himself
the task of constructing a mathematically rigorous, abstract setting for quantum theory, it was more natural to formulate it in terms of an arbitrary such Hilbert space
H. For any particular quantum system the choice of H then became a matter of
convenience and clarity.
The reason for assuming that the states are represented by unit vectors in H is
more subtle and will be addressed more thoroughly in Postulate QM3. Briefly, the
situation is as follows. In its early years (1925-1927) quantum physics was in a
rather odd position. Heisenberg formulated his matrix mechanics without knowing
what a matrix is and without having a precise idea of what the entries in his rectangular arrays of complex numbers should mean physically (see Section 7.1 of [Nab5]
for more on this). Nevertheless, the rules of the game as he laid them down predicted
precisely the spectrum of the hydrogen atom. Schrodinger formulated a dierential
equation for his wave function that yielded the same predictions, but no one had any
real idea what the wave function was supposed to represent (Schrodinger himself
initially viewed it as a sort of charge distribution). It was left to Max Born, and
149
then Niels Bohr and his school in Copenhagen, to supply the missing conceptual
basis for quantum mechanics.
Remark A.4.1. It is our good fortune that Born himself, in his Nobel Prize Lecture
in 1954, has provided us with a brief and very lucid account of the evolution of his
idea and we will simply refer those interested in pursuing this to [Born1]. Interestingly, Born attributes to Einstein the inspiration for the idea, although Einstein never
acquiesced to its implications.
As we will see, Born postulated that Schrodingers wave function should be interpreted as a probability amplitude. In particular, for a single particle moving along
a line, |(q)|2 is the probability
density function for the particles position, that is,
7
for any Borel set S in R, S |(q)|2 dq is the probability that a measurement of the
particles position7will result in a value in S . Since the particle is bound to be found
somewhere in R, R |(q)|2 dq = 1 so is a unit vector in L2 (R). All of this will be
spelled out in more detail quite soon.
Finally, we point out that, because unit vectors in H that dier only by a phase
factor describe the same state, one can identify the state space of a quantum system with the projectivization P(H) of H, that is, the quotient of the unit sphere
in H by the equivalence relation that identifies two points if they dier by a complex factor of modulus one. The mathematical structure of P(H) is spelled out in
more detail in Section 1.3, but for the moment we need only observe that, with the
quotient topology, P(H) is a Hausdor topological space. We will write for the
equivalence class containing and refer to it as a unit ray in H. It is sometimes also
convenient to identify the state represented by the unit vector with the operator P
that projects H onto the 1-dimensional subspace of H spanned by (which clearly
depends only on the state and not on the unit vector representing it). In all candor,
however, it is customary to be somewhat loose with the terminology and speak of
the state when one really means the state , or the state P . Since this is
almost always harmless, we will generally adhere to the custom.
Postulate QM2
For a quantum system with Hilbert space H, every observable is identified with
a (generally unbounded) self-adjoint operator A : D(A) H on H and any possible outcome of a measurement of the observable is a real number that lies in the
spectrum (A) of A.
Classically, one thinks of an observable associated with a physical system as
something specified by a real number that one can measure. For a single particle
moving in space one might think of a coordinate of the particles position, a component of its momentum, or its total energy. It would be more accurate, however,
to fully identify an observable with a specific measurement procedure not only because such things as position, momentum, and energy are defined operationally in
150
A Physical Background
physics by specifying how they are to be measured, but also because familiar classical concepts such as these do not always transition well into the quantum realm.
The problem of measurement in quantum mechanics is very subtle and, as we will
see in Postulate QM3, quantum theory makes no predictions whatsoever regarding
the outcome of any single measurement even when the state of the system being
measured is known with certainty. Repeated measurements on identical systems in
the same state need not give the same result. The results of these measurements
are constrained, however, by Postulate QM2 because every quantum observable has
a specific set of possible measured values, that is, (A). It turns out, for example,
that a quantum harmonic oscillator must have a total energy that lies in a countable,
discrete set of real numbers tending to infinity (see Section 7.4 of [Nab5]).
In the model we are in the process of constructing we require a functional analytic object associated with the quantum Hilbert space H that encodes the essential
physical content of this notion of a quantum observable. This essential content is
the observables set of possible measured values. Now recall that a closed, symmetric operator A on H has a spectrum (A) that consists entirely of real numbers.
This suggests that one might try to identify a quantum observable such as the total
energy with some appropriately chosen closed, symmetric operator whose spectrum is equal to (or, at least, contains) the set of possible measured values. Such an
idea has analogues in classical physics. In continuum mechanics, for example, the
stress tensor of a 3-dimensional material body is a symmetric linear operator on R3
whose matrix in any Cartesian coordinate system is obtained from the stress components within the body in directions perpendicular to the coordinate planes and
whose eigenvalues are the principal stresses, that is, those that are independent of
the coordinate system (see [Gurtin]).
Postulate QM2, however, specifies that an observable is represented by a selfadjoint operator whose spectrum contains the set of possible measured values. Now,
every self-adjoint operator is closed and symmetric, but the converse is not true so
this is a strictly stronger requirement. The rationale behind this is largely mathematical rather than physical. The formalism of quantum mechanics depends crucially on two of the pillars of functional analysis, namely, the Spectral Theorem and
Stones Theorem, and both of these require self-adjointness (Section 5.5 of [Nab5]
or Chapter IX, Section 9, and Chapter XI, Section 6, of [Yos]). Notice that if the set
of possible measured values is unbounded (as it is for the harmonic oscillator, for
example), then the spectrum must be an unbounded subset of R and therefore the
operator itself must be unbounded.
Postulate QM3
Let H be the Hilbert space of a quantum system, H a unit vector representing
a state of the system and A : D(A) H a self-adjoint operator on H representing
an observable. Let E A be the unique projection-valued measure on R associated
with A by the Spectral Theorem and let {EA }R be the corresponding resolution of
the identity. Denote by ,A the probability measure on R that assigns to every Borel
151
set S R the probability that a measurement of A when the state is will yield a
value in S . Then, for every Borel set S in R,
,A (S ) = , E A (S ) = E A (S )2 .
If the state is in the domain of A, then the expected value of A in state is
6
A =
d, EA = , A
R
We should first be clear on how we will interpret the sort of probabilistic statement made in Postulate QM3. Given a quantum system in state and an observable
A, quantum mechanics generally makes no predictions about the result of a single
measurement of A. Rather, it is assumed that the state can be replicated over and
over again and the measurement performed many times on these identical systems.
The probability of a given outcome might then be thought of intuitively as the relative frequency of the outcome for a large number of repetitions. More precisely,
the probability of a given outcome for a measurement of some observable in some
state is the limit of the relative frequencies of this outcome as the number of repetitions of the measurement in this state approaches infinity. Given this interpretation
the formulas for the expected value and the dispersion are just those borrowed from
probability theory. In particular, the dispersion 2 (A) is zero precisely when a measurement of A in state will result in the expected value A with probability 1;
this, of course, does not mean that every measurement of A in state will result in
A . It follows from the Spectral Theorem that 2 (A) = 0 if and only if is an
eigenvector of A with eigenvalue A (Section 6.2 of [Nab5]). As we mentioned
earlier, this probabilistic interpretation of quantum mechanics is due to Max Born
and one can consult [Born1] to learn how the interpretation evolved in the early days
of quantum mechanics..
The assertion of Postulate QM3 that ,A (S ), which is defined physically, is given
by ,A (S ) = , E A (S ) is called the Born-von Neumann formula and is the link
between the mathematical formalism and the physics. Its either true, or its not and,
in principle, this can be decided in the laboratory. Choose a self-adjoint operator
A to represent the physical observable you have in mind and compute its spectral
resolution. Then make many measurements of your physical observable in some
fixed state to determine ,A . Now compare the observations ,A (S ) with the
computations , E A (S ). If they are the same, all is well; if not, you either made
the wrong choice for the operator A or the Born-von Neumann formula is wrong.
The evidence would suggest that the formula is correct. To gain some appreciation
of how all of this works in practice one might look at the concrete calculations of
152
A Physical Background
,A , A , and 2 (A) for various operators and states in Examples 6.2.2 - 6.2.7 of
[Nab5].
Before moving on we would like to record a general result on dispersions of selfadjoint operators that is related to the uncertainty relations of quantum mechanics.
The following is Lemma 6.3.1 of [Nab5].
Lemma A.4.1. Let H be a separable, complex Hilbert space, A : D(A) H and
B : D(B) H self-adjoint operators on H, and and real numbers. Then, for
every D([A, B]) = D(AB) D(BA),
(A ) 2 (B ) 2
*2
1 **
* , [A, B] ** .
4
Now, if A and B represent observables and D([A, B]) is a unit vector representing a state and if we take and to be the corresponding expected values of
A and B in this state, then it is shown in Section 6.3 of [Nab5] that Lemma A.4.1
implies
2 (A) 2 (B)
*2
1 **
* , [A, B] ** .
4
*
1 **
* , [A, B] **.
2
In Section 6.3 of [Nab5] this is applied to the operators representing the position Q
and momentum P of a single particle moving along a line to obtain
(Q) (P)
"
,
2
(A.10)
h
where " = 2
is the reduced Planck constant. This is obviously a statistical statement
about position and momentum measurements for such a particle. It is quite common,
and totally incorrect, to see it identified with the famous Heisenberg Uncertainty
Principle which, as Heisenberg phrased it, states that
q p
"
,
2
(A.11)
where q and p are the uncertainty, or inaccuracy in the simultaneous measurements of position and momentum for a single particle. Section 6.3 of [Nab5]
discusses in some detail the physical origin of Heisenbergs Uncertainty Principle,
its current experimental status and the fact that it has nothing whatsoever to do with
the inequality (A.10).
153
154
A Physical Background
|, |2
2 2
for any and any . Wigner defined a symmetry of the quantum system
whose Hilbert space is H to be a bijection T : P(H) P(H) that preserves transition probabilities in the sense that (T ( ), T ()) = (, ) for all , P(H).
Notice that, although P(H) has a natural quotient topology, no continuity assumptions are made. Any unitary or anti-unitary operator U on H induces a symmetry
T U that carries any representative of to the representative U() of T U ( ). What
Wigner proved was that every symmetry is induced in this way by a unitary or antiunitary operator. The result is, in fact, a bit more general. There is a detailed proof
of the following result in [Barg].
Theorem A.4.2. (Wigners Theorem on Symmetries) Let H1 and H2 be complex,
separable Hilbert spaces and T : P(H1 ) P(H2 ) a mapping of P(H1 ) into P(H2 )
satisfying
(T ( ), T ())P(H2 ) = (, )P(H1 )
for all , P(H1 ). Then there exists a mapping U : H1 H2 satisfying
1. U() T ( ) if P(H1 ),
155
Furthermore, if H1 and H2 are of dimension at least 2 and T ([1 ]) = T ([2 ]), where
1 and 2 are unit vectors in H1 , then there is a unique such mapping U : H1 H2
for which U(1 ) = U(2 ).
In particular, any symmetry of a quantum system with Hilbert space H arises from
an operator on H that is either unitary or anti-unitary. The anti-unitary operators
correspond physically to discrete symmetries such as time reversal (they are the
analogue of reflections in Euclidean geometry).
Now we can justify the unitarity assumption in Postulate QM4. Suppose that
the time evolution is described by an assignment to each t R of a symmetry
t : P(H) P(H) and that t t satisfies t+s = t s for all t, s R. Then, for
any t R, t = 2t/2 . By Wigners Theorem, t/2 is represented by an operator Ut/2
that is either unitary or anti-unitary. Since the square of an operator that is either
unitary or anti-unitary is necessarily unitary, every Ut must be unitary.
Remark A.4.2. We investigate the consequences of Wigners Theorem more thoroughly in Section 1.3, but a few remarks here would seem to be in order. A bijection
T : P(H) P(H) preserving transition probabilities that is a homeomorphism
with respect to the quotient topology is called an automorphism of P(H) and the
collection of all such is clearly a group under composition. This is called the automorphism group of P(H) and is denoted Aut(P(H)). Wigners Theorem will permit
us, in Section 1.3, to provide Aut(P(H)) with a natural topology with respect to
which it is a Hausdor topological group. Now suppose that G is a Lie group. A
continuous homomorphism of G into Aut(P(H)) is called a projective representation of G on H. If H is the Hilbert space of some quantum system, then such a
projective representation assigns a symmetry to each element of G in such a way
that the group operations are respected and this leads us to refer to G as a symmetry
group of the quantum system. When G is the Poincare group P+ , the existence of
such a projective representation is the natural expression of the relativistic invariance of the quantum system.
Notice that there is nothing special about t = 0 in Postulate QM4. If t0 is any
real number, then (t0 ) = Ut0 ((0)) so (t + t0 ) = Ut+t0 ((0)) = Ut (Ut0 ((0))) =
Ut ((t0 )) and therefore
(t) = ((t t0 ) + t0 ) = Utt0 ((t0 )) = ei(tt0 )H/" ((t0 )).
Thus,
Utt0 = ei(tt0 )H/"
propagates the state at time t0 to the state at time t for any t0 , t R. It follows from
this that if (t) is thought of as a curve in H and (t0 ) is in the domain of H, then,
by Stones Theorem, (t) is in the domain of H for all t R and
156
A Physical Background
i"
d(t)
= H((t)),
dt
(A.12)
(A.13)
and the limit is in H. Equation (A.12) is called the abstract Schrodinger equation.
This, however, is not the way one generally sees the Schrodinger equation written
in the physics literature. When H is, for example, L2 (Rn ), then each state of the
system is represented by a complex-valued, square integrable wave function (q)
on Rn with L2 -norm 1. In this case the Hamiltonian H is generally a dierential
operator of the form
H = H0 + V(q) =
"2
+ V(q),
2m
(A.14)
(q, t)
"2
=
(q, t) + V(q)(q, t).
t
2m
(A.15)
The point we would like to stress, however, is that writing the Schrodinger equation in the form (A.15) involves a fair amount of potentially hazardous notational
subterfuge. The abstract Schrodinger equation (A.12) contains in it the t-derivative
d(t)/dt which is defined as the limit in L2 (Rn ) of the familiar dierence quotient
(see (A.13)). The partial derivative (q, t)/t that appears in the traditional physicists form (A.15) of the Schrodinger equation is, on the other hand, defined as a
limit in C of an equally familiar dierence quotient. In general, there is no reason to
suppose that the latter exists for L2 (Rn ) and, even if it does, that it is in L2 (Rn )
for each t and, granting even this, that it is equal to d(t)/dt. Sucient regularity
assumptions on (q, t) will guarantee that all of these things are true (Exercise 6.2.1
of [Nab5]), but even these may not be enough to ensure that, for each t, the distribution is actually a function in L2 (Rn ) and, if this is not the case, we have no
business equating it in (A.15) to something that is a function in L2 (Rn ). All of these
diculties can be willed away by restricting attention to functions (q, t) that are
157
suciently regular, say, continuously dierentiable with respect to t and, for each t,
a Schwartz function or smooth function with compact support on Rn . The hope then
is that one can find suciently many such classical solutions to (A.15) to provide
an orthonormal basis for L2 (Rn ) for each t. Example 5.3.1 of [Nab5] shows that, at
least in the case of the harmonic oscillator, ones hopes are not dashed.
We will conclude this section with a brief synopsis the salient features of the
picture of quantum mechanics we have painted thus far and then suggest a rather
dierent way of looking at it that will bear a striking resemblance to the picture of
classical Hamiltonian mechanics in Section A.3. A quantum system has associated
with it a complex, separable Hilbert space H and a distinguished self-adjoint operator H, called the Hamiltonian of the system. The states of the system are represented
by unit vectors in H and these evolve in time from an initial state (0) according to (t) = Ut ((0)) = eitH/" ((0)). As a result, the evolving states satisfy the
abstract Schrodinger equation
i"
d(t)
= H((t)).
dt
(A.16)
Each observable is identified with a self-adjoint operator A that does not change
with time. Neither the state vectors nor the observables A are accessible to direct
experimental measurement. Rather, the link between the formalism and the physics
is contained in the expectation values A = , A. Knowing these one can construct the probability measures ,A (S ) = , E A (S ) and these contain all of the
information that quantum mechanics permits us to know about the system.
We would now like to look at this from a slightly dierent point of view. As the
state evolves so do the expectation values of any observable. Specifically,
A(t) = (t), A(t) = Ut ((0)), AUt ((0)) = (0), [Ut1 AUt ] (0),
because each Ut is unitary. Now, define a (necessarily self-adjoint) operator
A(t) = Ut1 AUt
for each t R. Then
A(t) = A(t)(0)
for each t R. The expectation value of A in the evolved state (t) is the same
as the expectation value of the observable A(t) in the initial state (0). Since all of
the physics is contained in the expectation values this presents us with the option of
regarding the states as fixed and the observables as evolving in time. From this point
of view our quantum system has a fixed state and the observables evolve in time
from some initial self-adjoint operator A = A(0) according to
A(t) = Ut1 AUt = eitH/" AeitH/" .
(A.17)
158
A Physical Background
(A.18)
159
- A(t + t) A(t) .
dA(t)
= lim
t0
dt
t
and [A(t), H] is the commutator of A(t) and H on D(H) , that is, [A(t), H] =
A(t)(H) H(A(t)) for every D(H).
Lets simplify the notation a bit and write (A.18) as
9
dA
i8
= A, H .
dt
"
(A.19)
df
= { f, H}
dt
describing the time evolution of a classical observable in the Hamiltonian picture of
mechanics. The analogy is striking and suggested to Paul Dirac [Dirac1] a possible
avenue from classical to quantum mechanics, that is, a possible approach to the
quantization of classical mechanical systems. The idea is that classical observables
should be replaced by self-adjoint operators and the Poisson bracket { , } by the
quantum bracket
/ 0
i8 9
, "= , .
"
Lets spell this out in a bit more detail. Diracs suggestion was to find a linear map
from the classical observables f, g, . . . to the quantum observables F, G, . . . with the
property that
i
{ f, g} $ {F, G}" = [F, G].
"
If one further stipulates that the constant function 1 should map to the identity operator I then this implies that the classical canonical commutation relations (A.8)
{qi , q j } = {pi , p j } = 0 and {qi , p j } = i j , i, j = 1, . . . n
(A.20)
(A.21)
(A.22)
Presumably the image under this map of the classical Hamiltonian would be the
appropriate quantum Hamiltonian and with this in hand the analysis of the quan-
160
A Physical Background
tum system could commence. The extent to which Diracs program can actually be
carried out is discussed in some detail in Section 7.2 of [Nab5].
Example A.4.1. We will conclude with a brief synopsis of the quantum system
describing a single particle of mass m moving in R3 under the influence of a timeindependent potential V(q) = V(q1 , q2 , q3 ). The corresponding problem for motion
in R is treated in considerable detail in [Nab5] and we will provide references for
those results whose extension from one to three spatial dimensions is not routine.
Remark A.4.5. We should point out that, in this example, we will be ignoring an
important quantum mechanical property of elementary particles called spin. We will
take up this subject and see what modifications of our present discussion are required
in Section A.5.
The Hilbert space H of our quantum system is taken to be L2 (R3 ), that is, the
Hilbert space of (equivalence classes of) complex-valued, square integrable functions on R3 with respect to Lebesgue measure. The dynamics of the system is
governed, via the Schrodinger equation, by the Hamiltonian H which must be a
self-adjoint operator on L2 (R3 ) representing the total energy of the system. The total energy is the sum of the kinetic energy and the potential energy so H will be
the sum of two self-adjoint operators. The kinetic term is always the same and is
defined in the following way. Begin by looking at the subspace C0 (R3 ) of L2 (R3 )
consisting of smooth, complex-valued functions with compact support on R3 (or the
Schwartz space S(R3 ) of rapidly decreasing complex-valued functions on R3 ). On
this subspace the ordinary Laplacian is defined and essentially self-adjoint (see
pages 395-396 of [Nab5]) so the same is true of
H0 =
"2
.
2m
161
"2
+V
2m
(A.23)
162
A Physical Background
(A.24)
Example 5.2.6 of [Nab5] gives two proofs of the corresponding statement for motion
in one spatial dimension that can easily be adapted to the 3-dimensional context in
which we currently find ourselves; alternatively, Section 4.5, Chapter III, of [Prug]
provides a dierent argument that proves self-adjointness in any number of spatial
dimensions.
Next we would like to define, for each j = 1, 2, 3, the jth -coordinate momentum
operator P j on L2 (R3 ). One begins by showing that the operator
P j = i"
q j
(A.25)
is defined and essentially self-adjoint on C0 (R3 ) (or S(R3 )) and taking the momentum operator, denoted with the same symbol, to be its unique self-adjoint extension
(motivation for the corresponding definition in one spatial dimension is provided in
Remark 6.2.15 of [Nab5]). The domain D(P j ) is best viewed in the following way.
The Fourier transform F : L2 (R3 ) L2 (R3 ) is a unitary operator on L2 (R3 ). For
in C0 (R3 ) (or S(R3 )) one has P j = (F1 ("Q j )F). Consequently,
P j = F1 ("Q j )F
(A.26)
on C0 (R3 ) (or S(R3 )). Since each of these is dense in L2 (R3 ), (A.26) is true everywhere on L2 (R3 ). In particular, P j is unitarily equivalent to Q j and the domain of
P j is
D(P j ) = F1 (D(Q j )).
(A.27)
One often sees (A.26) taken as the definition of P j in which case the self-adjointness
of P j follows from its unitary equivalence to Q j (Lemma 5.2.5 of [Nab5]).
Finally, we would like to write out the operators on L2 (R3 ) that represent the
quantum analogues of the classical components of orbital angular momentum. The
motivation is to be found in the infinitesimal generators (A.3), (A.4) and (A.5) of
classical angular momentum. Specifically, we begin by defining operators M 23 , M 31
and M 12 on C0 (R3 ) (or S(R3 )) by
+
,
M 23 = Q2 P3 Q3 P2 = i " q3 2 q2 3
q
q
(A.28)
,
M 31 = Q3 P1 Q1 P3 = i " q1 3 q3 1
q
q
(A.29)
,
M 12 = Q1 P2 Q2 P1 = i " q2 1 q1 2
q
q
(A.30)
163
F i q j (p) =
F()(p)
p j
F
1. [(FQ j F1 )](p)
= i p j (p),
j = 1, 2, 3
1
,
M 23 = i " p3
p2
,
p2
p3
(A.31)
,
M 31 = i " p1
p3
,
p3
p1
(A.32)
,
M 12 = i " p2
p1
.
p1
p2
(A.33)
164
A Physical Background
A.5 Spin
We will try to provide some sense of what this phenomenon of spin is and how
the physicists have incorporated it into their mathematical model of the quantum
world. There is nothing like quantum mechanical spin in classical physics. There
is, however, a classical analogy. The analogy is inadequate and can be misleading if
taken too seriously, but it is the best we can do so we will begin by briefly describing
it.
Imagine a spherical mass m of radius a moving through space on a circular orbit
of radius R a about some point O and, at the same time, spinning around an
axis through one of its diameters (to a reasonable approximation, the Earth does
all of this). Due to its orbital motion, the mass has an angular momentum L =
r (mv) = r p, where r is the position vector from O and v = r is the velocity,
which we call its orbital angular momentum.. The spinning of the mass around its
axis contributes additional angular momentum that one calculates by subdividing
the spherical region occupied by the mass into subregions, regarding each subregion
as a mass in a circular orbit about a point on the axis, approximating its angular
momentum, adding all of these and taking the limit as the regions shrink to points.
The resulting integral gives the angular momentum due to rotation. This is called
the rotational angular momentum, is denoted S, and is given by
S = I,
where I is the moment of inertia of the sphere and is the angular velocity ( is
along the axis of rotation in the direction determined by the right-hand rule from
the direction of the rotation). If the mass is assumed to be uniformly distributed
throughout the sphere (in other words, if the sphere has constant density), then an
exercise in calculus gives
S=
2 2
ma .
5
q
L,
2m
where q is the charge of the sphere (which can be positive or negative). Similarly,
the rotational angular momentum of the charge gives rise to a magnetic field that is
A.5 Spin
165
q
S.
2m
q
(L + S).
2m
The significance of the magnetic moment of the dipole is that it describes the
strength and direction of the dipole field and determines the torque
=B
experienced by the magnetic dipole when placed in an external magnetic field B. If
the magnetic field B is uniform (that is, constant), then its only eect on the dipole
is to force to precess around a cone whose axis is along B in the same way that
the axis of a spinning top precesses around the direction of the Earths gravitational
field (see Figure A.1 and Section 2, Chapter 11, of [Eis]). Notice that this precession
does not change the projection B of along B.
166
A Physical Background
Remark A.5.1. Silver is a good choice, but for reasons that are not so apparent. A
proper explanation requires some hindsight (not all of the information was available to Stern and Gerlach) as well as some quantum mechanical properties of atoms
that we have not discussed here. Nevertheless, it is worth saying at least once since
otherwise one is left with all sorts unanswered questions about the validity of the
experiment. So, here it is. The stable isotopes of Ag have 47 electrons, 47 protons
and either 60 or 62 neutrons so, in particular, they are electrically neutral. Since the
magnetic moment is inversely proportional to the mass and since the mass of the
proton and neutron are each approximately 2000 times the mass of the electron, one
can assume that any magnetic moments of the nucleons will have a negligible eect
on the magnetic moment of the atom and can therefore be ignored. Of the 47 electrons, 46 are contained in contained in closed, inner shells (energy levels) and these,
it turns out, can be represented as a spherically symmetric cloud with no orbital or
A.5 Spin
167
rotational angular momentum (this is not at all obvious). The remaining electron is
in what is termed the outer 5s-shell and an electron in an s-state has no orbital angular momentum (again, not obvious). Granting all of this, the only possible source of
any magnetic moment for a Ag atom is a rotational angular momentum of its outer
5s-electron. Whatever happens in the experiment is attributable to the electron and
the rest of the silver atom is just a package designed to ensure this.
We will first see what the classical picture of an electron with a rotational magnetic moment would lead us to expect in the Stern-Gerlach experiment and will then
describe the results that Stern and Gerlach actually obtained (a more thorough, but
quite readable account of the physics is available Chapter 11 of [Eis]). For this we
will need to be more specific about the magnetic field B that we intend to send the
Ag atoms through. Lets introduce a coordinate system in Figure A.2 in such a way
that the Ag atoms move in the direction of the y-axis and the vertical axis of symmetry of the magnet is along the z-axis so that the x-axis is perpendicular to both
of these. The magnet itself can be designed to produce a field that is non-uniform,
but does not vary with y, is predominantly in the z-direction, and is symmetric with
respect to the yz-plane. The interaction between the neutral Ag atom (with magnetic
moment ) and the non-uniform magnetic field B provides the atom with a potential
energy B so that the atom experiences a force
F = ( B) = ( x Bx + y By + z Bz ).
For the sort of magnetic field we have just described, By = 0 and Bz dominates Bx .
From this one finds that the translational motion is governed primarily by
F z z
Bz
z
(see pages 333-334 of [Eis]). The conclusion we draw from this is that the displacements from the intended path of the silver atoms will be in the z-direction (up and
down in Figure A.2) and the forces causing these displacements are proportional to
the z-component of the magnetic moment. Of course, dierent orientations of the
magnetic moment among the various Ag atoms will lead to dierent values of z
and therefore to dierent displacements. Moreover, due to the random thermal effects of the furnace, one would expect that the silver atoms exit with their magnetic
moments randomly oriented in space so that their z-components could take on
any value in the interval [ ||, || ]. As a result, the expectation based on classical
physics would be that the deflected Ag atoms will impact the photographic plate at
points that cover an entire vertical line segment (see the segment labeled Classical
prediction in Figure A.2).
Remark A.5.2. Writing q = e for the charge of the electron and m = me for its
mass we find that
168
A Physical Background
F z z
Bz
e
Bz
=
Sz
z
2me
z
3
5
n1
1
, 1, , 2, , . . . ,
, ... ,
2
2
2
2
A.5 Spin
169
called baryons which have spin 32 , but you dare not blink if youre looking for one
of these since their mean lifetime is about 5.63 1024 seconds. Among the bosons,
the very recently observed Higgs boson has spin 0, whereas the conjectured, but not
yet observed graviton has spin 2. The photon has spin 1, but it is massless and so
does not quite fit into the m > 0 picture we have been discussing.
We have seen that the classical vector S used to describe the rotational angular
momentum does not travel well into the quantum domain where it simply does not
behave the way one expects a vector to behave. Nevertheless, it is still convenient
to collect together the quantities S x , S y and S z , measured, for example, by a SternGerlach apparatus aligned along the x-, y- and z-axes, and refer to the triple
S = (S x , S y , S z )
as the spin vector. Quantum theory decrees that, for a particle with spin quantum
number s, the only allowed values for the components S x , S y and S z are
s", (s 1)", . . . , (s 1)", s".
In particular, for a spin
values so, for example,
1
2
(A.34)
With this synopsis of the general situation behind us we will return to the particular case of spin 12 . We know that the classical picture of the electron as a tiny
spinning ball cannot describe what is actually observed so we must look for another
picture that can do this. Whatever this picture is it must be a quantum mechanical
one so we are looking for a Hilbert space H and some self-adjoint operators on it
to represent the observables S x , S y and S z . Previously we represented the state of
the electron by a wave function (x, y, z) that is in L2 (R3 ), but we now know that
the state of a spin 12 particle must depend on more that just x, y, and z since this
alone cannot tell us which of the two paths an electron is likely to follow in a SternGerlach apparatus; we say likely to because we can no longer hope to know more
than probabilities. What we would like to do is isolate some appropriate notion of
the spin state of the particle that will provide us with the information we need
to describe these probabilities. Now, we know that the only possible values of S z
are "2 . By analogy with the classical situation one might view this as saying that
the spin vector S can only be either up or down, but nothing in-between. This
suggests that we consider wave functions
(x, y, z, )
(A.35)
that depend on x, y, z, and an additional discrete variable that can take only two
values, say, = 1 and = 2 (or, if you prefer, = up and = down). Then
| (x, y, z, 1) |2 would represent the probability density for locating the electron at
(x, y, z) with S z = "2 and similarly | (x, y, z, 2) |2 is the probability density for locat-
170
A Physical Background
ing the electron at (x, y, z) with S z = "2 . Stated this way it sounds a little strange,
but notice that this is precisely the same as describing the state of the electron with
two functions 1 (x, y, z) = (x, y, z, 1) and 2 (x, y, z) = (x, y, z, 2) and this is what
we will do. Specifically, we will identify the wave function of a spin 12 particle with
a (column) vector
1
2
1 (x, y, z)
,
2 (x, y, z)
where 1 and 2 are in L2 (R3 ) and
6
( | 1 (x, y, z) |2 + | 2 (x, y, z) |2 ) d = 1
R3
"
2
or
H = L2 (R3 ; C2 ) ! L2 (R3 ) C2 .
Now we must isolate self-adjoint operators on H to represent the observables
S x , S y and S z . Since these observables represent an intrinsic property of a spin 12
particle, independent of x, y and z, we will want the operators to act only on the
spin coordinates 1 and 2 and the action should be constant in (x, y, z). Thus, we are
simply looking for 22 complex, self-adjoint (that is, Hermitian) matrices or, stated
otherwise, self-adjoint operators on C2 . Since the only possible observed values are
"2 , these must be the eigenvalues of each matrix. There are, of course, many such
matrices floating around and we must choose three of them. Our choice is motivated
by the desire to keep spin angular momentum and orbital angular momentum on
the same formal footing since, classically at least, they really are the same thing.
We accomplish this by insisting that the operators satisfy the same commutation
relations as those of the orbital angular momentum.
Recall that the generators M23 = J1 , M31 = J2 , and M12 = J3 in p can be realized
as the components of the orbital angular momentum operators with " = 1 and that
these satisfy the commutation relations (2.27), that is,
[J j , Jk ] = i jkl Jl , j, k = 1, 2, 3.
Including a factor of " in each operator these become
[J j , Jk ] = i" jkl Jl , j, k = 1, 2, 3.
Thus, we need to find operators S 1 , S 2 , and S 3 on C2 each of which has eigenvalues
"2 and that satisfy
[S j , S k ] = i" jkl S l , j, k = 1, 2, 3.
(A.36)
A.5 Spin
171
Sj=
"
j , j = 1, 2, 3.
2
(A.37)
Exercise A.5.1. Show that S 1 , S 2 , and S 3 given by (A.37) satisfy the required conditions.
Exercise A.5.2. Define the operator S2 on C2 by S2 = S 12 + S 22 + S 32 and show that
S2 =
+ 1 ,+ 1
2 2
,
+ 1 "2 idC2 .
3
4
In order to make some direct contact with the material at the end of Section 2.8
we recommend the following exercise.
Exercise A.5.3. Let D(1/2) be the spin 12 representation of SU(2) on C(D(1/2) ) = C2
and let M23 , M31 , and M12 be the generators of rotations in p. Show that
i"
i"
i"
S2
= 2
=
2 (x, y, z)
2 (x, y, z)
2
2 i1 (x, y, z)
1
2
1
2
1
2
"
" 1 (x, y, z)
1 (x, y, z)
1 (x, y, z)
S3
= 3
=
2 (x, y, z)
2 (x, y, z)
2
2 2 (x, y, z)
172
A Physical Background
We call S j , j = 1, 2, 3, the spin operators for spin 12 particles, although the terminology is often applied to the S j , j = 1, 2, 3, themselves. The physical observables
corresponding to these operators are the analogue of the classical spin angular momentum and are referred to either as the spin components or the intrinsic angular
momentum components of the spin 12 particle. These all have the same eigenvalues
"2 and the largest of these eigenvalues "2 is the spin of the particle (the term spin
1
2 tacitly assumes a factor of " or a choice of units in which " = 1). In light of Exercise A.5.3 this coincides with the spin parameter for D(1/2) introduced in Section
2.8 when the units are chosen so that " = 1.
The scheme is precisely the same for particles of arbitrary spin s = j/2. These
have wave functions with 2s + 1 components corresponding to the 2s + 1 dots on
the Stern-Gerlach apparatus so that the Hilbert space is H = L2 (R3 ) C2s+1 . The
spin operators S 1 , S 2 , S 3 on C2s+1 are again required to satisfy the commutation
relations (A.36) of angular momentum and have eigenvalues that are the observed
values (A.34) of the spin components. One then notices that C(D(s) ) ! C2s+1 and
that a set of such matrices is given by
S 1 = i"
S 2 = i"
S 3 = i"
We will have no need to write these out explicitly, but for those who would like to
see S 1 , S 2 , and S 3 in one more case we recommend the following exercise for spin
s = 1.
Exercise A.5.4. Define 3 3 matrices S 1 , S 2 , and S 3 by
0 1 0
0 i 0
1 0 0
"
"
S 1 = 1 0 1 S 2 = i 0 i S 3 = " 0 0 0
2 0 1 0
2 0 i 0
0 0 1
A.5 Spin
173
S 2 = i"
S 3 = i"
We point out that, generalizing Exercise A.5.2 and Exercise A.5.4 (3), one finds
that, for any s,
S2 = S 12 + S 22 + S 32 = s(s + 1)"2 idC2s+1
The corresponding spin operators on the Hilbert space
H = L2 (R3 ) C2s+1 ! L2 (R3 ) C(D(s) )
are then
S j = idL2 (R3 ) S j , j = 1, 2, 3.
References
AM.
Abraham, R. and J.E. Marsden, Foundations of Mechanics, Second Edition, AddisonWesley, Redwood City, CA, 1987 .
AMR. Abraham, R., J.E. Marsden and T. Ratiu, Manifolds, Tensor Analysis, and Applications,
Second Edition, Springer, New York, NY, 2001.
AS.
Aharonov, Y. and L. Susskind, Observability of the Sign Change of Spinors Under 2
Rotations, Phys. Rev., 158, 1967, 1237-1238.
AMS. Aitchison, I.J.R., D.A. MacManus, and T.M. Snyder, Understanding Heisenbergs Magical Paper of July 1925: A New Look at the Computational Details, Am. J. Phys. 72
(11), November 2004, 1370-1379.
Alb.
Albert, D., Quantum Mechanics and Experience, Harvard University Press, Cambridge,
MA, 1992.
AHM. Albeverio, S.A., R.J. Hoegh, and S. Mazzucchi, Mathematical Theory of Feynman Path
Integrals: An Introduction, Second Edition, Springer, New York, NY, 2008.
Alex.
Alexandrov, A.D., On Lorentz Transformations, Uspehi Mat. Nauk, 5, 3(37), 1950, 187
(in Russian)
AAR. Andrews, G.E., R. Askey, and R. Roy, Special Functions, Cambridge University Press,
Cambridge, England, 2000.
Apos. Apostol, T.M., Mathematical Analysis, Second Edition, Addison-Wesley Publishing Co.,
Reading, MA, 1974.
Arn1. Arnold, V.I., Ordinary Dierential Equations, MIT Press, Cambridge, MA, 1973.
Arn2. Arnold, V.I., Mathematical Methods of Classical Mechanics, Second Edition. Springer,
New York, NY, 1989.
ADR. Aspect, A., J. Dalibard, and G. Roger, Experimental Test of Bells Inequalities Using
Time-Varying Analyzers, Physical Review Letters, Vol. 49, No. 25, 1982, 1804-1807.
Bal.
Ballentine, L.E., The Statistical Interpretation of Quantum Mechanics, Rev. Mod. Phys.,
Vol 42, No 4, 1970, 358-381.
Barg.
Bargmann, V., Note on Wigners Theorem on Symmetry Operations, J. Math. Phys., Vol
5, No 7, 1964, 862-868.
BW.
Bargmann, V. and E.P. Wigner, Group Theoretical Discussion of Relativistic Wave Equations, Proc. Nat. Acad. Sci., Vol 34, 1948, 211-223.
BB-FF. Barone, F.A., H.Boschi-Filho, and C. Farina, Three Methods for Calculating the Feynman
Propagator, Am. J. Phys., 71, 5, 2003, 483-491.
Bar.
Barut, A.O. and R. Raczka, Theory of Group Representations and Applications, Polish
Scientific Publishers, Warszawa, Poland, 1977.
Bell.
Bell, J.S., On the Einstein, Podolsky, Rosen Paradox, Physics 1, 3, 1964, 195-200.
BS.
Berezin, F.A. and M.A. Shubin, The Schrodinger Equation, Kluwer Academic Publishers, Dordrecht, The Netherlands, 1991.
175
176
BGV.
References
Berline, N., E. Getzler, and M. Vergne, Heat Kernels and Dirac Operators, Springer,
New York, NY, 2004.
Bern.
Berndt, R. An Introduction to Symplectic Geometry, American Mathematical Society,
Providence, RI, 2001.
BP.
Bernstein, H.J. and A.V. Phillips, Fiber Bundles and Quantum Theory, Scientific American, 245, 1981, 94-109.
BG.
Bishop, R.L. and S. Goldberg, Tensor Analysis on Manifolds, Dover Publications, Inc.,
Mineola, NY, 1980.
BD1.
Bjorken, J.D. and S.D. Drell, Relativistic Quantum Mechanics, McGraw-Hill Book
Company, New York, 1964.
BD2.
Bjorken, J.D. and S.D. Drell, Relativistic Quantum Fields, McGraw-Hill Book Company,
New York, 1965.
BEH.
Blank, J., Exner, P., and M. Havlicek, Hilbert Space Operators in Quantum Physics,
Second Edition, Springer, New York, NY, 2010.
Blee.
Bleecker, D., Gauge Theory and Variational Principles, Addison-Wesley Publishing
Company, Reading, MA, 1981.
BLT.
Bogolubov, N.N., A.A. Logunov, and I.T. Todorov, Introduction to Axiomatic Quantum
Field Theory, W.A. Benjamin, Inc., Reading, MA, 1975.
Bohm. Bohm, D., Quantum Theory, Dover Publications, Inc., Mineola, NY, 1989.
BR.
Bohr, N. and L. Rosenfeld, Zur Frage der Mebarkeit der Electromagnetischen
Feldgroen,
References
CL.
Corn.
CM.
deJag.
Del.
deAI.
deMJS.
Dirac1.
Dirac2.
Ehren.
Ein1.
Ein2.
Ein3.
Ein4.
EPR.
Eis.
Emch.
Evans.
Fad.
FP1.
FP2.
Fels.
Ferm.
FLS.
Feyn.
Flamm.
Fol1.
177
Cini, M. and J.M. Levy-Leblond, Quantum Theory without Reduction, Taylor and Francis, Boca Raton, FL, 1990.
Cornish, F.H.J., The Hydrogen Atom and the Four-Dimensional Harmonic Oscillator, J.
Phys. A: Math. Gen., 17, 1984, 323-327.
Curtis, W.D. and F.R. Miller, Dierential Manifolds and Theoretical Physics, Academic
Press,Inc., Orlando, FL, 1985.
de Jager, E.M.,
The Lorentz-Invariant Solutions of the Klein-Gordon Equation, Siam J. Appl. Math., Vol. 15, No. 4, 1967, 944-963. For more details see
http://oai.cwi.nl/oai/asset/7768/7768A.pdf.
Deligne, P.,et al, Editors, Quantum Fields and Strings: A Course for Mathematicians,
Volumes 1-2, American Mathematical Society, Providence, RI, 1999.
de Azcarraga, J. and J. Izquierdo, Lie Groups, Lie Algebras, Cohomology and Some
Applications in Physics, Cambridge University Press, Cambridge, England, 1995.
de Muynck, W.M., P.A.E.M. Janssen and A. Santman, Simultaneous Measurement and
Joint Probability Distributions in Quantum Mechanics, Foundations of Physics, Vol. 9,
Nos. 1/2, 1979, 71-122.
Dirac, P.A.M., The Fundamental Equations of Quantum Mechanics, Proc. Roy. Soc.
London, Series A, Vol. 109, No. 752, 1925, 642-653.
Dirac, P.A.M., The Lagrangian in Quantum Mechanics, Physikalische Zeitschrift der
Sowjetunion, Band 3, Heft 1, 1933, 64-72.
Ehrenfest, P., Bemerkung u ber die angenaherte Gultigkeit der klassischen Mechanik
innerhalb der Quantenmechanik, Zeitschrift fur Physik, 45, 1927, 455-457.
178
Fol2.
References
Folland, G.B., A Course in Abstract Harmonic Analysis, CRC Press, Boco Ratan, FL,
1995.
Fol3.
Folland, G. B., Quantum Field Theory: A Tourist Guide for Mathematicians, American
Mathematical Society, Providence, RI, 2008.
FS.
Folland, G.B. and A. Sitaram, The Uncertainty Principle: A Mathematical Survey, Journal of Fourier Analysis and Applications, Vol. 3, No. 3, 1997, 207-238.
FU.
Freed, D.S. and K.K. Uhlenbeck, Editors, Geometry and Quantum Field Theory, American Mathematical Society, Providence, RI, 1991.
Fried. Friedman, A. Foundations of Modern Analysis, Dover Publications, Inc., Mineola, NY,
1982.
FrKo. Friesecke, G. and M. Koppen, On the Ehrenfest Theorem of Quantum Mechanics, J.
Math. Phys., Vol. 50, Issue 8, 2009. Also see http://arxiv.org/pdf/0907.1877.pdf.
FrSc.
Friesecke, G. and B. Schmidt, A Sharp Version of Ehrenfests Theorem fo General Self-Adjoint Operators, Proc. R. Soc. A, 466, 2010, 2137-2143. Also see
http://arxiv.org/pdf/1003.3372.pdf.
Fuj1.
Fujiwara, D., A Construction of the Fundamental Solution for the Schrodinger Equations,
Proc. Japan Acad., 55, Ser. A, 1979, 10-14.
Fuj2.
Fujiwara, D., On the Nature of Convergence of Some Feynman Path Integrals I, Proc.
Japan Acad., 55, Ser. A, 1979, 195-200.
Fuj3.
Fujiwara, D., On the Nature of Convergence of Some Feynman Path Integrals II, Proc.
Japan Acad., 55, Ser. A, 1979, 273-277.
Gaal.
Gaal, S.A., Linear Analysis and Representation Theory, Springer, New York, NY, 1973.
Gr.
Grding, L., Note on Continuous Representations of Lie Groups, Proc. Nat. Acad. Sci.,
Vol. 33, 1947, 331-332.
Gel.
Gelfand, I.M., Representations of the Rotation and Lorentz Groups and their Applications, Martino Publishing, Mansfield Centre, CT, 2012.
GN.
Gelfand, I.M. and M.A. Naimark, On the Embedding of Normed Rings into the Ring of
Operators in Hilbert Space, Mat. Sbornik 12, 1943, 197-213.
GS.
Gelfand, I.M. and G.E. Shilov, Generalized Functions, Volume I, Academic Press, Inc.,
New York, NY, 1964.
GY.
Gelfand, I.M. and A.M. Yaglom, Integration in Functional Spaces and its Applications
in Quantum Physics, J. Math. Phys., Vol. 1, No. 1, 1960, 48-69.
GJ.
Glimm, J. and A. Jae, Quantum Physics: A Functional Integral Point of View, Second
Edition, Springer, New York, NY, 1987.
Gold.
Goldstein, H., C. Poole and J. Safko, Classical Mechanics, Third Edition. AddisonWesley, Reading, MA, 2001.
Good. Goodman, R.W., Analytic and Entire Vectors for Representations of Lie Groups, Trans.
Amer. Math. Soc., 143, 55-76, 1969.
Got.
Gotay, M.J., On the Groenewold-Van Hove Problem for R2n , J. Math. Phys., Vol. 40,
No. 4, 2107-2116, 1999.
GGT.
Gotay, M.J., H.B. Grundling, and G.M. Tuynman, Obstruction Results in Quantization
Theory, J. Nonlinear Sci., Vol. 6, 469-498, 1996.
GR.
Gradshteyn, I.S. and I.M. Ryzhik, Table of Integrals, Series, and Products, Seventh
Edition, Academic Press, Burlington, MA, 2007.
Gre.
Greenberg, M., Lectures on Algebraic Topology, W.A. Benjamin, New York, NY, 1967.
Gri.
Greiner, W., Relativistic Quantum Mechanics: Wave Equations, 3rd Edition SpringerVerlag, Berlin, 2000.
Groe.
Groenewold, H.J., On the Principles of Elementary Quantum Mechanics, Physica (Amsterdam), 12, 405-460, 1946.
GS1.
Guillemin, V. and S. Sternberg, Symplectic Techniques in Physics, Cambridge University
Press, Cambridge, England, 1984.
GS2.
Guillemin, V. and S. Sternberg, Supersymmetry and Equivariant de Rham Theory,
Springer, New York, NY, 1999.
Gurtin. Gurtin, M.E,, An Introduction to Continuum Mechanics, Academic Press, New York,
NY, 1981.
References
HK.
179
Hafele, J.C. and R.E. Keating, Around-the-World Atomic Clocks: Observed Relativistic
Time Gains, Science 177 (4044), 168-170.
Hall.
Hall, B.C., Lie Groups, Lie Algebras, and Representations: An Elementary Introduction,
Springer, New York, NY, 2003.
Hal1.
Halmos, P.R., Measure Theory, Springer, New York, NY, 1974.
Hal2.
Halmos, P.R., Foundations of Probability Theory, Amer. Math. Monthly, Vol 51, 1954,
493-510.
Hal3.
Halmos, P.R., What Does the Spectral Theorem Say?, Amer. Math. Monthly, Vol 70,
1963, 241-247.
Ham.
Hamilton, R.S., The Inverse Function Theorem of Nash and Moser, Bull. Amer. Math.
Soc., Vol 7, No 1, July, 1982.
Hardy. Hardy, G.H., Divergent Series, Clarendon Press, Oxford, England, 1949.
180
LaLi.
Lang1.
Lang2.
Lang3.
Lang4.
Langm.
Lax.
Lee.
LGM.
LL.
Lop.
Lucr.
Mack1.
Mack2.
MacL.
Mar1.
Mar2.
Mar3.
Mazz1.
Mazz2.
MS.
MR.
Mess1.
Mess2.
MTW.
MP.
Mos.
Nab1.
References
Landau, L.D. and E.M. Lifshitz, The Classical Theory of Fields, Third Revised English
Edition, Pergamon Press, Oxford, England, 1971.
Lang, S., Linear Algebra, Second Edition, Addison-Wesley Publishing Company, Reading, MA, 1971.
Lang,S., S L2 (R), Springer, New York, NY, 1985.
Lang, S., Dierential and Riemannian Manifolds, Springer, New York, NY, 1995.
Lang, S., Introduction to Dierentiable Manifolds, Second Edition. Springer, New York,
NY, 2002.
Langmann, E., Quantum Theory of Fermion Systems: Topics Between Physics and Mathematics, in Proceedings of the Summer School on Geometric Methods for Quantum Field
Theory, edited by H.O.Campo, A.Reyes, and S. Paycha, World Scientific Publishing Co.,
Singapore, 2001.
Lax, P.D., Functional Analysis, Wiley-Interscience, New York, NY, 2002.
Lee, J.M., Introduction to Smooth Manifolds, Springer, New York, NY, 2003.
Liang, J-Q., B-H. Guo, and G. Morandi, Extended Feynman Formula for Harmonic
Oscillator and its Applications, Science in China (Series A), Vol. 34, No. 11, 1346-1353,
1991.
Lieb, E.H. and M. Loss, Analysis, American Mathematical Society, Providence, RI,
1997.
Lopuszanski, J., An Introduction to Symmetry and Supersymmetry in Quantum Field
Theory, World Scientific, Singapore, 1991.
Lucritius, The Nature of Things, Translated with Notes by A.E. Stallings, Penguin
Classics, New York, NY, 2007.
Mackey, G.W., Quantum Mechanics and Hilbert Space, Amer. Math. Monthly, Vol 64,
No 8, 1957, 45-57,
Mackey, G.W., Mathematical Foundations of Quantum Mechanics, Dover Publications,
Mineola, NY, 2004.
Mac Lane, S., Hamiltonian Mechanics and Geometry, Amer. Math. Monthly, Vol. 77,
No. 6, 1970, 570-586.
Marsden, J., Darbouxs Theorem Fails for Weak Symplectic Forms, Proc. Amer. Math.
Soc., Vol. 32, No. 2, 1972, 590-592.
Marsden, J., Applications of Global Analysis in Mathematical Physics, Publish or Perish,
Inc., Houston, TX, 1974.
Marsden, J., Lectures on Geometrical Methods in Mathematical Physics, SIAM,
Philadelphia, PA, 1981.
Mazzucchi, S., Feynman Path Integrals, in Encyclopedia of Mathematical Physics, Vol. 2,
307-313, J-P Francoise, G.L. Naber, and ST Tsou (Editors), Academic Press (Elsevier),
Amsterdam, 2006.
Mazzucchi, S., Mathematical Feynman Path Integrals and Their Applications World
Scientific, Singapore, 2009.
McDu, D. and D. Salamon, Introduction to Symplectic Topology, Oxford University
Press, Oxford, England, 1998.
Mehra, J. and H. Rechenberg, The Historical Development of Quantum Theory, Volume
2, Springer, New York, NY, 1982.
Messiah, A., Quantum Mechanics, Volume I, North Holland, Amsterdam, 1961.
Messiah, A., Quantum Mechanics, Volume II, North Holland, Amsterdam, 1962.
Misner, C.W., K.S. Thorne and J.A. Wheeler, Gravitation, W.H. Freeman and Company,
San Francisco, CA, 1973.
Morters, P. and Y. Peres, Brownian Motion, Cambridge University Press, Cambridge,
England, 2010.
Moser, J., On the Volume Elements on a Manifold, Trans. Amer. Math. Soc., 120, 1965,
286-294.
Naber, G., Spacetime and Singularities: An Introduction, Cambridge University Press,
Cambridge, England, 1988.
References
Nab2.
181
Naber, G., Topology, Geometry and Gauge Fields: Foundations, Second Edition,
Springer, New York, NY, 2011.
Nab3. Naber, G., Topology, Geometry and Gauge Fields: Interactions, Second Edition,
Springer, New York, NY, 2011.
Nab4. Naber, G., The Geometry of Minkowski Spacetime: An Introduction to the Mathematics
of the Special Theory of Relativity, Second Edition, Springer, New York, NY, 2012.
Nab5. Naber, G., Foundations of Quantum Mechanics: An Introduction to the Physical Background and Mathematical Structure, Available at http://gregnaber.com
Nash. Nash, C., Dierential Topology and Quantum Field Theory, Academic Press (Elsevier),
San Diego, CA, 2003.
Nel1.
Nelson, E., Analytic Vectors, Ann. Math., Vol. 70, No. 3, 1959, 572-615.
Nel2.
Nelson, E., Feynman Integrals and the Schroedinger Equation, J. Math. Phys., 5, 1964,
332-343.
Nel3.
Nelson, E., Dynamical Theories of Brownian Motion, Second Edition, Princeton University Press, Princeton, NJ, 2001. Available online at https://web.math.princeton.edu/ nelson/books/bmotion.pdf
ODV. ODonnell, K. and M. Visser, Elementary Analysis of the Special Relativistic Combination of Velocities, Wigner Rotation, and Thomas Precession, arXiv:1102.2001v2 [gr-qc]
11 June 2011.
Olv.
Olver, P.J., Applications of Lie Groups to Dierential Equations, Second Edition,
Springer, New York, NY, 2000.
ON.
ONeill, B., Semi-Riemannian Geometry with Applications to Relativity, Academic Press,
San Diego, CA, 1983.
Oz.
Ozawa, M., Universally Valid Reformulation of the Heisenberg Uncertainty Principle on
Noise and Disturbance in Measurement, Phys. Rev. A., Vol 67, Issue 4, 042105(2003),
1-6, http://arxiv.org/pdf/quant-ph/0207121v1.pdf.
Pais.
Pais, A., Subtle is the Lord ... The Science and the Life of Albert Einstein, Oxford
University Press, Oxford, England, 1982.
PM.
Park, J.L. and H. Margenau, Simultaneous Measurability in Quantum Theory, International Journal of Theoretical Physics, Vol. 1, No. 3, 1968, 211-283.
Pater.
Paternain, G.P., Geodesic Flows, Birkhauser, Boston, MA, 1999.
Pazy.
Pazy, A., Semigroups of Linear Operators and Applications to Partial Dierential Equations, Springer, NY, New York, 1983.
Per.
Perrin, J., Les Atomes Flammarion, Paris, New Edition, 1991.
Planck. Planck, M., Zur Theorie des Gesetzes der Energieverteilung im Normalspektrum, Verhandlungen der Deutschen Physikalischen Gesellschaft, 2, 1900, 237-245. English translation available in [terH].
Prug.
Prugovecki, E., Quantum Mechanics in Hilbert Space, Academic Press, Inc., Orlando,
FL, 1971.
Ratc.
Ratclie, J.G., Foundations of Hyperbolic Manifolds, Second Edition, Springer, New
York, NY, 2006.
RS1.
Reed, M. and B. Simon, Methods of Modern Mathematical Physics I: Functional Analysis, Academic Press, Inc., Orlando, FL, 1980.
RS2.
Reed, M. and B. Simon, Methods of Modern Mathematical Physics II: Fourier Analysis,
Self-Adjointness, Academic Press, Inc., Orlando, FL, 1975.
RiSz.N. Riesz, F. and B. Sz.-Nagy, Functional Analysis Dover Publications, Inc., Mineola, NY,
1990.
Rob.
Robertson, H.P., A General Formulation of the Uncertainty Principle and its Classical
Interpretation, Phys. Rev. 34(1929), 163-164.
Ros.
Rosay, J-P., A Very Elementary Proof of the Malgrange-Ehrenpreis Theorem, Amer.
Math. Monthly, Vol. 98, No. 6, Jun.-Jul., 1991, 518-523.
Rosen,S. Rosenberg, S., The Laplacian on a Riemannian Manifold, Cambridge University Press,
Cambridge, England, 1997.
Rosen,J. Rosenberg, J., A Selective Hisory of the Stone-von Neumann Theorem, Contemporary
Mathematics, 365, Amer. Math. Soc., Providence, RI, 2004, 331-353.
182
Roy.
Roz.
Rud1.
Rud2.
Rud3.
Ryd.
Sak.
Schl.
Schm1.
Schm2.
Schm3.
Schro1.
Schro2.
Schul.
Segal1.
Segal2.
SDBS.
Simm1.
Simm2.
Simms.
Simon1.
Simon2.
Smir.
Smith.
Sp1.
Sp2.
Sp3.
Srin.
Str.
SW.
References
Royden, H.L., Real Analysis, Second Edition, Macmillan Co., New York, NY, 1968.
Rozema, L.A., A. Darabi, D.H. Mahler, A. Hayat, Y. Soudagar, and A.M. Steinberg, Violation of Heisenbergs Measurement-Disturbance Relationship by Weak Measurements,
Physical Review Letters, 109, 100404(2012), 1-5.
Rudin, W., Fourier Analysis on Groups, Interscience Publishers, New York, NY, 1962.
Rudin, W., Functional Analysis, McGraw-Hill, New York, NY, 1973.
Rudin, W., Real and Complex Analysis, Third Edition, McGraw-Hill Book Company,
New York, 1987.
Ryder, L.H., Quantum Field Theory, Second Edition, Cambridge University Press,
Cambridge, England, 2005.
Sakuri, J.J. and J. Napolitano, Modern Quantum Mechanics, Second Edition, AddisonWesley, Boston, MA, 2011.
Schlosshauer, M., Decoherence, the Measurement Problem, and Interpretations of Quantum Mechanics, Rev. Mod. Phys., 76(4), 2005, 1267-1305.
Schmudgen, K., On the Heisenberg Commutation Relation I, J. Funct. Analysis, 50,
1983, 8-49.
Schmudgen, K., On the Heisenberg Commutation Relation II, Publ. RIMS, Kyoto Univ,
19, 1983, 601-671.
Schmudgen, K., Unbounded Self-Adjoint Operators on Hilbert Space, Springer, New
York, NY, 2012.
Schrodinger, E., Quantisierung als Eigenwertproblem, Annalen der Physik, 4, Volume
79, 1926, 273-376. English translation available in [Schro2].
Schrodinger, E., Collected Papers on Wave Mechanics, Third Edition, AMS Chelsea
Publishing, American Mathematical Society, Providence, RI, 1982.
Schulman, L.S., Techniques and Applications of Path Integration, Dover Publications,
Inc., Mineola, NY, 2005.
Segal, I.E., Postulates for General Quantum Mechanics, Annals of Mathematics, Second
Series, Vol. 48, No. 4, 1947, 930-948.
Segal, I.E., A Class of Operator Algebras Which Are Determined By Groups, Duke
Math. J., 18, 1951, 221-265.
Sen, D., S.K. Das, A.N. Basu, and S. Sengupta, Significance of Ehrenfests Theorem in
Quantum-Classical Relationship, Current Science, Vol. 80, No. 4, 2001, 536-541.
Simmons, G.F., Topology and Modern Analysis, McGraw-Hill, New York, NY, 1963.
Simmons, G.F., Dierential Equations with Applications and Historical Notes, McGrawHill, New York, NY, 1972.
Simms, D.J., Lie Groups and Quantum Mechanics, Lectures Notes in Mathematics 52,
Springer, New York, NY, 1968.
Simon, B., The Theory of Semi-Analytic Vectors: A New Proof of a Theorem of Masson
and McClary, Indiana Univ. Math. J., 20, 1971, 1145-1151.
Simon, B., Functional Integration and Quantum Physics, Academic Press, New York,
NY, 1979.
Smirnov, V.A., Feynman Integral Calculus, Springer, New York, NY, 2006.
Smith, M.F., The Pontrjagin Duality Theorem in Linear Spaces, Ann. of Math. 2, 56,
1952, 248-253.
Spivak, M., Calculus on Manifolds, W.A. Benjamin, New York, NY, 1965.
Spivak, M., A Comprehensive Introduction to Dierential Geometry, Vol. I-V, Third
Edition, Publish or Perish, Houston, TX, 1999.
Spivak, M., Physics for Mathematicians, Mechanics I, Publish or Perish, Houston, TX,
2010.
Srinivas, M.D., Collapse Postulate for Observables with Continuous Spectra, Commun.
Math. Phys. 71 (2), 131-158.
Strange, P., Relativistic Quantum Mechanics with Applications in Condensed Matter
Physics and Atomic Physics, Cambridge University Press, Cambridge, England, 1998.
Streater, R.F. and A.S. Wightman, PCT, Spin and Statistics, and All That, W.A. Benjamin,
New York, NY, 1964.
References
Synge.
Szab.
Sz.
Takh.
TaylA.
TaylM.
TW.
terH.
t Ho1.
t Ho2.
Trau.
TAE.
Tych.
VDB.
VDW.
VH.
Vara.
Vog.
v.Neu.
Wald.
Warn.
Water.
Wat.
Weinb.
Weins.
Wien.
Wight.
Wig1.
183
Synge, J.L., Relativity: The Special Theory, North-Holland Publishing Company, Amsterdam, The Netherlands, 1956.
Szabo, R.J., Equivariant Cohomology and Localization of Path Integrals, Springer, New
York, NY, 2000. Also see http://arxiv.org/abs/hep-th/9608068.
Szego, G., Orthogonal Polynomials, American Mathematical Society, Providence, RI,
1939.
Takhtajan, L.A., Quantum Mechanics for Mathematicians, American Mathematical Society, Providence, RI, 2008.
Taylor, A.E., Introduction to Functional Analysis, John Wiley and Sons, Inc., London,
England, 1967.
Taylor, M.E., Partial Dierential Equations I, Second Edition, Springer, New York, NY,
2011.
Taylor, E.F. and J.A. Wheeler, Spacetime Physics, W.H. Freeman, San Francisco, CA,
1963.
ter Haar, D., The Old Quantum Theory, Pergamon Press, Oxford, England, 1967.
t Hooft, G., Gauge Theories of the Forces between Elementary Particles, Scientific
American, 242, 1980, 90-116.
t Hooft, G., How a Wave Function can Collapse Without Violating Schrodingers Equation, and How to Understand Borns Rule, arXiv:1205.4107v2 [quant-ph] 21 May 2012.
Trautman, A., Noether Equations and Conservation Laws, Commun. Math. Phys. 6,
248-261, 1967.
Twareque Ali, S. and M. Englis, Quantization Methods: A Guide for Physicists and
Analysts, http://arxiv.org/abs/math-ph/0405065
Tychono, A., Theor`emes dUnicite pour lEquation de la Chaleur, Mat. Sb., Vol 42, No
2, 1935, 199-216.
van den Ban, E., Representation Theory and Applications in Classical Quantum Mechanics, Lectures for the MRI Spring School, Utrecht, June, 2004, v4: 20/6. Available at
http://www.sta.science.uu.nl/ ban00101/lecnotes/repq.pdf
van der Waerden, B.L., Ed., Sources of Quantum Mechanics, Dover Publications, Mineola, NY, 2007.
Van Hove, L., Sur Certaines Representations Unitaires dun Groupe Infini de Transformations, Proc. R. Acad. Sci. Belgium, 26, 1-102, 1951.
Varadarajan, V.S., Geometry of Quantum Theory, Second Edition, Springer, New York,
NY, 2007.
Vogan, D.A., Review of Lectures on the Orbit Method, by A.A. Kirillov, Bull. Amer.
Math. Soc., Vol.42, No. 4, 1997, 535-544.
von Neumann, J., Mathematical Foundations of Quantum Mechanics, Princeton University Press, Princeton, NJ, 1983.
Wald, R.M., General Relativity, University of Chicago Press, Chicago, IL, 1984.
Warner, F., Foundations of Dierentiable Manifolds and Lie Groups, Springer, New
York, NY, 1983.
Waterhouse, W.C., Dual Groups of Vector Spaces, Pacific Journal of Math., Volume 26,
No. 1, 1968, 193-196.
Watson, G., Notes on Generating Functions of Polynomials, Hermite Polynomials, J.
Lond. Math. Soc., 8, 1933, 194-199.
Weinberg, S., Dreams of a Final Theory: The Scientists Search for the Ultimate Laws of
Nature, Vintage Books, New York, NY, 1993.
Weinstein, A., Symplectic Structures on Banach Manifolds, Bull. Amer. Math. Soc., Vol.
75, No. 5, 1969, 1040-1041.
Wiener, N., Dierential Space, J. Math. Phys., 2, 1923, 131-174.
Wightman, A.S, How it was Learned that Quantized Fields are Operator-Valued Distributions, Fortchr. Phys. 44 (2), 1996, 143-178.
Wigner, E.P., On Unitary Representations of the Inhomogeneous Lorentz Group, Annals
of Mathematics 40 (1), 1939, 149-204.
184
Wig2.
Wig3.
Wilcz.
Witt.
Yos.
Z.
References
Wigner, E.P., Group Theory and its Application to the Quantum Mechanics of Atomic
Spectra, Academic Press, New York, NY, 1959.
Wigner, E.P., Phenomenological Distinction Between Unitary and Antiunitary Symmetry
Operators, J. Math. Phys., Volume 1, Issue 5 (1960), 414-416.
Wilczek, F., Origins of Mass, arXiv:1206.7114v2 [hep-ph] 22 Aug 2012, 1-35.
Witten, E., Supersymmetry and Morse Theory, J. Di. Geo., 17 (1982), 661-692.
Yosida, K., Functional Analysis, Springer, New York, NY, 1980.
Zeeman, E.C., Causality Implies the Lorentz Group, J. Math. Phys. 5, 1964, 490-493.
Index
([v], [w])P(H) , 26
Ad, 11
Adg , 11
Aut(G), 1
Aut(P(H)), 26
Aut(g), 11
H2 , 59
H0 , 40
H x0 , 10
IndGH (), 32
L(), 56
LO0 , , 40
N ! H, 33
P H, 30
R = (R ),=0,1,2,3 , 53
S 1, 2
[ , ]C , 5
P2 , 88
W2 , 89
= ( ),=0,1,2,3 , 47, 52
A , 60
i (), 54
L+ , 4, 47, 53
P+ , 47, 58
x1 , x2 , x3 ), . . ., 44
(x1 , x2 , x3 ), (
AC , 5
C(D( j/2) ), 23
CN (x0 ), CN (x0 ), 50
CT (x0 ), CT (x0 ), 50
D( j/2) , 23
E(x0 ), 51
H , 32
M, 43, 48
. . ., 44
O, O,
O0 , 40
O x0 , 10
x0 , x1 , x2 , x3 ), . . ., 46
S(x0 , x1 , x2 , x3 ), S(
T, T , 50
U(H), 14
U(g), 86
U (H), 27
, , 49
C , 2
P(H), 24
R , 1
R1,3 , 51
= ( ),=0,1,2,3 , 47, 49
1 = ( ),=0,1,2,3 , 49
38
N,
: SL(2, C) L+ , 61, 63
iso(3), 36
p, 72
T
A ,3
P , 24
g , 10
j , j = 1, , 2, 3, 18
(v), 55
{e0 , e1 , e2 , e3 }, 48
ad, 11
adX , 11
1-parameter group of dieomorphisms, 130
1-particle space, 115
4-acceleration, 57
4-velocity, 57
abstract Schrodinger equation, 156
action functional, 124
adjoint action, 12
adjoint representation of U(g), 87
adjoint representation of g, 12
admissible basis, 51
admissible observer, 44
algebra of classical observables, 135, 140
angular momentum
185
186
classical, 131
intrinsic, 168
orbital, 164
rotational, 164
total, 164
angular momentum operator
orbital, 120
spin, 121
total, 121
angular momentum operators
orbital, 163
anti-unitary map, 26
anticommutation relations, 19
associated Legendre functions, 80
associative algebra, 8
associative algebra
commutator bracket, 9
commutator Lie algebra structure, 9
ideal, 8
ideal generated by S , 9
subalgebra, 8
unit, 8
unital, 8
automorphism group of P(H), 26, 155
automorphism group of a Lie group, 1
is a Lie group, 37
automorphism of P(H), 26, 155
automorphism of Lie groups, 1
Bargmanns Theorem, 105
base space, 29
boost, 54
Born-von Neumann formula, 151
boson, 168
bracket, 4
bundle space, 29
canonical 1-form, 133
canonical commutation relations
for classical mechanics, 141
for quantum mechanics, 159
canonical coordinates, 133, 140
canonical map, 142
canonical symplectic form, 133
Cartans magic formula, 136
Casimir invariants, x
Casimir invariants of p, 90
causal automorphism, 46
causality assumption, 46
Change of Variables Formula, 100
character, 38
character group, 38
of Rk , 39
classical canonical commutation relations, 141
Index
classical harmonic oscillator Hamiltonian, 135
classical observable, 135, 140
Closed Subgroup Theorem, 2
commutation relations, 19
iso(3), 37
commutator bracket, 9
commutator Lie algebra structure, 9
configuration space, 123, 133
conjugate momentum, 125
conjugation representation, 18
conservation law
classical mechanics, 126
conservation of energy, 137, 142
conserved quantity, 125, 137
and the Poisson bracket, 142
angular momentum, 131
energy, 142
contragredient action, 93, 103
covector, 133
covering group, 13
covering map, 13, 63
covering space, 13
critical curve, 125
critical point, 125
Darboux Theorem
finite dimensional, 139
linear, 139
derivation, 11
dispersion, 151
double cover
universal, 63
dual group, 38
duration of a timelike vector, 55
Einstein energy-momentum relation, 95
Einstein summation convention, xi
elsewhere, 51
energy, 153
equivalent projective representations, 28
Euler-Lagrange equations, 125, 132
and Newtons Second Law, 128
event, 43, 49
evolution operator, 153
expectation value, 151
expected value, 151
exponential map, 9
fermion, 168
fiber, 29
Fizeau procedure, 45
flow of a vector field, 130
frame of reference, 46
free group action, 10
Index
free particle, 128
general linear group, 2
generators of a Lie algebra, 36
generators of boosts, 66
generators of rotations, 37, 66
generators of translations, 37
GL(n, C), 2
GL(n, R), 2
global section
of a principal H-bundle, 30
graviton, 169
group action, 10
group representation
induced, 29
semi-direct products, 33
Haar measure, 1
Hamiltons equations, 134, 136, 142
Hamiltonian, 133, 140
classical harmonic oscillator, 135
on L2 (Rn ), 156
quantum, 153
Hamiltonian flow, 134
Hamiltonian Noether Theorem, 144
Hamiltonian system, 140
Hamiltonian vector field, 133, 136, 140, 142
harmonic oscillator
classical Hamiltonian, 135
Heisenberg equation, 158
Heisenberg picture, 158
Heisenberg Uncertainty Principle, 152
Hermitian inner product, 3
Higgs boson, 169
Hilbert bundle, 30, 31
section, 31
homogeneous manifold, 12
homogeneous space, 12
ideal, 8
ideal generated by S , 9
indefinite inner product, 4
induced representation, 29, 32, 42
infinitesimal symmetry, 126, 143
inhomogeneous rotation group, 35
inhomogeneous SL(2, C), 64, 103
internal semi-direct product, 34
intertwine, 15
intrinsic angular momentum, 168, 172
invariant under rotation, 129
irreducible representation, 15
ISO(3), 35
isolated quantum system, 153
isomorphic Lie algebras, 5
187
isomorphism of Lie groups, 1
isomorphism of projectivizations, 26
isotropy subgroup, 10
Jacobi Identity, 136, 141
Jacobi identity, 4
kinetic energy operator, 160
Lagrangian, 124
symmetry of, 126
left action, 10
left invariant vector field, 5
Leibniz Rule, 136, 142
Leibniz rule, 11
Lie algebra, 4
complex, 4
complexification, 5
exponential map on, 9
isomorphism, 5
of a Lie group, 5
of GL(n, C), 7
of GL(n, R), 7
of O(n), 7
of O(p, q), 8
of SO(n), 7
of SO(p, q), 8
of SU(n), 8
of the Lorentz group, 65
of U(n), 8
Lie algebra automorphism, 5
Lie algebra cohomology, 28
Lie algebra homomorphism, 5
Lie algebra isomorphism, 5
Lie algebra of operators, 82
Lie algebra representation, 5
Lie group, 1
closed subgroups of, 2
continuous homomorphisms of, 2
isomorphism, 1
representation, 1
Lie groups, 1
Lie subalgebra, 5
lift of a projective representation, 28
light travel time, 45
Linear Darboux Theorem, 139
little group, 40
local section
of a principal H-bundle, 30
locally isomorphic Lie groups, 7
Lorentz group
general, 4
proper, 4
proper, orthochronous, 4
188
Lorentz inner product, 49
Lorentz transformation
general, 52
orthochronous, 53
proper, 52
special, 55
Mackey machine, ix, 37, 41
and ISL(2, C), 107
Mackeys Theorem, 41
magnetic dipole, 164
magnetic moment
orbital, 164
rotational, 165
total, 165
mass hyperboloid, 93
mass operator, 118
matrix Lie group, 3
matrix mechanics, 148
measure
pushforward, 102
measurement, 147, 153
Minkowski inner product, 49
Minkowski spacetime, 48
moment map, 144
momentum, 128
conjugate, 125
momentum map, 144
momentum operator, 162
momentum space, x, 92
natural coordinates on T M, 123
Noethers Theorem, 127, 144
null cone, 50
future, 50
past, 50
null curve, 56
future directed, 56
past directed, 56
null vector, 49
O(n), 3
O(p, q), 3
observable
classical, 135
quantum, 149
orbit, 10
orbit space, 30
orbital angular momentum, 164
orbital magnetic moment, 164
orthogonal group, 3
orthogonal transformation of M, 52
orthogonal vectors in M, 50
orthonormal basis for M, 49
Index
past directed, 50
path space, 124
Pauli spin matrices, 18
Pauli-Lubanski vector, 88
phase space
in classical mechanics, 133
in Hamiltonian mechanics, 137
photon, 169
Poincare algebra, 72
Poincare group, 58
Poincare-Birkho-Witt Theorem, 86
Poisson algebra, 136
Poisson bracket, 136, 141
Poisson commutes, 137
position operator, 161
positive energy, 94
Postulates of Quantum Mechanics
QM1, 148
QM2, 149
QM3, 150
QM4, 153
potential, 156
spherically symmetric, 128
potential energy operator, 161
principal H-bundle
trivial, 29
principal bundle, 29, 30
principal H-bundle, 29
Principle of Least Action, 125
Principle of Stationary Action, 125
probability amplitude, 149
projection, 29
of a principal bundle, 29
projective representation, ix, 24, 28
lift, 28
projectivization of H, 149
projectivization of a Hilbert space, 24
proper time function, 57
proper time length, 56
proper time separation, 55
pushforward measure, 102
quantization, 161
quantum bracket, 159
quantum canonical commutation relations, 159
quantum field, ix
quantum symmetry group, 155
quantum system, 147
state space, 149
symmetry of, 26, 154
quasi-invariant measure, 32
ray in a Hilbert space, 24
realization of a Lie algebra, 84
Index
reducible representation, 15
regular semi-direct product, 40
relatiivistic invariance, x
relativistic angular momentum, 77
relativistic scalar field, 116
Relativity Principle, 48
representation
and left action, 14
carriers, 14
dimension, 14
induced, 32, 42
invariant subspace, 14, 15
irreducible, 15
of a group, 14
reducible, 15
spinor, 23
trivial, 21
Wigner, 116
rest frame, 115
Reversed Schwartz Inequality, 55
Reversed Triangle Inequality, 56
right action, 10
Robertson Uncertainty Relation, 152
Rodrigues formula, 80
rotation group, 3, 129
exponential map on, 129
Lie algebra of, 129
rotation subgroup of L+ , 54
rotational angular momentum, 164
rotational magnetic moment, 165
scalar field
relativistic, 116
scalar field on Xm+ , 116
Schrodinger equation, 148
abstract version, 156
physicists version, 156
Schrodinger picture, 158
Schurs Lemma, 15
for associative algebras over C, 91
infinite-dimensional version, 15
section
global, 30
local, 30
of a Hilbert bundle, 31
semi-direct product, 33, 35
internal, 34
regular, 40
representations, 33
semi-direct product of Lie groups, 37
semi-orthogonal group, 3
signature of Minkowski spacetime, xi
SL(2, C)
conjugation representation, 20
189
standard representation, 20
SL(n, C), 3
SL(n, R), 3
SO(3)
geometrical description, 10
SO(n), 3
SO(p, q), 4
space of an admissible observer, 51
spacelike curve, 56
spacelike vector, 49
special linear group, 3
special Lorentz transformation, 55
hyperbolic form, 55
special orthogonal group, 3
special semi-orthogonal group, 4
special unitary group, 3
special unitary group SU(2), 17
spherical harmonics, 80
spherically symmetric potential, 128
spin, 172
of an SU(2) representation, 24
spin 12 , 168
spin angular momentum operator, 121
spin components, 172
spin model of R3 , 19
spin operators, 172
spin quantum number, 168
spin vector, 169
spinor
contravariant, 23
rank m, 23
SL(2, C), 24
SU(2), 23
spinor field on Xm+ , 116
spinor map, 63
spinor representation
SU(2), 23
state
classical mechanical system, 123, 133
in quantum mechanics, 148
state space, 123
in quantum mechanics, 149
stationary curve, 125
stationary point, 125
Stern-Gerlach experiment, 165
structure constants, 7
structure group, 29
SU(2)
defining representation, 17
representation
spin, 24
spinor representation, 23
standard representation, 17
SU(n), 3
190
SU(2), 17
irreducible representations, 17
subalgebra, 8
surface measure, 100
symmetry, 126
quantum system, 26, 154
symmetry group, 143
of a Lagrangian, 127
physical interpretation, 84
quantum, 155
symplectic dieomorphism, 142
symplectic form, 137
canonical, 133
symplectic gradient, 136, 140
symplectic manifold, 137
compact, 138
is even dimensional, 138
is orientable, 138
symplectic vector field, 142
symplectomorphism, 142
time axis, 51
time cone, 50
future, 50
past, 50
time orientation, 50, 51
timelike curve, 56
future directed, 56
past directed, 56
timelike vector, 49
timelike worldline, 56
total angular momentum, 164
total energy
classical mechanics, 133
in quantum mechanics, 153
symplectic geometry, 140
total magnetic moment, 165
total orbital angular momentum, 78
Index
total relativistic energy, 94
total space, 29
transition amplitude, 153
transition probability, 26, 153
transitive group action, 10
trivial principal H-bundle, 29
trivial representation, 14, 21
twin paradox, 56
U(n), 3
uncertainty relations, 152
Robertson, 152
unit ray, 25, 149
unitarily equivalent representations, 15
unitary group, 3
unitary group representation, 14
unitary representation, 14
strongly continuous, 14
weakly continuous, 14
universal covering group, 63
universal double cover, 63
universal enveloping algebra, x, 85
basis, 87
variance, 151
wave function, 148
wave mechanics, 148
weight of spinor representation, 23
Wigner representation, 116
Wigner representation of mass m and spin j/2,
116
Wigner rotation, 67
Wigner symmetry, 26, 154
Wigners Theorem on Symmetries, 26, 154
worldline, 43
worldline of a material particle, 56