Académique Documents
Professionnel Documents
Culture Documents
(1918-1979)
Acquisitions and
Bibliographk Secvices
Acquisitiorrs et
servce~biMMiographiq~
Abstract
In this work, we trace the role of symmetry throughout the history of theoretical
particle physics, paying particular attention to the role o f group theory, the fornial
mathematics of syrnmetry. Mer a aridysis of the role of conservation laws and invariance
in the theory of general relativity, we move on to Weyl's gauge theory of 1918, which was
gravitation and
electromagnetism. Weyl was trying to exploit an invariance of scale, and although his theory
was experimentally refuted, it provided a formulation of the conservarion of charge. M e r the
Weyl and Wigner studied group theory in the context of quantum mechanics, but the
broadness of its application had yet to be appreciated- Symmetry was soon exploited in the
nuclear interactions, however, and we examine the events Ieading to the discovery of SU(2) of
isotopic spin. We anaiyze how the discovery of strangeness was linked to the generalization
of SU(2) to SU(3), and also how it led to a differentiation between the strong interactions,
which conserve isotopic spin and strangeness, and the weak interactions, which violate these
Yang and Mills were impressed with gauge invarance, and in 1954, they took the bold
step of irnposing it upon the Lagrangian of the strong interactions, forcing the introduction of
three new gauge fields. There was a problem, however, because although the short-range of
the strong interactions implied that these gauge bosons should be massive, they needed to be
massless in order to preserve gauge invariance. In addition, efforts were made to extend
Yang-Mills theory to the weak interactions, but they also faced the same zero-mass problem.
This problem was fmdly solved in 1967. when Weinberg and SaIam showed how gauge
boson masses could be generated using spontaneous symmetry breaking. They based a
unification of the electromagnetic and weak interactions upon local gauge invariance, and this
principle was soon appIied to the strong interactions as weli.
Acknowledgements
Fust and foremost, 1 wouId [ike to thank m y supervisor, Brian Baigrie- Brian was
always there to give me a d v k and help whenever 1needed t, whiIe always giving me the
freedom and space which 1 needed just a s rnuch to grow and develop as a scholar. Thanks
for always being in my corner, Brian. Thank you to Sungook Hongy who was particularly
helppful during the beginning stages of my research, when my ideas were taking shape.
Special thanks to J m Brown, for many helpfl discussions-1 got the idea for the dissertation
during a t a k with Jim. I dso appreciate tbat he welcomed me into hs monthiy meetings with
his philosophy graduate -dents.
Thank you to a l l the professors at the Institute for the History and Philosophy of
Science and Technology. Thank you to Trevor Levere for welcoming me into the Institute,
Craig Fraser and Brian Baigrie for teaching me my &a history of science classes, Bert Hail
for teaching me the ropes of the lbrary system, and Polly Winsor for being a source of
inspiration. Polly deserves special mention as she is truly one of most dedicated professors I
have ever had. These are the professors who taught me what history of science is al1 abouf
and 1am grateful for it.
1have benefitted fiom discussions with Jed Buchwdd, Jay Foster, Peter Galison, Yves
Gingras, Ian Hacking, Envin Hiebert, Ed Jurkowitz, Margaret Momson, David Pantalony,
Silvan S. Schweber, Alan Shapiro, and especiafly C a t h e ~ eWestfall. Thanks to Catherine for
going out of her way to give me help with my written work.
Thank you to the Connaught Fundyand to the Social Science and Humanities Researcb
Council for financial support. Thanks to my parents, who never doubted that 1would
succeed, and thus made that success possible. Thank you also to Muna and Zag, for their
help above and beyond the cal1 of duty. Lasf but not least, thanks to Dierdre, and to the rest
of my friends and family that helped to make my He easier, by believing in me.
Table of Contents
I.
Introduction
II.
5. Concluding rem&
3. Concluding remarks
W.
11.
m..
111.
iv.
3. Concluding remarks
V.
.-
ii.
3. ConcIuding remarks
VI. Conservation Laws and Nuclear Interactions (1930-58)
1. The continuous spectrurn ofp-decay
1.
..
ir.
.-
11.
*.-
ru.
iv.
v.
3 . Concluding rernarks
VIII.
1,
a
.
11.
8. Concluding rernarks
-
IX.
Conclusions
References
1- introduction
In this dissertation, we shail investigate the changing role of symmetry throughout the
development of high energy physics, nom its beginnings in the early 20th century, up until
the widespread acceptance of the Standard Model of particle physics in 1979. There are two
main issues to be explored:
1.
How did symmetry corne to gain its preseni status in the arena of pYticle physics?
2.
How did the widespread acceptance of group theory, the formal mathematics of
symmetry, affect the developing physics?
A symmetry is any transformation that can be applied to a system which ieaves it in a
configuration indistinguishable fiom the original. For a given physicai system, the
s y m m e ~ e sinherent in the equations of motion, and therefore in its associated Lagrangian,
prove to be quite useful. For instance, rotationai symmetry implies that if we rotate a
planetary system through any angle we choose, the resultant orbit is also a possible orbit
Symmetries and conservation laws are intimately related. In fact, Emmy Noether
proved in 1918 that every continuous symmetry of Nature yields a conserved quantity, and
that every consewation law can be linked to an underlying syrnmetry. For instance, the laws
of physics are symmetrical with respect to translations in t h e : they are supposed to work the
same way today as they did yesterday. This invariance with respect to tirne is linked to
then momentum is conserved, while if it is symmetrical under rotations about a point, then
angula. momentum is conserved.
In the history of partide physics, there has been a shift in the primacy of conservation
laws and symmey. From the days of the old quantum theory, through to the 1920s and the
deveioprnent of quantum mechanics, conservation laws of space-tirne were imposed upon the
emerging physics. Early particle physicists applied these empincal laws as definitions in an
effort to make sense of their discipline, and also to esiablish what was new in the interactions
of elementary particles. Since the conserved quantities implied symnietries in the Lagrangian,
theonsts were able to guess at what the form of the Lagrmgian might be.
Today, symmetry is the foundation upon which the Standard Mode1 of particle physics
is built Each of the three forces: the strong, the electromagnetic, and the weak interaction,
is modelled upon a symmetry group. These symmetry groups not only determine the
und&ing
conservation hm--they also deterrnine what kinds of actual physical States are
possible. Theorists often propose new symmetry groups, which are then imposed upon the
Lagrangia.. This allows theorists to work out what kind of new particles should exist, and
what kinds of decays and interactions should be possibIe. In fact, certain choices of unifying
symmetry groups have led to suggestions that a well-estabiished conservation Iaw might be
violated. Experimentalists, in turn, are constantly on the lookout for evidence of these
symmetry. There is a degree of fieedom in symmetry choice, a certain latitude within which
ali associated physically meamrable quantities remain fixed. For instance, the invariance of
electrodynamics under phase transformations, which incidentaIly irnplies that the phase of
electromagnebc fields is immeasurable, Ieaves theorists fiee to choose the phase arbitrady as
long as the physical equations can be kept invariant. This is achieved with a bit of
mathematical artince known as the requirement of local gauge invariance. 'The Lagrangian,
and the equations of motion, are forced to remain invariant by the introduction of new fieIds,
called gauge fields, which conrol the shifting phases. These gauge fields can then be
interpreted as red, physid particles-
Neither gauge invariance nor quantum field theory themselves are recent
developments. Gauge invariance was introduced by Hermann Weyl in 1918 in the context of
his proposed unified field theory of gravitation and electromagnetism, well before the omet of
quantum rnechanics. Weyl was trying to gene&ze Einstein's theory of gravitation to include
both gravitation and electromagnetism, by extending the Iocality of space-time to a different
kuid of infitesimal geometry, in which it would be possible to choose the scale of
measurement at each point in space at wiII. His plan was to relate electromagnetism to this
new kind of invariance, just as Einstein had comected gravitation with
invariance under
general continuous transformations of space-time coordinates. Weyl's point was that standard
Riemamian geometry was slightly defective, because, although it appeared to be infinitesimal,
it contained a residue of rigid Euclidean geometry since it maintained that dthough the
directions of vectors are path-independent with respect to parallel transfer, their magnitudes
are not. This is why Weyl's theory was narned gauge theory. The term gauge was
appropriate, since the rescaling of the metric tensor due to the path factor could be interpreted
as a change of length measureement, and the word gauge was in common use, at the tirne, for
the different measurements of length, for example, in the width of railway tracks.
This new theory was very appealiug, in that Weyl was able to denve a fllndarnental
conservation law, the conservation of charge, directly fkom his proposed invariance of scale.
There was no denying that mathematicdly, the theory was beautifi, and it was very inspiring
to mathematicaily oriented physicists Like Theodor Kaluza and Vladimir Fock who were
searching for their own geometrcai unified field theories. Unfortunately, the direct
application of Weyl's theory to gravitation ainied out to be unacceptable. Einstein soon
pointed out that there was a problem, aamely, that the lengths of meaninng rods and the tirne
upon their histories. Thus, although Weyl's theory was elegant, its direct application to
gravitational theory turned out to be unacceptable. A decade later, however, &er the birth of
quantum mechanics, gauge invariance was to be totally reinterpreted and its mathematical
mechmics relied upon new mathematicai methods. In 1926, Eugene Wigner forged a link
between quantum mechanics and group theory, the formal mathematics of symmetry, when he
recognized that the theory of transformation groups could be used to solve problems involving
identical particles. Group theory was not at a l l new. In fact, it was a well-entrenched
mathematical theory which had flourished under Felix Klein and Sophus Lie in the late 19th
century, but was not at ali widely used withn physics until the 1920~~
whm it was realized
that group theory provided powerfiil methods which muid be used to treat the developing
mathematics of quantized spin and angular momenhim In 1927, Wolfgang Pauli introduced
three spin matrices which operated on a two-component mode1 of the electron. Wigner soon
recognized that the three Pauli spin matrices codd be identined as the three generators of the
two-dimensional representation of the group SU(2). Independently, this was aiso recognized
by Weyl, who was also very familiar with group theory,
In the wake of quantum mechanics, Fritz London was very impressed with Weyl's
theory, as well as with Weyl's conviction that it was correct in the face of conflicting
experimental evidence. The flaw with Weyl's original theory, according to London in 1927,
lay not in the identification of its non-integrable factor with electromagnetism, but in its
application as a gravitational scale factor. London believed that gauge invariance contained a
rnuch wider range of possibiIities th& Weyl had fim envisioned, including nothng l e s than
a logicd path to wave mechanics. The key to the reformulation o f gauge theory was the
reinterpretation of the Weyl scale factor- London recommended that the non-integrable scale
factor of the metric be reinterpreted as a phase factor of the wave-bction. Weyl accepted
this proposal with great enthusiasm, and in fact, he soon went beyond it. Just two years later,
he published his now famous paper of 1929, in which he suggested that gauge invariance be
used as aprinciple fiom which the electromagnetic interactons could be derived. This phase
invariance eventuaily tumed out to be a powemil tool in the development of quantum field
theory, but not until a few dccades later. This is because, again, although the theory was
mathematically sophistcated, its physical interpretation was problematic. Gauge theory was
stili regarded, in the main, as a mathematical cuiosity, because although it attempted a
In the early 1930s, the study of the nucleus revealed new interactions of short-rangeSince 1913, when Neils Bohr had argued that 6-decay was a nuclear process, there had been
This problem was so troubling, it even threw the conservation of energy into doubt! Pauli
saved the conservation law in 1930 by postulathg an undetected particle, now known as the
neutrino, which could cary off the missing energy. In 1932, James Chadwick discovered the
neutron, but it was stiU far fiom obvious how to explain how eiectrons were emitted fkom the
nucleus. EMCOFermi solved this problem in Iate 1933, by offerhg a field-theoretid
solution to the problem of @-decay. Electrons were not already present in the nucleus, but
uistead, they were created in the process of emission, dong with the associated neutrinos.
A year earlier, in 1932, Werner Heisenberg had discovered a symmetry within the
electrical forces oniy. In a way, Heisenberg considered the interaction between the proton and
neutron to be like an exchange back and forth of a spi1ess electron. His theory explained the
stability of nuclei weii, because at close range, attractive forces dominated, although at greater
distances, nuclei repelled each other. Although Heisenberg was experimentally refuted in
1936 by the discovery of charge independence of nuclear forces, his idea, and his
This new SU(2) symmetry was markedly dinerent fiom other known symmetries.
Previously, physical conceptions of symmetry such as rotation, Lorentz invariance, general
covariance, and even party had d l centred upon the symmetry of space-time, and even
electron spin was thought of in tenns of up and down. Isospin was M e r e n t Although
isospin drew its name fkom its mathematical similarities to the quantum theory of spin, it had
nothing to do with space-time, or even spin, for that matter- It was merely an intemal
symmetry demonstrating the relations between sets of distinct particles, yet its eventual
interpretation involved a conceptual leap. A rotation in three-dimensionai space is not thought
to change an electron intrinsicdy: the eIectron before and d e r the rotation is the same
electron, just with its orientation of spin reversed. A rotation in isospin space, however, can
change the identity of a particle.
physically transformed into a neutron. Mer this step was Uy appreciated, it meant that no
longer did protons and neutrons have separate identities: within the context of group theory,
their identities became h e d A few years earlier, in 1934, Hideki Yukawa was trying to explain both the theones of
interact by emitting or absorbing a paired electron and neutrino, this interaction energy is not
nearly enough to account for the binding energies of neutrons and protons within the nucleus.
To remove this defect, Yukawa proposed that the transition from neutron to proton was not
always accompanied by the emission of light particles, but on the contrary, it was more often
accompanied by the emission of a new, heavy particle, which couid then initiate a transition
from proton to neutron. In other words, neutrons and protons could exchange this new
particle, which came to be known as the Yukawa meson, back and forth, but a small fiaction
of the time, this particle could mediate a B-decay. nius, the new theory could simultaneously
explain the intense interaction of the neutron and the proton, and the relatively small
probability of 0-decay as predicted by Fermi theory. The only drawback was that the theory
reIied upon a hypothetical particle with specif5c properties: it wouId have to have a mass
intermediate between the electron and the proton, an integral spin, and either a positive or
negative charge.
A candidate for Yukawa's particle was soon discovered. In 1937, Neddemeyer and
Anderson reported the discovery of a heavy particle found withui cosmic rays whkh seemed
to fit the description which Yukawa had demanded. For the next decade, there were high
expectations that nucIear forces could be descnibed by a meson field theory, just as
electromagnetism was described by quantum electrodynamics (QED). During this rime, many
qualitative insights emerged fkom the theory, nich as the range, strength, and spin-dependence
of the meson, but the agreement with the observed Iifietime and scattering cross-section was
poor. In 1946, the theory was totally unable to explain the new results fiom the ConversiPancini-Piccioni experiment, which meanired the Merence in behaviour of the positvely and
negatively charged varieties of the meson. A year later, Marshak and Bethe solved the
problem: there were two different kinds of mesons! The first cosmic ray meson, the p-
meson, turned out to be a weaker interacting decay product of the even less stable, but
stronger interacting n-meson. Even after this clarification, however, meson field theory still
had problems.
meson field theory because the nuclear force involved was much more powerful than
electromagnetism, and so the coupiing constant was much Iarger. As more and more particles
were discovered, in fact, it began to be questioned whether mesons were even hdamental
particles at all.
Cosmic rays were proving to be a nch source of new particles- In 1947, Rochester
and Butler announced the existence of two new, unstable eIementary particles, which came to
be known as V-particles.
Since they were rare, they did not cause much widespread
excitement at kst, but that changed in the 1950s, when a senes of experiments reveded a
whole series of V-particks, which were distingui-shed by the abundance of their producion
together with their comparatively long lifetimes. In 1952, Abraham Pais attempted to
reconcile these two apparently conflicting properties, by offering a new selection rule which
would hold for electrornagnetic and strong couplings, but not for weak couplings, that is,
those of the same magnitude as found in & and pdecay. According to the new nile, the Vparticles should only be produced in pairs, a concept referred to as associated productionThe foIlowing year, Murray Gell-Mann provided a more refined and highly successfbl
weak interactions.
Strange particles soon led to the violation of the well-established consemation of
party. In 1954, Richard Dalia presented an d y s i s of the decays of two particles, the T-
meson and the 8-meson. Although the two sets of decay products belonged to distinct parityangular momentum States, in al1 other ways, the particles seemed to be identical- In 1956,
Lee and Yang suggested that these two particles might represent different decay modes of the
same particle, although this idea implied that the weU-entrenched concept of pari@, the
symmetry with respect to invariance under spatial refiection, wouid not be conserved in weak
interactions. In a senes of ingenious experiments, this stsrtling proposal was soon connmied.
The experrnental evdence Ied to a rapid clarification of the 8-Hamiltonian, and by 1958,
three different papers had offered a new and mproved version of the Fermi interaction, which
carne to be known as the vector-axial (V-A) theory. Although the V-A theory enjoyed much
success, it proved once and for a i l that the weak interactions could not be mediated by a rmeson field, as hoped by Yukawa, since a-mesons are pseudoscalar, and thus, they cannot
provide the vector coupling needed. Therefore, t seemed that the weak interactions were
unamenable to quantum field theory.
In 1954, Yang and M a s turned their attention to the strong interaction, and they
published two papers in which they proposed a new generalized gauge invariance. They were
very impressed by the fact that conservation of charge was related to invariance under phase
Their inspiration and strategy were similar to Weyl's, but their implementation was more
subtle. Although global isospin invariance is an obvious symmetry of the Yang-Mills
Lagrangian, local isospin invariance must be irnposed upon it, and not just by the introduction
of a single gauge field as is necessary in quantum electrodynamics, but by three of them.
Thus, Yang and Mills introduced a triplet of vector fields, which they c d e d the b field.
Unlike Maxwell's equations, which are Iinear in A, the isotopic gauge field equations are
nonlinear in b. This is because photons do not carry charge, but the isotopic field does carry
isospin, and so it acts as a source for itself. Mathematically, this is a consequence of the
transformations involved in SU(2) being non-Abelian (2x2 matrices do not cornmute, although
1x1 matrices obviously do).
guaranteed zero-mass in QED, there is no corresponding argument for the b quantum, and any
massive vector field spoils gauge invariance of the Lagrangian, and in addition, generates
divergences which nn renormalization. This problem was not overcome until over a decade
later. Until then, the work of Yang and H s seemed to be yet another w o n d e f i idea that
Nature had chosen not to exploit.
During the 1 9 5 0 ~accelerator
~
physics came into its own. The experimental study of
elementary particles had entered the era of big physics. Prevously, observations of cosmic
ray showers in cloud chambers and emuIsions had reveaied the existence of unstable particles
with short lifetimes, dthough they were able to travel a perceptible distance in the detector
before decaying. In c o n t r a the new high energy accelerators, with their new detector
technology, dowed physicists to systematically shidy the properties of the strong force by
examining tens of thousands of interactions. Scores of extremely shoa'4ved baryons and
mesons called resonances were found, and then characterized by theu production and decay
characteristics. In 1964, Murray Gell-Mann and Yuval Ne'eman introduced a clever
classification scheme called the Eightfold Way, based upon isotopic spin and strangeness, and
named for the octets which it produced. It proved to be very successful, and even though
most physicists were surprisingly ignorant of group theory, at the tirne, the Eightfold Way
was actually a representation of the group SU(3).
As more and more new particles were discovered, it seemed less and less likely that
they could al1 be considered as elementary. In 1964, Gell-Mann and George Zweig
independently proposed that mesons and baryons were not elementary particles at alI, but that
instead. they were made up of a mplet of fundamental particles cdled quarks. This triplet of
quarks corresponded to the fundamentai representation of SU(3). The success of the quark
mode1 was immediate, and it soon dowed for a shift in application of the V-A theory: it was
soon applied to quarks, instead of mesons and baryons.
Con-
fom, eventually proved to be suitable for describing both the weak interactions, and the
strong interactions. In 1967, it was shown that the weak and electromagnetic interactions
could be unified under the group SU(2)xU(1), which underwent spontaneous symmetry
breakdown at energies available in the labofatory, and six years later, the strong interactions
were treated with the new exact symmetry of SU(3) colour, and the theory of quantum
chromodynamics (QCD) was bom. Shortly after that, theorists began to get excited about the
gauge
groupGauge symmetry has thus become a central feature of theoretical particle physics. In
fact, certain choices of unined symmetry groups imply the violation of the conservation of
baryon number. How did syrnmetry corne to hold such a pnvileged position? To evaluate its
changing role, we m u t examine how physicists have actually used the concept of symmetry,
and how relations between consemation laws and both space-tirne and intemal symmetries
II.
physics, we need to explore how symmetry entered the discipline. This chapter is intended as
a prelude, an introduction to the ideas of symmetry and invariants as they existed in the early
twentieth century. It dso provides an historical sketch of generd relativity which wiIl fmish
the context for the beginning of our investigation.
result became essential for particle physics, Noether developed it early in her career in a very
different context, the context of invariants.' Before moving to Gottingen in 1916, where she
worked with David Hilbert and F e h Klein, Noether had studied at Erlangen, where Klein had
inaugurated the Erlangen Programm of 1872. Although Klein had since le& he had kept up
his contacts there with Max Noether who was an important rnathematician in his own right,
apart fiom being Emmy's father.
The Erlangen Programrn of 1872 marked a break in the development of group theory.
Klein took the concept of a group, already present in the theory of algebraic equations, and he
used it to clas*
different geometries.'
geometry, as the science which studies the invariants of groups? A particdar geometry is
then d e k e d by its properes which remain invariant under a certain group of transformations.
For example, in Euclidean geometry, areas and lengths remain invariant under translations and
rotations in the plane. The theory of invariants of a symmetry group became familia to
physicists and mathematicians through the work of both Klein, and his fiiend and coileague,
the Norwegian Sophus Lie?
In 1916, Klein and Hilbert were working on relativiy at Gottingen, and they
welcomed Noether because of her expertise in invariant theory. It was due to their innuence
that she stayed?
Mos importandy for us, she published, m 1918, her paper on the problem
These
fnctions were identities that contained the conservation theorems of energy and momentum
in the case of invariance with respect to arbitrary transformations of the four worldcoordinates. She estabiished two theorems in her fundamental paper, one for f i t e groups,
(ii)
results were important for Hilbert in his formulation of general relativity, as we s h d see.
Hendrk Lorentz, in 1904, showed that MaxweU's equations maintained their form
under the coordinate transformations,
provided the field intensities in the primed system are suitably c h o ~ e n - 'Lorentz
~
did not
think, however, that these equations were completely invariant. Because of the way he had
chosen to trmsform charge demity and cunent, he believed that his solution was only good to
a fxst order of approximation of v/c, and that second order quantities rnight show a
perceptible difference.
H~M
Poincar, however, believed in invariance of the electrodynamic equations. In
1905, he corrected Lorentz's formulae for the charge density A d curent, and he showed that
the field equations of electron theory kept the same mathematical form in the moving
Further, he showed that the transformations which effect the transformation must form a
group, which he named the Lorentz g r ~ u p . ' ~
A more profound understanding of the entire problem was offered by Aibert Einstein,
who wrote his 1905 paper, "On the electrodynamics of moving bodies", without knowledge of
the work of either Lorentz or Poincar.
... the sarne Iaws of electrodynamics and optics wiIl be valid for ail frames of
reference for which the equations of mechanics hold good.
(ii)
... light is always propagated in empty space with a definite velocity c which is
independent of the motion of the ernitting body."
The first postdate he referred to as the relutivity pnncQde. The second brought with it a new
meaning to simultaneity: events which are simdtaneous in one inertid m
e are not
necessarily simultaneous in another. Together, 'these two postulates M y specify the special
theory of relativity. Remarkably, both of them can be combined into the single requirement
that ali physical laws shodd be invariant under Lorentz transformations. From a
mathematical point of view, the special theory of relativity is the theory of invariants of the
Lorentz gr ou^.'^
This was cleariy demonstrated by Hermann Minkowski in 1908, using a clever new
formalism. He put space and time on an equal footing by replacing the time r, with an
imaginary quantity u=ict.15 He referred to a point in space and a particula. tirne, represented
by (x.y,z,u) as a world-point, and a path through space-time as a world-line. With his new
notation, we can see the similarity between the expression for length s in three-dimensional
space,
s = +J&
&?+y'+i+u'
(3
Although the two geometries are not identical because of the imaginary character of the time
coordinate,16 we see that the world geometry, or menic, is closely related to Euclidean
geometry. In the same sense that relation (2) is invariant under rotations in three-dimensional
-space, relation (3) is invariant under aD h e a r orthogonaI transformations in four dimensional
space-the. In other words, it is invariant under the Lorentz group." This invariance is
enough to demonstrate that the Iaws of physics are equally valid in ail neaial reference
systems, but Einstein was soon looking for ways to generalize his theory.
The general theory of relativty grew out of an attempt to extend the principle of
relativity to include gravitation. In his 1907 review article on the special theory of relativity,
"The principle of relativify and its consequences", Einstein introduced his equivalence
principle, which suggested the equivalence between a gravitational field and an accelerated
fiame of reference,
... we shail therefore assume complete physical equivalence between the gravitational
field and the corresponding acceleration of the reference fiame."
This p ~ c i p l has
e serious implications: it implies that in a gravitational field, clocks nui
slower, and the velocity of light is no longer constant In fact, light rays become curved.
1911, Einstein calcdated the influence of gravitation on light, after he realized that the
deflection codd be measured e~perimentally.'~In this paper, Einstein showed that the
velocity of light varies as,
accelerated name of reference and a different m e , at rest but subject to a homogenous field
of pvitation, the identity of inertial and gravitational masses can be denved? Lookig
back on this period, Einstein claimed that already he h e w that gravitation would have to lead
him beyond the Lorentz transformations, but he was not yet sure how faru
S.
il.
In the summer of 1912, when Einstein moved to Zrch, he began to search for the
mathematical structure of general relativity. By then, he had already convinced himseE of the
be a limiting case, and he did not want the laws of nature to depend upon the absolute
in 1913, after stating these four constraints, Einstein went on to discuss the consequences
which would follow fiom the general invariance of the line element ds, given by,
(5)
+g&@e
where the symmetric tensor g,, represents the metrical dependence upon space tirne."
The
ten quantities gpvare interpreted as a characterization of the gravitational field, and they
replace the scalar potential @ of the Newtonian theory."
With his =end Marcel Grossrnann, who was a professor of mathematics, Einstein
studied the mathematical literature, especiaily the theory of invariants and the absolute
differential calculus of Ricci, Levi-Civia and christoffel? It tumed out that Bernhard
Einstein's task
was clear: he had to relate the invariants of this new geometry to his new phyncal principles.
In the mathematical section of the 1913 paper, Grossmann introduced covariant and
contravariant tensors, according to Ricci and Levi-Civita dong with two xts of quantities
and
and finally, he presented a second rank tensor, the Ricci tensor, which was a contraction of
In his section of the paper, Einstein gave the physical reasoning. He used the
Christoffel symbols right away in his analysis of the motion of a materiai point in the field of
gravitation. He considered the motion of a point of matter in a gravitational field,
6 ds-0
This expression is covariant, as expected. Next, Zinstein attempted to derive the field
equations. He guessed that the field equations should take the fonn of a g e n e r b t i o n of the
Newton-Poisson equation,
where k is the gravitational constant, and p, is the rest density of matter- Einstein Iooked for
an expression of the form,
is an as yet
undetermined covariant tensor of the second order which is to be constructeci fiom differential
operations out of the metric tensor g,. Grossmann, in tum, emphasized the importance of the
... this tensor, in the special case of an infinitely weak gravitational field, would not
reduce to the expression A~#LWe must therefore leave open the question as to what
extent the gened theory of the differential tensors linked with the gravitational field is
related to the problem of the equations of gravitation?
With the Ricci tensor d e d out, Einstein was p b l e to determine a tensor I
', which reduced
to At$ i the Newtonian limit, and which was a tensor under arbitrary transformations.
Einstein was forced to make a choice between these two conditions, and he chose the frstSubsequently, he restricted r,, to covariance with respect to h e a r tcansformations only.
Thus, he abandoned general covariance of the field equations,
... [we] must therefore abstain fkom seting up equations of gravitation which would
tum out to be covaraut with respect to arbitrary transformations. Moreover it must be
ernphasized that we have no clue regarding a general covariance of the equations of
gravitation?
Later that same year, Einstein believed that he had found a proof of the physical
unacceptability of the generally covariant field equaons, based upon a consideration of the
conservation of energy?
relativity in a gravitational field with varying g,, Einstein found that T,,alone was not
conserved, but the total energy was still conserved, as stiown by an equation involving tensor
This conservation is made possible by the appearance of a new tensor t", which Einstein
interpreted as the energy tensor of the gravitational field. Einstein justified this new tensor by
saying that it represents the physical fact that the gravitational field transports energy to the
matenal system? This had important consequences,
Since we demand the validity of the conservation laws, we restrict the reference fkme
to the correspondhg extent, and thereby renounce the sethg up of the equations of
gravitation in general covariant form."
In this initial sketch (En-
mathematical concepts had played a smaller role thau might have k e n expected. In fact,
Einstein wrote to his riend Michele Besso in March 1914, "The general theory of invariants
acted a s an obstacle. The direct way turned out to be the only one practi~able."~~
Einstein was happy, for the moment, that the conservation Iaws led to this condition,
more redctive than generai covariance, but this feeling did not Iast long. He realized the
following year, in 1914, that the key to extending relativity was not to be found by simply
incorporating gravitation into the special theory of relativity, but rather by using gravitation as
a means of breaking away fkom the privileged position of covariance for unifonn relative
motion, and gaiing covariance for general motion? He wrote a second paper &th
Grossmann, in which they reexamined general covariance? In the introduction, they
stressed that the field equations had to be covariant under both nonlinear coordinate
transformations as weIl as linear coordinate transformations if the theory were to contain an
extension of the principle of relativity and the principle of equivaience. They were looking
for field equations which had the highest degree of covariance possible, while stiIl completely
determinhg the g,,
... the gravitation equations set up by us have that degree of general covariance which
is conceivable under the condition that the fiindamental tensor g,, should be
compietely determined by the gravitation equations, in paaicular, it ninis out that the
gravitation equations are covariant under acceleration transformations (that is,
nonluiear tratlsformations) of many different kindinds4'
In an attempt to prove this, they derived four conditions which indicated the level of
covariance available, by showing which tratlsformations of the coorduizte system would be
justified."
They were,
where g is the determinant of the metric tznsor. These conditions were interpreted by using a
variational principle,
J
where H is interpreted as the HamiItonian of the gravitational field, and it is set using the
The allowed coordinate systems can then chosen such that, for the fixed boundary values of
the coordinates and their derivatives, the integraI J approaches an extremum?
Einstein and
... since the conditions &=O, with the help of which we have restncted the coordinate
systems, are the immediate consequences of the equations of gravitation, our
considerations show that the covariance of the equations is as far reaching as
possible?
Einstein soon changed his mind, and admitted that he had made a mistake. The
following year, in the first of a series of three papers presented to the Prussian Academy in
November 1915, he admitted that the field equations were no? uniquely deterrnined by the
restricted covariance, and these conditions were not enough to detennine the Hamiltonian
function either? All they were capable of doing, he realized, was limiting the choice of H
to invariance under linear transformations, which was not enough for a relativity of
accelerated motions."
determine the g,, ~uiambiguously,but this does not ruin general covariance! He renewed the
search for field equations, except by this time, he had,
general covariance of the field equations, which Be] abandoneci ody with a heavy heart in
the first place three years ago...ri48
A week after the three presentations, on 25 November 19 15, Einstein presented his
fuial, correct form of the field equations. The most important change was that the equations
were no longer expressed in the artificial foms he had introduced before. Instead, he codd
use the naturally suited Ricci tensor,
where T=T, Einstein demanded, as before, that the conservation of energy was satisfied,
This collection of quantities f, is necessary, since the curvature of space does not d o w for a
conservation of Pmalone, but it is not a tensor at aii: it only behaves We one under h e a r
transformations. Einstein concluded the paper triumph~illtly~
With that, the general relativity is fiaiiy compIeted as a logicd structure. In its most
general formulation, which turns the space-time coordinates into parameters devoid of
physical significance, the principle of relativity concludes, with conclusive necessity, to
a quite definite theory of gravitation-.-49
Early the next year, in 1916, Einstein gave an almost complete account of general
relativiv, his paper, "The foundations of the theory of general relativity". He explaineci
clearly that it was not enough to consider arbitrarily moving reference systems, but that all
coordinate systems must be allowed.
The generai laws of nature are to be expressed by equations which hold good for ail
systems of coordinates, that is, are covariant with respect to any substitutions
whatever?'
This ailows for a natural explmation of gravitation. The elements of the metric tensor gpv
which allow the general transformations are interpreted as the gravitational potentials, and
they determine the curvature of space, the amount of curvature being proportional to the
gravitational force,
This dlows the fusion of two previously quite disconnected subjects, gravitaiion and
geometry,
In November 1915, ive days before Einstein's presentation of his field equations,
David Hilbert presented covariant equations of gravitation as part of an attempt towards a
d e d theory of electromagnetic and gravitational fields?
was very different. Unlike Einstein, who had sought covariant equations of gravitation using
universal fnctions of the electromagnetic fields E,B, and the electromagnetic potential+.
Under the restrictions of energy conservation and Lorentz covariance, these ten b c t i o n s
By applying
Hamilton' s principle,
6 H dxdydzdt-0
(19)
the field equations can then be determined? The problem is thus reduced to f'hding the
form of this world-function, but this problem is obviousIy extremely W c d t Mie tried
many combinations of invariants without success, eventudy giving up." Nevertheiess,
Hilbert found Mie's work inspiring.
Hilbert incorporated Mie's vision of the world-fiinction within the mathematics of the
general theory of relativty. Starting nom a Riemannian geometry, Hilbert postulated that any
Hilbert thought that his axiom (ii) was the simplest expression of Einstein's fiuidamental idea
of general covariance, and he added that Einstein did not write the axiom in this way because,
... for Einstein, Hamilton's principle ody plays a secondas. role, and his fiinction H in
no way consists of general invariants and does not contain the electric potential~.~
Hiibert then offered his first theorem,
If the expression J is invariant urider arbitrary transformations of the four worldparameters, and contains n quantities and theV derivatives, and if fkom the conditions,
then n Lagrange variational equations for these n equations are formed, then in this
invariant system of n differential equati~ns~
four of them are always a consequence of
the remaining n-4 equations, in the sense that four mutually independent linear
combinations of these n dfffkrentiai equations and their derivatives are aiways
identicdy satisfedbl
Hilbert m e s this theorem without proof, but it is a consequence of Noether's second
theorem, which she proved in 1918P2 It shows that there are four relations between the n
fields under variation, and Hilbert thought that this theorem connected the laws of gravitation
and the laws of electrodynamics. He let J correspond to his world-hction, which depends
on 1014 fields, namely, the gravitational potentials g,, and the electromagnetic potentials q,
Cunously, Hilbert did not take ali of these equations to have an equal statu. Instead, he
argued that the gravitationai equations were fundamental, and that the electromagnetic
equations were a consequence (Folgeerscheinung)of gravitation?.
Einstein was very criticai of this method. He sharply cnticized Hilbert in a postcard to
Ehrenfest in May 1916,
1 do not like Hilbert's formulation. It is needlessly specialized and, as far as matter is
concerned, unnecessarily compiicated It is not honest ... in design [and indicates] the
Einstein was not alone. Hilbert's method was not regarded as acceptable by most physicists
at the tirne, because the variational principle was introduced as a . axiom, and variational
The question persiste4 for a few years, whether the theones of Hilbert and Einstein
were identical. In 1916, Einstein published a paper on Hamilton's principle in which he tried
to cl-
the connection between Hilbert's formulation and his own, which he thought to be
...in contrast to [the work 04 Hilbert, as few restrictive assumptions as possible about
the constitution of matter will be made. On the other han& contrary to my own
previous paper, the choice of the coordinate system shaiI rernain perfectly fie@
When Einstein was questioned about the physical interpretations of generai relativity, he
tended to place a formal energy consemation law in the forefiont, because in his earlier
procedure, fidl covariance looked rather d~ubtful.~'Conversely, Hilbert tried to exploit
covariance physically. Ultimately, it was Felc Klein who completed the qbthesis of the
views of Einstein and Hilbert?
also published in 1918- Klein concluded that both Hilbert's constraint condition for he
energy tensor of matter and Einstein's conservation of energy-rnomentum both fouowed fiom
general covariancePg Nowadays, Hilbert's conditions are thought of as conditions for the
curvahire tensor Rup,A,or more precisely, the contracted Ricci tensor R, which enters the
equations of gravitation. These conditions were written in 1902 by Klein's former pupil,
Luigi Bianchi,
~ ; v * ;
r+qL;
;*-O
(21)
(22)
These four equations may be interpreted as either the four identities of Hibert or the energy
conservation equations of Einstein, since each share the fom of the field eqyations. Thus,
Hilbert's theorem actualiy referred to a part o f the Bianchi identities which can be derived
5. Concluding rernarks
The concept of a classical field was developed significantly with the elaboration of the
general theory of relativity in 1915. AIthough it was constnicted as a relativistic theory of the
gravitational field, the generai theory of relativity was more than just another theory of
gravity. It was a new theory o f space and time that differed fkom all other physical theories
in uiat it geometrized a physical interaction: the gravitational field was interpreted as the
manifestation of space-time curvature while space-he was identified with a physical field
whose equations were determined by the distribution of matter. Because of its appeal, it
induced others to seek a d a r construction which could represent both the gravitational field
and the electromagnetic field. The first effort, as we have seen, was made by Hilbert in
191 5. A few years iater, in 1918, an attempt was made by his former student, Hennarui
Weyl.?' This is where our story begins.
1.
Weyl
career .
(1935) provides
Klein and Lie met in Berlin in 1870, and then they both moved
[Yaglom
Paris, where they worked under Camille Jordan.
(1988), p. 221
4.
to
5,
Hilbert tried to push through Noetherfs application for
Habili t a t i o n in the Philosophy Faculty at Gottingen.
At a
Eaculty meeting, he declared, "1 do not see that the sex of the
candidate is an argument against h e r admission as Privatdozent,
After all, we are a University and not a bathing e~tablishment.~
Even so, he failed. [Weyl (19351, p. 431.1
6.
Noether (1918).
7.
Mehra
found
(19741, p . 231
Il.
13 .
14.
16.
This implies, f o r instance, t h a t t w o world-points whose
distance f r o m each o t h e r is zero do not coincide-
xIr=x,coshw-ix,sinhw
x, =ix, sinhotx,coshw
with coshw=l/ (1-B) 35 and sinho=B/ (1-6)36. [Sakurai (1964), p. 201
4-
Einstein (1911).
dimensions,
%n(n+l)=IO28.
Mehra (1974), p. 9.
is defined
g,,=gv,, and for n=4
29.
30-
3 1-
33.
34-
35 -
36-
Einstein ( 1 9 1 3 )
37 -
38 -
39.
40.
41 -
42.
43.
Mehra (l974), p. 13 -
44.
45
46
See
47.
48.
49
Einstein (1915d), 8 4 7 ,
50.
51-
52.
Hilbert presented his equations in Gottingen five days
before Einstein' s presentation.
Mehra (1974) gives a thorough
discussion of the relation between the work of Einstein and
Hilbert- Einstein and Hilbert had a great deal of mutual respect
for each other, and they communicated quite frequently in 1915 Hilbert refexenced Einstein thoroughly, and in fact, because his
paper, Wrundlagen der Phyik " took a few weeks to be published,
Hilbert
was
able to refer to al1 four of Einstein's
Although very proud of his
communications of November 1915independent derivat ion, Hilbert himself considered Einstein to be
the principal architect of general relativity,
53-
54-
55.
B o t h Mie and Weyl (1922) refer to the function H as a
Hamiltonian, since it is used in conjunction w i c h Hamilton's
principle.
Dirac (1933) spelled out the advantage of using a
Lagrangian with the action principle instead of a Hamiltonian:
it makes the action a relativistic invariant.
56-
57.
One of the most serious difficulties, as Pauli (1921)
emphasized, was the fact that Mie's theory rested upon an
absolute potential.
The equations of motion do not rernain
unchanged if one replaces the potential @ by @+constant.
This
problem, already noticed by Mie, m e a n s that a material particle
c a m o t exist in a constant external potential field58. The scalar density dg, characteristic of general relativity,
simply ensures the invariance of the integral with respect to
coordinate transformations.
59.
Hilbert ( l 9 X S )
p 396.
60,
Mehra (19741, p - 2 7 -
63.
Hilbert (19151, p . 4 0 6 In addition, Hilbert saw a deep
afinity between general relativity, and Miers Zheory . Hilbert
broke the world function up into two parts, H=K+L, where K is the
gravitational part, and L is the electromagnetic part. Re could
relate L directly to the work of Mie, "Mie's electromagnetic
energy tensor is thus none other than the knvariant tensor
obtained by differentiating the invariant L with respect to the
gravitational
potentials
g,,
on
the
passage
to
the
limit . " [ H i l b e r t (1915), p . 4041
64-
Mehra ( 1 9 7 4 1 , p. 3 5 -
66.
E i n s t e i n (1916b), p.
67.
M e h r a (19?4), p. 4 5 .
68.
Mehra ( 1 9 7 4 ) , pp. 4 5 - 5 0 .
70.
M e h r a (1974). p. 4 9 .
1111-
71.
W e y l w a s a m a t h e m a t i c i a n , a student of H i l b e r t educated a t
G o t t i n g e n , where he tudied and taught for a l m o s t t e n y e a r s ( f r o m
1 9 0 4 t o 1 9 1 3 w i t h a break of a year, w h e n he was at Munich) .
From 1 9 1 3 t o 1 9 3 0 , he taught at Z r i c h .
There he met Einstein,
(1918-25)
but concerning matter itselc it was less wmplete, as the controversy with Hilbert had shown.
Gravity and electromagnetism were isolated fkom each other. Once the success of Einstein's
gravitational theory was established, however, there were a number of attempts to generalize it
even M e r . The frst who stepped forward, in 1918, was Hermann WeyI, a student of
HiIbert and a successor of the Gottingen tradition of mathematical physics. Weyl proposed
nothng less than a new geometry, and a unified theory of gravitation and electricity, which
were the only known fiindamental forces at the tirne. He was &er a pure innnitesimal
geometry, as he called it, based on the idea of parailel transfer, an idea within differential
In 1918, Weyl made an effoa to extend the general theory of relativity to include
eleceomagnetism as well as gravitation. As in Einstein's general theory, the square of the
distance between two infitesimally separated points, as before, takes a quaciratic Merential
form,
*g;&dr,
(1)
in which the gik are identified with the ten components of the gravitationai potential. In
addition, Weyl introduces a linear form which he identifies with the four components of the
electromagnetic potential 6,
d4-b,&
(2)
Weyl's unification is made within the context of an extension of Riemannian geometry based
not simply upon the metic, but upon the idea of the concept of paraiiel tramfier, dscovered
by Levi-Civit in 1917.' If P and P r are two points comected by a cuve, then one can
transfer a vector from P to P ' dong the c w e , keeping it parallel to itself.
In general,
however, this transfer of the vector is not integrable, that is, the vector that is obtained at
point P' depends upon the path taken from point P. The two vectors, in general, will not
coincide. Instead, they wiil have some angle separating them, where the aagle depends on the
curvature of space-the in the region enclosed by the two pathd
Weyl thought that Einstein's theory contahed a residual element of rigid geometry,
simply due to the historical accident that it developed out of Euclidean geometry? Weyl
noted that the metric aUows the magnitudes of two vectors to be compared, not only at the
same point, but also at any two arbitrarly separated points. He thought that since fiection
was non-integrable, length should be as well.
A true infinitesimal geometry should, however, recognize only a principle for
tramferring the magnitude of a vector to an innnitenmally close point and then,
on transfer to an arbitrady distant point, the integrability of the magnitude of a
vector is no more to be expected than the integrability of its direction?
Weyl suggested that this new degree of fieedom would aliow his geometry to exphin not only
gravitation, but also electromagnetiun The two forces would become uitertwined.
On the removal of this inconsistency, thete appears a geometry that, surprisingly, when
applied to the world, explains not only the gravitational phenornena but also the
electrcal. According to the resuitant theory, both spring fkom the same source, indeed
in general one cannot separate gravitation and electromagnetism in an arbitrary
manner?
Weyl d o w s non-integrability of length by suggesting an invariance of scale, or
measure. His plan is to then relate electromagnetism to this new kind of invaciance, just as
transformations of the space-the coordinates. Under p d e l tramfer, the change dEi of the
vector E, when transported nom point P to point P' is given by,
S-C ris&,
(41
and
at
P are pardel transfened to ci+dt' and $+dqr at P ', then their scaiar product at P r ,
This proportionality factor is taken to be infinitesimal and set to (l+@), giving the equation,
dg,-
(dy,+dy,)
-g&@
(7)
From this, we see that dc$ is a linear form, as s h o w in equation (2). Under this application
"... the intemal metrical connection (Ma~zusummenhang)of space thus depends not
only on the quadratic form (which is determined up to an arbitrary coefficient of
proportionaiity), but aiso on the hear fo
Thus, the space is characterized by both the quadratic form dF2 and the linear form d&.
This Iinear form ailows a change in the scale of the coordinates. For Weyl, it is n o w
possible to compare lengths of vectors oniy if they are measured at one and the same point
Moreover, it is no longer the actual values of the g
, that c m be detemiined by measurements,
but rather, only the ratios or the relations between them. This leaves a freedom of s a l e for
the potentials ,
g
hg, where X can be taken as a function of position. Substituthg Ag, in equation (7) gives,
'
,
s
-As,
These are transformations of scale, or measme, and Weyl referred to the invariance they
He thinks
Thus, under Weyl's new interpretation, Einstein's gravitational theory is only exact in the
absence of an eiectromagnetic field."
associated with the arbitrariness in the choice of the scale for measuring length.
Thus, Weyl is linking both gravitation and electromagnetism with the structure of the
underlying differential geometry. This geometry is not ody metrical, but also @ne, that is,
the vectors within the space are invariant under a multipLication altering their length."
This
affine space demonstrates both the usual invariance under coordinate transformations, and the
laws need to depend upon the metrk gik and the form t& in such a way that the changes
induced by the variation in each must cancel each other out, leaving the equations of motion
invariant Only then is there a d
Weyl's theory, besides being aesthetidy pleasing, also provides a natural formulation
of the conservation of electric charge. Following the example of Mie's theory, which Weyl
references, the action is written,
where Weyl c a s W the action density.12 Weyl writes that the actuai world is selected fkom
the class of all possible worlds by the fact that the action is extremai in every region with
respect to the variations of the g, and & which vanish on the boundary of that region. Weyl
then shows that, as Hilbert, Einstein, and Klein had shown previously,
... Cjust as] the four conservation laws of matter (the energy-momentum tensor) are
connected with the invariance of the action quantity, expressed through four
independent functions, so Hi the same way the law of conservation of electricity is
connected with the new scale invarianceyexpressed through a fifth arbitrary
h c t i o n . l3
This is because under the variations,
the integral (12) vanishes. Weyl was very impressed with this. In fact, he felt that this was
The manner in which the latter resembles the prhcples of energy and
momentum seems to me one of the strongest generd arguments in favour of the
present theory-so far as there can be any question at d of confinnation withh
the context of pure speculation."
It turned out very soon that the theory was not purely speculative: it had experimental
consequences which Einstein quickly spotted.''
Einstein was allowed to write a supplement to Weyl's paper, in which he noted that
the indeterminate factor in the line element rls, as weiI as in the metric g,k should be subject to
memement. Accordingly, both Iengths of rods and rates of clocks should be dependent on
their history, implying that chernical elements with spectral-lines of definite frequency should
not exist and the relative fkequency of two n e i g h b o h g atoms of the same khd should be
different in general. Einstein concluded,
Since this is not the case, it seems to be that the basic hypothesis of this theory is,
unfortunately, unacceptable, although its depth and daring must delight any reader.16
Weyl answered Einstein in his conclusion to the paper, and he attempted to eliminate the
difficulty. He admitted that when a dock or an atom experiences a strongly varykg
electromagnetic field, the quantities g, and 4, are affecte& but be thought that this should net
matter since these quantities are not directly measurable. In Weyl's opinion, one can only
speak with confidence about measurements made in a static gravitational field and in the
absence of an electromagnetic field, and so his theory was not in conflict with experiment,
It m u t be borne in mind that the mathematically ideai process of parallet transport,
which must form the basis of the mathematical construction of geometry, does not bear
any relation to the real process of the motion of a clock, the rate of which is
determined by the laws of nature.'?
Aithough this reasoning allowed Weyl to defend his theory against its discrepancy with
experiment, by the same token, it also isolated it fiom physical content. Therefore his theory
only provided formal, not physical, evidence for a comection between gravitation and
In the next two years, Weyl refined his argument to deal with ths problem, aamely,
the non-uitegrability of distance under paralle1 transfer. He explained why clocks and rods
did not expenence a parallel transfer fkom moment to moment, by distinguishing two modes
of detemiining a quantity in Nature, that of persistence (Behaming), and adjustment
We can give to the axis of a rotating top any arbitrary direction in space. This
arbitrary original direction then determines for aU time the direction of the axis of the
top when left to itself, by means of a tendency of persistence which operates nom
moment to moment; the axis experiences at every instant a parailel displacement. The
exact opposite is the case for a magnetic needle in a magnetic field. Its direction is
determined at each instant uidependently of the condition of the system at other,
instants by the fact that, in vimie of its constitution, the system a&usts itself in an
unequivocdy detennined manner to the field in which it is situated- A priori we have
no ground for asniming as integrable a t r a d e r which results purely fiom the tendency
of per~istence.~'
Weyl thought that his theory predicted how vectors and lengths would behave if they
happened to follow their tendency of persistence, but in actual fact, it was quite possible that
just as the magnetic needle adjusts itself to the magnetic field, the rates of clocks and lengths
of rods couid adjust themselves to the curvature of the mettic. As a result, there is a doubling
of geometry: there is the original geometry of the space-time continuum in which the parallel
transfer happens, and there is the natural geometry, which is constnicted from the readings of
what extent persistence and a d j m e n t modify one another is to start fkom the physicai laws
The German mathematician Theodor Kaluza agreed with Weyl that in generd
relativity, a full description of the physics can only be obtained by taking the electromagnetic
four-potential qi into account dong with the gravitatiod potentials g,
In 1921, Kaluza
presented a unification attempt of his own, achieved by a different methodz4 Whereas Weyl
had introduced a linear form into the Riemannian geometry which he associated with
electromagnetism, Kaluza found a way to give a unified description of gravitation and
elec&ornagnetism in the fiamework of Riemannian geometry by using five dimensions, rather
than four. KaIuza envisioned gravitation and electromagnetism both arising, in the same way,
which suggested to him that the F,, might somehow be tnincated (verstlimmelte) Christoffel
symbols r,,?
of the metric tensor and its denvatives. Kaluza thought that a unincation of gravitation and
electromagoetism might be possible with the introduction of the rather strange idea of a fifth
space-tirne dimension? There is nothing wrong with this addition of an extra dimension,
since Kaluza ensures that oniy four-dimensional variations of physicd quantities d l play a
physicd role. He achieves this by imposing a restriction: al1 derivatives with respect to the
fifth space-time coordinate must be either zero, or at least, a smaller order of magnitud than
the others. He c d s this the cylinder condition (~~lindrrbedingung)-"This does not mean,
of course, that the effects of the fifth-dimension are canceiied out, because of the way in
which the coordinates are embedded in the Cbristoffel symbols?
(the
f
&
Taking into account the cylinder condition, he wrote down all the values of ri, in tems of
the derivatives of g,.'9 Besides the 40 four-dimensionai ri, describing the gravitationai
field, there were dso 35 new three-index quantties: 16 rt,, 10 rom4 r,,
ro,
and 1 r,,
of the form,
ri.EWa.r-gr~.J
ro,=~(.!?o,.r+&.r)
~,.oo=-
%O.i
,or
2 r,,-,,,,=O
Kaluza thought the results, at first, did not look aii that encouraging. Besides the sixteen
quantities ,'I
Kaluza continued. Because it ensures the proportionality of F, and r,,, Kaluza thought that
components g, of the five-dimensional metnc tensor. The signifcance of the final, corner
component g, remained undeterrnned, for the time beihg.
Kaluza then attempted to find the equations of motion for charged particles in gravitational
and electromagnetic fields. In m
approximations. F h t , he asnimes the fields are weak, with g, which are close to their
Euclidean values. Second, he assumes a small five-velocity, v/c
electromagnetic field Unfortunately, as Kaluza recognized, this is not the case for matter.
This is because matter, in its fundamentai constituents (Urteilchen), is not at al1 weakly
charged, or slow moving. Kaluza quotes Weyl, "...macroscopic placidity stands in sharp
contrast to microscopic turbulence.""
but small. In addition, the charge-to-mass ratio of the electron is so high, that upon
substitution into the geodesic equation, terms involving g, are dominant rather than
45
negligible.
In spite of ail the physical and theoretical difficuities which are encountered in the
above proposal, it is hard to believe that the derived relationships ... represent nothing
more than a rnalicious coincidence. Shodd it be estabfished sometme that the scheme
is more than an empty formalism, this would signm a new triumph for Einstein's
general theory of re~ativity?~
Thus, Kalum had discovered an appealing mathematical structuret but he could not foresee the
consequences. He realized, at the time, that,
3. Concluding remarks
Weyl tried to iink electromagnetism and the coordinate invariance of gravity with an
underlying differential geometry in 1918. His idea was original and deep, but is direct
application to gravitational theory turned out to be physically unacceptable. This was pointed
out by Einstein, who noticed that the scale change associated with paralie1 transfer wouid
make atomic clocks dependent upon theu history, and lead to the smearing of spectral lines.
Even though Weyl's theory was flawed, however, its mathematics was inspiring. It showed,
for the fust tirne, how a geometrical significance could be ascribed to the electromagnetic
field, and fuahermore, it was the bridge by which the concept of non-Riemannian connections
was introduced to physics. It inspired Kaluza to set out to anain a unification of gravitation
and electromagnetism by a different aethod, and he, in turn, was led to a five-dimensional
relativity. Although both Weyl and Kaluza only achieved f o d unifications with classical
fields, they both exhibited the possibilies of associating a geometncal structure with
electromagnetism. Together, their work wodd prove to be Mporiant following the
development of quantum mechanics, when a new meaning for gauge theory wodd emerge.
5-
6.
12.
Pauli (1921) notes that under Weyl's theory, the action
integral under which Einstein derived his field equations is not
gauge invariant. Therefore, Weyl needs to set up his own integral
which
is
also
invariant
with
respect
to
scale
transformations. [Pauli (1921), 200.1
13.
14.
15Einstein (1918).
Einstein's remarks are i n c h d e d in a
postscript to Weyl's paper in the original version.
16 -
Einstein (1918).
1501
25.
Kaluza (1921).
26 .
32.
dd=g,.*$x,
Symmetry plays a larger role in quantum mechanics than in classical mechanics. The
actual solution of quantum mechanical equations is, in general, so difficult that direct
calculations only yield crude approximations. Altematively, many features of a system can be
deduced by considering fundamental symmetry operations: group theory can reveai features
which are not contingent upon any special assimptions regarding the forces or dynamical Iaws
involved,'
A g o u p G is defbed as a set of eIements g for which a single law of multiplication
law is defined. The product of any two elements of the group g, and g,, must satisfy four
(i)
(ii)
(iii)
(iv)
The nurnber of elements in the group need not be finite, or even countable. Note that the
group elements need not cornmute: g,g2#ggl.If a l l the elements do cornmute, the group is
called Abeliun. Translations in space and time form an Abelian group, but rotations do not.
Groups can be either nnite or idhite, discrete or continuous. In physics, there are groups of
groups are named after Sophus Lie, who was the first to undertake a systematic study of the
elements are known, the rest of the group can be generated by integration.'
Lie hunself
referred to these groups as infinitesimal groups, and he worked with both global and local
groups. The name Lie g r o q was introduced by Elie Cartan in 1930, but he used it to refer
only to locally Euclidean groups, which are now known as global Lie groupso Global Lie
groups have parameters which are space-time independent: these rigd groups include the
rotation groups, the Lorentz group, and groups of UILitary transformations. If the parameters
are ailowed to Vary IocalIy with space-the however, the group becomesmbZe.
Flexible
groups include the group of a l l coordinate traDSformatiom in general relativity, which we have
encountered dready, and local gauge groups, which will become of prime importance to us
later.'
The archetypal Lie group is an nxn matrix group, with continuous elements. The
Lorentz group, for instance, consists of a set of 4x4 matrices. The collection of all
orthogonal nxn matrices is the group O(n).' The group O(3) describes rotational symmetry
in three-dimensional space, and this is the symmetry which Noether's theorem reIates to the
conservation of angdar momenhun. Indeed, the entire quantum theory of anguiar momentum
is really clandestine group theory. Altematively, the collection of a l l unitary nxn matrices is
cailed U(n)? If we restnct ourselves to unitary matrices with determinant 1, we have the
group SU(^)." The group SU(2) has a mathematical structure close enough to that of O(3)
to
look familiar, but different enough to be suitable for the new physics."
Every group G can be represented by a group of matrices. For every group element g,
there is a maeix M, This correspondence respects group multiplication, in the sense that if
representation is said to befiithfuLn This does not have to be the case, as many distinct
group elements can be represented by the same rnatrk-l3 Each group of nxn matrices,
dimension^.'^ For example, the group SU(2) has representations of dimension 1 (the trivial
one), 2 (the fundamental represeetaton), 3, 4, and every other positive integer. A major
problem in group theory is the enumeration of dl the representations o f a given group. A
new representation can always be constructed by combining two old ones, but such a
In 1924, before the discovery of quantum mechanics, Woifgang Pauli was sudying the
Zeeman effect, the splitting of spectral lines in a magnetic field, and he found a simple
generalization.15 He noticed that no two electrons could have the same set of quantum
numberd6 A year before, Edrnund Stoner had proposed the d e that the number of
eleceons in each completed shell is equai to double the sum of the huer quantum numbers-l7
For an atom, these numbers are: the principal quatltum number n, the angular momentum
number Z which takes the values I ' , l . . n - 1 , and the magnetic quantum number m which
ranges fiom -&mg- The third number rn becomes important when the atom is subjected to a
magnetic fieId, because each Level (n,l) is split into 21+1 levels. Although the s u m of the
number of electrons in each shell n should be 218. Stoner did not offer an explanation why
this number is twice as large as expected, but Pauli did. Pauli described the Zeeman effect as
arking fiom the individual valence electrons d e r i n g fiom , "...a classically undescribabIe
"
."
two-valuedness (~weideuti~keit)
This two-valuedness, or duplexity, was pualing, and although Pauli did not understand
it, he interpreted it as a new quantum number.
same state, as described by a set of quantum numbers, came to be known as Pauli's exclusion
prin~iple.'~Later that same year, the interpretation for the two-valuedness of the electron
was furnished by two young Dutch physicists, Uhlenbeck and Goudsmit, who discovered an
additional degee of freedom for the electron, namely, electron spin. Paul Dirac remembers
that a few people were hinking about spin in those days, but initially, there was a lot of
opposition to the idea2* At the time, Uhlenbeck and Goudsmit were workhg in Leiden with
.
Paul Ehrenfest, who suggested that they take their idea to Lorentz. Lorentz told them that he
had already thought of the idea hirnself, but that it was impossible for the electron to have a
spin, because if it dici, the speed of the surface of the electron would be greater than the
velocity of light Uhlenbeck and Goudsmit went back to Ehrenfest and said that they would
Iike to withdraw the paper that they had given to him, but Ehrenfest said it was too late,
because he had already sent it off for publication, adding that its authors were young enough
to be able to affod a stupiciity!" The idea of spin, however, rapidly proved to be
ndispensable."
..
11-
derivation of Planck's law, based upon the counting of cells in single-particle phase space.
Bose did this by arranghg the total energy so it was distributed among various possible States
for the oscillators which compose the electromagnetic field, and then dowing equal
probabilities for any number of degrees of excitation of each of the oscillators?
Bose was
having trouble getting his work published, so he sent to Einstein, who was enthusiastic about
it, and translated into German for the Zeitschnfrfir Physik. Einstein then extended this
treatment into a quantum theory of an ideal monatomic gad4 This form of statisticai
mechanics is now known as Bose-Einstein statistics, and it applies to alI particles which have
integral spin, such as or-particles and photons, which are now called bosons.
Enrico Fermi was also working on the quantization of systems containing identical
partictes, but he found a different form of statistics? After reading Pauli's article on the
exclusion principle, Fenni realized that this principle would allow his ideai gas theory to
predict entropy values at both low and hgh temperatures without haWig to resort to any
arbitrary assumptions. In March 1926, he published this statistics, which applies to all
particles with half-integer spin, such as electrons and protons, which are now called
fermions.26
Independently, Dirac found the same statistics by a different method?'
In August of
the same year, Dirac published his work based upon the study of eigenfiuictions of a
Dirac
proved that an antisymmetricd eigenfiuiction vanishes identically when two electrons are in
the same orbit, which is the r e d t he expected fom Pauli's exclusion p r i n ~ i p l e -For
~ an
atom with several electrons,
... if the positions of two of the electrons are interchanged, the new state of the atom is
physically indistinguishable fiom the original one. In such a case, one would expect
only syrnmetrical fiinctions of the coordinates of all the electrons to be capable of
being represented by maPices. It is found that this allows one to obtain two solutions
of the problem s a t i s m g a l l the necessary conditions... One of the solutions leads to
Pauli's priciple that not more than one electron can be in any given orbit, and the
other, when applied to the analogous problem of the ideal gas, leads to the EinsteinBose statistical mechani~s?~
Thus, Dirac was the &st to show that the two types of statistics, now usually designated as
symmetricalIy ccupled to each other, and he found that the quantum States of this system
separate into two sets, one symmetric under exchange of the osciilator coordinates, the other
spectrum, but he compiained to Pauli that his calculations were imprecise and incomplete?
was unable to extend the results to higher n, he condted his &end, the mathematician John
von Neumann. von Neumann referred Wigner to Frobenius, and only two weeks later,
Wigner had solved the generd case! He published a second paper, which begins the paper
with an achowledgment to von Neumann, dong with the assertion,
... there exists a well-developed mathematical theory which one can use here: the
theory of transformation groups which are isomorphic with the symmetnc group (the
group of permutations).36
Thus, goup theory had entered quantum mechanics."
1927, Wigner had already found the principal resuIts of the application of group theory to
atomic spectra that wouid form the foundations of his book."
For Wigner, almost ail the d e s of spectroscopy can be denved nom the application of
group the or^.)^
importance of the rotation group for quantum mechanics. Since the states of a quantum
mechanical system form a linear manifold, any superposition of two or more states with the
same definite energy still has that same energy?
energy remains invariant, as expected fiom classical theory, but in quantum theory, we may
same definite energy. Thk means that w e can build up s p h e n d y symmetnc states-the
spherical harmonies, which lead to the quantum mechanical theory of angular momentum Jy
which in tum, may be applied to the possible electronic configurations grouped around the
stationary nucleus of an atom?
Even when the rotational symmetry of the quantum mechanical system is broken by a
weak electric or magnetic field, the system still has a degree of symmetry lefi. A magnetic
field, for instance, can be treated as a perturbation, which stll has a simple behaviour under
the symmetry operation which it destroys, namely, it splits an energy level w i i angular
momentum J into U
t
1 equdly spaced energy levels, with the spacing proportional to the
of the o?tical transitions orginating fiom the members o f each split level.
Although Weyl had been Ied to his interest in Lie groups through relativity? he also
applied his expertise to the emergent quantum mechanics, and he pubkhed his fkst paper on
his new methods by 19279' In the academic year 1927-28, Weyl delivered a series of
quantum mechanicd applications of group theory to students in Zrich, and by the end of the
year, even before Wigner, Weyl had published his own book on the ~ubject:~ Weyl's book
was profound, but the response from the physics community was mixed, and it was not used
very much. Born, Heisenberg, and Sommerfe1d al1 wrote letters to Weyl cornplainhg about
the dificulty of the book.44 Schfidinger offers a typical response in his letter,
... we know in what dire need we are of much of what you have to Say to us. PIease,
take the trouble to tell us these things in an easily comprehensible fashion and not with
too many concept formations that are new to us--ifpossible in shabby old wom
concepts, which you already fid boring, I know, It is fim to build new concept
structures, it is your innermost sphere of interest, but the physicai is still hidden in the
dark to such an extent, that we cannot hope to be able to work successfully in nich
darlmess with such compiicated, rinfamrliar instruments?
In 1935, Condon and Shortiey offered their opinion of group theory in the introduction to
their book, Theory of Atornic Spectre,
The reader WUhave heard that this mathematical disciplule s of great importance.
We manage to get dong without it When Dirac visited Princeton in 1928, he gave a
seminar report on his paper showing the connection of exchange energy with the spin
variables of the electron. In the discussion following the report, Weyl protested that
Dirac had said that he would derive the results without the use of group theory but, as
WeyI said, aiI of Dirac's arguments were reaily applications of group theory. Dirac
replied, "1 said 1 wodd obtaui the results without previous knowledge of group
theory.16
As an example of the different attitudes towards group theory, for Weyl and Wigner, the three
hand, relate these matrices to the well-known calculationai methods of angular momentun,
without invoking group theory."
Condon and Shortley think that when a physicist wants to Iearn of new theoretical
developments, one of the greatest barriers can be mathematicai methods with which he is
unfamiliar. For example, general relativity brought with it the necessity of leaming tensor
calculus and Riemadan geometry. Because these new methods can often be difficult,
Condon and Shortley choose to minimize the amount of new mathematics necessary to tackle
the problems of line spectra. They do not use group theory explicitly, although they do refer
the interesteci reader to the texts
Early in 1924, Otto Laporte, a student of Sommerfeld, was studying the spectruxn of
iron, and he found that here were two subsets of energy levels that do not intercombine?
He found that transitions were always within one of the two subsets, and never fiom one to
the other. Following Laporte, in May 1927, Wigner divided atomic States into normal ternis
transitions between normal and reflected states are allowed. Later that same year, he
published his seminal paper, "On the conservation laws of quantum mechanics", in which he
noted that these laws are associated with the existence of a uni-
reflection operator P
(Spiegelung) that cornmutes with the Hamiltonian H? AS result, states can be chosen nich
that P and H are simdtaneously diagonal. This means that P is consewed. The eigenvalues
Wigner stressed that although invariance under spatial reflection is well-defined in classical
the energy eigenstates are at least doubly degenerate. To prove diis, he used an operator,
which two years later, Wigner identified with the tirne-reversal operator in quantum
mechanics- A general feature of 2'-invariance is that it aiways involves relations between
diffeerenr states, unlike the situation for P, which is an hainsic property of a single state."
This means that there is no quantum number reiated to pinvariancece To apply tune-revers&
we must not
explain the magnetic moment of the electron, and to explain the fine-structure of the
hydrogen spectrum, his theory had a problem. It predicted states of negative energy, and it
was unclear how ta interpret them,
One gets over the difnculty on the classicd theory by arbitrariiy excluding those
solutions that have a negatve E. One cannot do this on the quantum theory, since in
general, a perturbation wiil cause transitions fkom states with E positive to states with
E negative..."n
Two years Iater, Dirac offered an answer in his paper, "A theory of electrons and protons"?
Dirac invoked the exclusion principle, so that two electrons could not iden-
and he postuiated that, "...ail the states of negative energy are occupied except perhaps a few
of small velo~ity."~~
These empty states he c d e d holes, and these holes exhibited the
properties of positively charged, positive energy particles. Dirac reasoned that the uniformly
filied distribution of negative-energy states would be unobservable to us, but an unoccupied
state, being exceptional, would make its presence felt as a hole. Dirac thought that this
concept could explain pair creation and annihiIation. Given enough energy, a negative-energy
partide could be lifted up into a positive-energy state, while leavig a hole behind, and the
Dirac identified the holes with protons, but he soon reahed that this was a rnistake.
The following year, he realized that if the particle in question were a proton, the chance of it
colliding witb an electron redting in anddation would be too great to account for the
stability of matter. In addition, Weyl had shown that this particle should have the same m a s
as an electrod" Accordingly, Dirac offered another interpretation,
... in the world as we know it, ail, and not merely nearly d,of the negative-energy
state for electrons are occupied. A hole, if there were one, would be a new kind of
particle, unknown to experimentd physics, having the same mass and opposite charge
to an electron. We may c d such a particle an anti-ele~tron.~'
1n 1933, the anti-electron was found by Car1 Anderson, a d he named it the ~ositron." h i s
and Weisskopf showed how quantum field theory naturaily incorporates the idea of admatter,
without introducng unobserved particles of negative energy, as welI as describing
satisfactorily the creation and annihilation of particles and photon^.^ In this context,
particles and antiparticles can both exist on the same level as quanta of the various quantum
fields.
soon viewed as a speciai case of invariance under charge conjugation, a temi introduced later
that same year by Kramers, in his paper, "The use of charge-conjugated wave-functions in the
63
hole-theory of the eleciron-" Kramers' work demonstrated that Ginvariance can be properly
treated only in the context of quantum field theory, and that the electromagnetic curent and
vector potential change sign under c!'
The lima. character of the states in quantum mechanics has interesthg consequences.
If we subject a state to not o d y rotations, but aiso ta ail possibIe Lorentz trdormations, we
can form a set of states with remarkable properties. In 1939, Wigner found that the
fundamental group for particle physics is not the homogenous Lorentz group of boosts and
rotations, but the inhomogeneous Lorentz group, now called the Poincar group, consisting of
these transformations p h trans1ati0n.s.~~
These infinite sets remain closed under any
Poincar transformation, that is, if we subject any member of any set to a boost, rotation, or
translation, it becomes a linex superposition of the original members of the same set- This
set is characterized by only two numbers: energy of the states at rest, which is the same for
ali members, and the angular momentun of the states at rest- Since the allowed
3. Concluding remarks
In 1927, Weyl and Wigner pioneered the application of group theory to the recently
discovered quantum mechanics. Although they were very successfid, and their work was
adrnired, others felt it was unnecessary to master this forma1 mathematics before proceeding
to the physics, and they opted for direct calculation instead At this eariy stage of
2,
5,
10 -
Pauli (1925).
16.
20. Dirac (1983), p - 40, Dirac adds that one of the first people
to suggest spin wa Ralph de Laer Kronig, who was working with
Pauli at the time. Pauli was skeptical, however, and so Kronig
abandoned the idea,
21.
274-80-
22.
To avoid the difficulty raised by Lorentz of speed of
rotation, the concept of electron spin can be thought of instead as
an intrinsic angular momentum23.
26. Fermi (1926). Fermi did not enclose his ideal gas i n a box,
according to the current treatment, but placed the particles
instead in a ttiree-dimensional harmonic oscillator potential. In
this w a y , he obtained a spherically symmetric, rnonotonically
decreasing gas density. [Fermi (1962), p . 178 - 1
27. According to Dirac, "Enrico Fermi had written about t h i s other
kind of statistics, but 1 had forgotten about Fermi's paper when 1
wrote my own work on the subject, and 1 made no reference to Fermi
in i t . Fermi wrote t o me, pointing out that he had been the first
to propose this kind of statistics, and 1 had to agree with h i m and
to apologize to him for forgetting about his paper. [Dirac (1983),
pp. 45-46]
Ir
30.
Heisenberg (19251,
33.
Heisenberg (1926a).
P a i s (1986), p . 26s
The paper in question is Heisenberg
(1926b). Although others refined Heisenberg' s work, this paper
34.
Wigner (1926a)-
36-
Wigner (1931).
39-
40.
41.
With the exception of the principal number n, a l 1 other
quantum numbers can be i n t e r p r e t e d as indices characterizing
representations of groups.
43.
emphasis on concepts.
The book was j u s t too abstract for most
physicists. Ir [Yang (1980), p . 10 - 1
45 -
47-
Pauli (1927), p. 6 0 8 .
48.
49.
50.
t e x t by B .
51.
Laporte (1924) .
52 -
Wigner
(1927b) -
5 3
Note t h a t Weyl (1931) and Pauli (1933) s t i l l use the name
l%iignaturell. 1 am not sure who was t h e first to use t h e term
l 1 p a r i t y U ,but i t appears i n Condon and Shortley (1935)
54.
Note t h a t the term parity, as used here, r e f e r s t o orbital
parity. T h e i n t r i n s i c p a r i t y of t h e proton and e l e c t r o n do not
corne i n t o q u e s t i o n , because for a l 1 of physics, one may introduce
t h e convention t h a t these two parities are p l u s I f any other
convention is chosen, l a q u a g e needs to be changed s l i g h t l y , but
observable conclusions are not af f e c t e d . Wick, Wightman and Wigner
(1952) stressed that t h e q u e s t i o n of whether t h e r e l a t i v e parities
of t w o states are measurable o r j u s t conventional is r e l a t e d to t h e
possibility
of
observing,
in
any
experiment
whatsoever,
the
56.
Pais ( 1 9 8 6 ) , p . 5 2 7 -
D i r a c (1928a), p . 612.
W i n s t e a d of E .
57.
58.
Dirac (1930)-
59.
60,
61.
N o t e that D i r a c
62,
63-
Weinberg (1977), p , 2 4 .
64.
66.
"In calculations using plane wave functions as a basis (Born
approximation) f o r processes-in which the appearance of electrons
and positrons is t r a n s i t o r y only, the odd order contributions
vanish identically."[Furry (19371, p. 1251
(l986), p , 527.
67.
Pais
68,
W i g n e r (1939) .
In 1926, Oskar Klein offered a connection between Kaluza's unified theory and the
quantum theory in his paper, "Quantum theory and five-dimensional relativity".'
Klein noted
that the ten gravitational potentials g , and the four electromagnetic potentids 6,can be linked
through a five-dimensional metric,
With this formalism, Klein could dispense with Kaluzaystwo approxhations, and stiii write
To relate the coefficients y, to the usud g, and 4pof standard relativity, Klein had to
make a few assumptions. The coordinates x ' , 2 3 , x 4 had to correspond to four-space, and gik
couid not depend upon the fi% coordinate x? He demanded that equation (1) be invariant
not only under the familiar group of point transformations?
$=J(x
for +1,2,3,4,
'3
but also,
x0-
iO+-(x'~
(3 1
under which ,
y remains invariant. H e is therefore justified in taking ymto be a constant, and
endowing only the ratios of the y, with a physicd meaning,' Klein was thus able to split up
such that the following differentials were invariant under the transformations of equatiom (2)
This implied that 7, behaves like a four-vector. In fact, snce the qmtities,
transfomi like the electromagnetic field F, Klein associated the 7, with the electromagnetic
four-potental,
This allowed Klein to derive the equations of motion for charged particles, giving both the
equations of gravitation and those of electrodynamics fiom the variational p ~ c i p l e ,
where
P is the ciwature of the manifold.) Because terms involving 2 do not appear, these
KLein then set out to establish the connection between the five-dimensionai relativity
theory and quantum mechanics.* He started fiom the Lagrangian,
P
i
'
aL
a (axi/w
for i=0,1,2.3,4. The motions in the fifth dimension are obviously not apparent in ordinary
experiments, but Klein thought that the observed motion could be thought of as a kind of
projection onto space-the of a wave-propagation which takes place in five dimensions!
By
averaging over the unobserved motion in the fiAh dimension, Klein could associate p,
with Planck's comant,
where X is the length associated with the period which Klein was able to calculate,
Klein took the mail magnitude of th-s value as vindication of the non-appeatance of the fifth
dimension in ordinary experiments. Although his resuits were incomplete, Klein was pleased
so far,
... the differential equation underlying the new quantum mechanics of Schrdinger can
be derived fiom a wave equation of a five-dimensional space, in which h does not
appear originally, but is introduced in connection with a periodicity in 2. Aithough
incomplete, this result ... suggests that the origin of Planck's quantum may be sought
just in this periodicity in the nfth dimension-'
Although Klein appeared to have attained a type of unification between electromagnetism and
gravitation, the theory remained marginal because it was formal in nature, and made no new
physical predictions. Yet although Klein established a definite link between five-dimensional
relativity and quantum mechanics, he ended the papa quite tentatively,
There is also left open the question as to whether the fourteen potentials are su&cient
to describe the physical phenornena or whether the Schrodinger method requires the
introduction of a new state-va.iable.8
ii. Fock and invariance 1192Q
At the same time, the Russian Vladunir Fock was carrying out similia work In
Fock presented his work, "on the invariant form of the wave and motion equations ..."9
1926,
While Fock's work was SHIin press, Klein's paper arriveci in Leningrad. Although the two
men achieved simiiar results, they did it through different rneth~ds.'~Like Klein, Fock ais0
found a relativistic generalization of the wave equation in five-dimensional space, and he also
showed that the trajectory of a charged pamcle can be regarded as a geodesic in a fivedimensional Riemannian space" by interpreting the scalar wave equation in a five-dimensional
Riemannian space as a relativitic g e n e r b t i o n of the SchrSdinger ~ave-e~uation.'~
He
went beyond mein, however, by d i s c o v e ~ ga new transformation invoIvhg the enigmatic
f331coordinate,
Fock noticed a set of transformations under which his relativistic wave-equation
remained invariant, l3
where f is an arbitrary fiuiction of the space-time coordinates, and p is the fiffh coordhite of
the manifold. Further, Fock found that he could rewrite the equations of motion in a form
which obeyed these transformations (dong with Lorentz traudormations) without relying on
the f a coordinate! t 4
Since this coordinate does not appear in his wave equations, Fock assumes that the
dependence of the wave fiinction J. o n p is given by an exponential factor. To eennire
agreement with experiment, Fock concluded that the wave-fiinction rnust take the form,
iC.-fi,e
Sri4
(14)
The significance of the superfiuous parameter p seems to Iie in the fact that it
implements the invariance of the equations with respect to the addition of an arbitrary
gradient to the four-potential."
Fock has therefore shown that two fiuictions J. and S.' obtained with the vector potentials
A and A-Vf
differ o d y by a factor,
which has an absolute value of 1. This implies directly that his transforr~tionsremain valid
if the wave-fiinction is multiplied by the exponentid factor,
Although Klein had also introduced the condition of periodic dependence of the ifth
coorduiate of the wave-hction, only Fock's paper contaios these transformations.
Fock included a footnote to his work in which he linked the dependence upon the fifth
coordinate to an earlier idea, put forward by Erwin Schrodinger,
ths new viewpoint, Fock's transformations were soon to become indispensable to elementary
particle physics."
In I9Z, SchrBdinger became a professor at the Universiiy of Ziinch, where Weyl had
worked since 1913, and the two of them bec-
with generai relativityand was making a few contri'butioons here and there, he was mainly
working on quantum theory at the the? In 1922, SchrOdinger wrote a paper in which he
tried to formulate Bohr's weli-known quantum conditions in the language of Weyl's unined
field theory. He called if "On a remarkable property of the quantum orbits of a single
In the paper, he started off with an explicit calculation of the change of length induced
in a vector by pzaliel transfer. Because of the linear differential form @&,,
the scale of
-f
t Vdc-A,&-A,&-A,dd
(19)
... the "genuine"(echten) quantum-conditions, that is, those that are sufficient to
determine the energy and thus the spectnun, are just sufncient to make the exponent of
the path-factor (19) an integer multiple of h/y (which is a pure number according to
the above) for dl approximate periods of the system."
Schrodinger then demonstrates how it holds for the cases of the unperturbai Kepler orbits, the
76
Zeeman effect, the Stark effect, and combined paralle1 electric and magnetic fields."
Schr3dinger goes on to discuss the implications of this propertyy
... ifthe electron were to carry with it in its orbit some "length (S~ecke)that is
transported unchanged in the motion, then CaIcuIated fiom some arbitrary point of the
orbit, the measure (MaflzahI) of this interval wodd always arrive multiplied by an
almost exactly integral power of
whenever the electron retumed with good accuracy to the original position and,
simultaneously, to the initial state of motion...24
Although he is not sure what it means, Schrodinger befieves ?hatthis is an important remit,
... the other possibility y = h suggests that the pure imagimy value,
might be a possibility, ia which case the universal factor [(20)] would be equal to
unity, and the scde of any transported interval would be reproduced d e r every quasiperiod.'8
'
Note that the constant yywhich was arbitrary but reai in Weyl's theory, is now i m w .
When y is chosen to have the value W2ni7 the scale factor becomes unity, and so the length
s a l e ceases to be non-integrable. SchrCidinger is not sure how to interpret this property. He
concludes, "...1 do not dare to judge whether this would make sense in the context of Weyl
ge~rnetry."~~
Thus, although Sctilodinger had recognized the importance of this property, he
you showed that on the discrete actual orbits the gauge unit (Eichenheit) with
y=27dh reproduces tseif in the case of a spatiaily closed path; moreover, you noted at
the same time that on the nth orbit the unit of length swells and contracts n tirnes
exactly as in the case of the standing wave that describes the position of the charge-..You even held in your hands the resonance nature of the quantum postulate long
-*a
before de ~roglie."~'
A week after he had written the letter, London gave a paper at the Wurttemberg branch of the
Gennan Physical Society at Stuttgart, in which he attempted to identw the Weyl measure
with the wave function. In January of 1927, he sent off a bnef note to
Naturw~senschaften," and a month later, he sent off a longer paper to Zeitschriftfr Physik,
of Riemannian geometry. He agrees with Weyl that the assumption of a rigid length-scale is
in contradiction with stnctly Local geometry, and that only d o s of the g, at a point, and not
under an infitesimai translation dr',and the proportionality factors &t are hctions of
position, and thereby characteristics of the scale-proportions of the space. Upon integration
(22)
London notes that in general, the gauge-measure depends upon the path, that is, it is nonintegrable. It is integrable (independent of the path), however, in the special case that the
quantities,
vanish. R e c d that this had led Weyl in 1918 to suggest that 6,should be interpreted as the
electromagnetic field. London wrote that since the electromagnetic field is interpreted through
the scde-relationships of space-tirne, characterized by the variabiiity of the gauge-measure, we
need to write out a proportion factor explicitly,
1-1,e"P'~
where (Y is the proportionality factor, and @ is the electromagnetic four-potential.M
Because there is no equivdence principle for elecromagnetisn, London thinks that Weyl need
not have womed about the universal influence of the electromagnetic field on rods or clocks.
On the contrary, Weyl shouid have not even considered it at dl, especiaiiy since Einstein's
arguments concerning the sharpness of spectral Iines was correct London continues, refening
to Weyl's 1921 arguments about persistence and adjutment,
In the face of ... experimental evidence, it mut have been an ununially strong
metaphysical conviction that prevented Weyl fiom abandoning the idea that Nature
wodd have to make use of the beautiful geometrical possibility that was offered He
stuck to his conviction, and evaded discussion of the above mentioned conhadictions
through a rather unclear interpretation of the concept of "real scaie", which, however,
robbed his theory of its immediate phycal meaning and attractiod6
London is interested, nstead, in the mtapped possibilities of Weyl's theory. London wants to
show in his papa that,
where i=1,2,3,4. This equation cornes fiom a complete solution W of the Hamilton-Jacobi
partial differentid equatiox~,'~
Weyl's theory demonm'ated that the vector +i was proportional to the electromagnetic fourpotential, Weyl had not specified the proportiodity factor a.* Inspired by SchrOdinger's
This bold choice makes it possible for London to identify the length scalar Z, as it varies
according to the Weyl theory, with the de Broglie field-scalar
+ (the wave-fulction).
By
substituthg the scaie factor (28) into the wave equation and by taking a ratio, London
significance. Notably, Schrodinger did not emphasize the conceptual leap here involving the
use of complex numbers. London, on the other hand, emphasizes the difference, although he
is not sure exactly what it is,
A more senous difnculty is presented by the complex fonn of the path-factor.
It is here not admissible to restrict oneself to its real part This is a reflection
of the fact that the wave finiction is IntrinsicaiIy complex... The meaning of
the fact that every length has to be regarded as complex, and that the whole
variation of lengths shows up as a change of phase without any change of
absolute value, is a question that 1 should not yet like to discuss?
The difference between Weyl's theory and London's is that Weyl thought that the
four-potential, which M y descnbes the electromagnetic field, was enough to determine the
path-factor. London, on the other hand, showed that these four components had to be
supplemented by a i%h, the wave-hctlon #, which was the physical object that behaved Iike
the Weyl rneasure? London was not sure how to interpret the complex form of this factor,
or the fact that it showed up as a change of phase only. This final step he left to Weyl.
Weyl's paper of 1929, "EIektron and Gravitation", extended Weyl's originai ideas past
London's reinterpretation. Whereas London's formulation had been tentative, Weyl's
formulation was complete, and in this paper, Weyl went beyond al1 previous ideas in
proposing that electromagnetism be demted fiom the gauge p ~ c i p l e .Since 1918, Weyl had
been cmvinced that there was a close andogy between gravitation and electromagnetism, and
conservation and energy-momentum conservation in the two respective theones, but in the
1929 paper, he was inally able to formulate the analogies between the two theories explicitly
by means of the tetrad (Vierbein) fornalism.
... this "pprinciple of gauge invariancew, is quite analogous to that previously set up by
the author, on speculative gromds;in order to arrive at a unined theory of gravitation
and electricity. But 1now believe that this gauge invariance does not tie together
electricity and gravitation, but raher electricity and mat~ter..!~
Weyl makes it clear early in his 1929 paper that the principle is to be associated with the
wave-equation $,
... the Dirac field-equations for J. together with the MaxweU equations for the four
potentids remain invariant when one makes the simultaneous substitutions,
s/ - + e f i ( ~ )
and
... the exponent of the factor multiplying $ is not real but pure imaginary. $ now
plays the role that Einstein's ds did before. It seems to me that this new principle of
gauge-invariance, that derives not fiom specdation but fkom experiment, shows that
electromagnetism is an accompanying phenornenon, not of the gravitation, but of the
material wave-function represented by &48
Weyl emphasized that the requirement of gauge invariance was related to the unobservability
of wave functions and the fact that it is their squares $*+, that have direct physicd
significan~e?~
Even though gauge invariance now applies to the wave-fbnction, for Weyl,
the principle is stilI set within the context of gravitation.
Since gauge invariance involves an arbitrary function X(x), it has the character of
general relativity, and can only be understood in this conte~t.~*
For Weyl, both the gravitational and the electromagnetic fields are related to a local
symmetry, entirely within the spirit of the gauge concept In this paper, Weyl attempts to
incorporate the Dirac theory into the scheme of general relativity? He takes
MO-component spinor equations of the Dirac type for electrons and protons as primary, and
then, introducing the tetrad f o d s r n , and introducing the requirements of local variation, he
introduces gravitation. Finally, by requiring local gauge invariance with bis new principle
goveming the transformations of electromagnetic potentials, he is able to derive
fomalism, the metric is established at a world point by a system of vectors e(a), where e(l),
e(2), and e(3) are red space-like vectors, which constitute a left-handed Cartesian coordinate
system, and e(O)/i is a real time-like vector in the futlne direction?' In this system of
tetrads, characterized by surteen components, a coordinate transformation A can be interpreted
as a rotations of the axes of the tetrad The tetrad allows the incorporation of spinors through
the introduction of extra components: Weyl introduces four real components x, @=O, l,2,3),
so that the components of e(a)in this system are designated @(a). In this system, Weyl can
describe the Dirac spinor $ by means of its components $i+,~;,~,-,$2-."
A vector t at P
(29)
By allowng the tetrads fieedom to rotate at different points, Weyl is able to implement the
a
The loosening (Lockemng) of the rigid relationship between the tetrads at different
point converts the gauge factor et', which remains arbitrary with respect to
fiom a
constant to an arbitrary b c t i o n of space-the. In other words, only through the
Ioosening of the rigidity does the established gauge-invariance become
understandable?
+,
This loss of rigidity means that the spinor components J., and & are no longer completely
detemiined by the tetrad, and this arbitrariness of the gauge factor actually forces the
The reinterpreted gauge invariance is stiU closely connected with the conservation of
elecaic charge, as it was in 1918, but rhis time the link is even stronger.
... the fact that the action integral is undtered by the infinitesimal varations,
In this paper, Weyl was aiming to make the resemblance between gravitation and
electromagnetism manifest, and he explicitly demonstrated the similarty between the
derivation of the charge conservation law, and the energy-momentum conservation laws fiom
invariance with respect to both general coordinate transformations and Lorentz ~ s f o r m a t i o r s
of the tetrad?
For Weyl, this is one the paper's most appealuig features. He regards the
gauge-factor in $ and makes the connection between the eqerimentdly observed gaugeinvariance and charge-conservation understandable...~
~ 5 9
In May 1930, Weyl gave his lecture "Geometry and Physics" at Cambridge, and it was
In the lecture, he
spoke against the program of unified geornetrized field theories with respect to its neglect of
quantum theory, although as we have seen, it was in this h e w o r k that the various aspects
of gauge symmetry and the concept of gauge fields had been di~coveredand deveIoped
Quantum mechanics needed to be reckoned with, and since the gauge nature of the
electromagnetic field had now been clarified, it was no longer enough to seek a unified
geometricd description of gravitation and electromagnetim,
... the electromagnetic field is more strongly linked to the matter field than to
gravitation ... if we are to speak of geometrization, then ... we must proceed fiom
geometrkition of the matter field. If this can be done, the electromagnetic field will
be obtained as a bonus."
Thus, gauge invariance fkorn this time forward, was to be associated with the new quantum
theory, in contrast to the older gauge symmetry with flowed fkom an extension of the theory
3. Concluding remarks
Eleven yens after Weyl's original theory, it was established that the electromagnetic
field is associated with a local gauge symmetry and has'a gauge nature. FoIlowing the work
of Kaluza, both KIein and Fock had discovered a relativistic generalization of the wave
equation, and Fock had discovered transformations of the wave equation involving the
unknown, fifth coordinate. This was liked to a property of quantum orbits which
Schr6dinger had noticed in Weyl's geometry. Ushg this resuit, London was able to
reinterpret Weyl's theory: it was no longer liaked with transformations of the metric, but it
was now associated with an imaginary transformation of the wave-function The original
theory of 1918 had been mathematically appeahg, but it was in direct contlict with
invariance now as aprinciple nom which electrornagnetism could be derived. Afthough its
ongins were in an extension o f the general theory of relativity, gauge symmetry was now
1,
Klein (1926a).
Although Klein does not reference Weyl, recall that for Weyl
(1918a), only ratios of g,, had physical meaning.
2.
4,
5.
T h i s derivation appears i n Klein (1926a), but
explanation is provided in Klein (1926b)-
7.
8.
9.
Fock (1927).
a clearer
10 .
13.
228-
17.
It is c u r i o u s that Fock made no mention at al1 of Weyl's
theory, since Schrodinger was enamoured with i t . Under a quantum
mechanical reinterpretation of the theory, Fockrs transformations
were soon to become gauge transformations.
18.
Schrodinger (1922)-
1 =l,e
-PLloe -fi&
(1921),
21.
ei
the
where r is the period of the orbital motion, and T is the timeaveraged kinet ic energy. In accordance with the virial theorem,
the independence
28
Schrdinger (1922), p , 23
30.
31.
London (1927a)-
32-
London (1927b)-
35-
36.
37.
38. In standard non-relativistic Hamiltonian theory, the HarniltonJacobi equation takes the form,
where
Vw+
For non-relativistic particles moving in an electromagnetic field,
the Hamilton-Sacobi equation becomes,
which
is
equivalent
to
the
equation
London- COVRaifeartaigh (1997). p. 19.1
as
used
by
40.
41-
Taking t h e r a t i o g i v e s ,
in
the
group
velocity
of
the
wave
g i v i n g the result,
45.
Weyl (1929a), p , 331- Once again, Weyl does not mite these
transformations as equations, although the meaning is the same- He
writes,
. . . the equations rernain invariant when one makes the
simultaneous substitutions
v by eUp
and
f, by f,-dWdx,
where A is understood to be an arbitrary function in fourspace.
46.
50.
53 -
54.
Actually, Weyl preferred a two-component theory because he
thought that the Dirac theory gives twice as many energy levels as
necessary. Weyl had his own way of dealing with the negative
energy problern Dirac was facing. He believed that by going over to
the two-component form of the theory, one would be able to
eliminate the lrextraItlevels, which he interpreted as the energy
levels of a "positive electron"In Weyl's words, T t is
reasonable to expect that in the two component pairs of the Dirac
field, one pair should correspond to t h e electron and the o t h e r t o
the protonFurthemore, there should appear two electrical
conservation laws, which ( a f t e r quantization) should state the
separate conservation of the number of electrons and protonsThese would have t o correspond to a two-fold gauge invariance,
involving two new a r b i t r a r y functions. [Weyl (1929a), p . 3321
Weyl continues, Ir . . . the formal requirements of group theory,
independently of t h e field-equations t h a t are to be brought into
agreement with observation, f o r c e the number of components to be
raised rom t w o to four, W e shall see that two components suffice
if
the requirement of
left-right symmetry
(parity) is
dropped." [Weyl (19294 , p. 3321
Thus, in 1929, Weyl's spinor formulation excluded the llnear
implernentation of parity, which at the t i m e , was considered a
disadvantage. After the discovery of parity violation in 1956,
there was a revival of interest in Weyl spinors, or chiral spinors,
as they came to be known, since they are eigenstates of the
chirality, or helicity.
55.
56.
Weyl (l929a), p.
57.
348.
58.
Although Weyl did not refer to N o e t h e r , this work was a
special case of Noethertstheorem.
Like Wigner before him, Weyl
made a result of Noether's theorem familiax t o physicists. He
managed to exhibit the analogy between the energy-momentum and
electromagnetic conservation laws in the context of field theory,
with the aid of the tetrad formalism [ W e y l (1929a), pp. 341-43.1
60.
Weyl
61 -
W e y l (l93lb), p . 5 8 -
(1931b).
On 8 May 1930, Neils Bohr gave his Faraday lecture to the Chernical Society of
London. In this lecture, he reviewed the current situation in nuclear physics, and he discussed
the problems of nuc1ea.r structure. At the t h e , empirical evidence fiom a-scattering had led
to the belief that nucIei are b d t up of protons and electrons.' Although this hypothesis led
to decent results for calculations of binding energy, confinement of electrons within a volume
of nuclear dimensions gave rise to quantum mechanical difflculties. F d e r r n o r e , it was
unclear how the protons and electrons were held together in the nucleus. Why for instance,
did four protons and two electrons hold together to form a stable helium nucleus, or a-
particle? The structure of heavier nuclei was even more complicated, although it could be
somewhat simplined by treating the a-particles as building blocks-'
intensity variations in band spectra, nuclear statistics were iderred. Helium nuclei obeyed
Bose statistics, as was expected for a system containuig an even number of particIes under
Pauli's exclusion principle. The next nucleus to be measured was the nitrogen nucleus, NI4.
As it was thought to be composed of an uneven number of particles, 14 protons and 7
electrons, it was expected to obey Fermi statistics, but it did not It obeyed Bose statistics
instead.'
This was problernatic, as Ehrenfest and Oppenheimer had just proven that a
composite system obeys Bose or Fermi statistics depending on whether the nurnber of
component particles is even or odd? Bohr understood this discrepancy to be indicative of the
limitation of regarding an intra-nuclear eIectron as a separate dynamitai entity,
Strictly speaking, we are not even justined in saying that a nucleus contains a definite
number of electrons, but only that its negative electrdication is quai to a whole
number of elementary mits, and in this sense, the expulsion of a P-ray fiom a nucleus
may be regarded as the creation of an electron as a mechanical entity.'
The 8-decay spectrum was p d i n g . Nthough the rate of decay could be predicted
with a simple probability d e , just as in the case of a-decay, the energy liberated within a
decay was found to Vary within a wide continuous range. This was in sharp contrast with the
energy emitted in an a-particle disintegration, which was nxed for each element, as expected
for a two-body decay. This was an extremely serious problem, and for Bohr, it threw the
conservation of energy into d o u b ~ ~
At the present time of atomic theory, however, we may say that we have no argument,
either empincal or theoretical, for upholding the energy principle in the case of @-ray
disintegrations and are even led to complications and difficdties in trying to do s a 7
Thus, the two great sturnbling blocks of nuclear theory were the quantum-mechanical
difficulties stemming fiom the confinement of electrons within a volume of nuclear
that electrons in the nucleus could not be describeci within the fiamework of quantum
mechanics, that they lost their property of determining the statistics and spin of the nucleus as
a whole, and that energy was not conserved when electrons were involved in nuclear
processes. Pauli, on the other hand, was unwilling to abandon energy conservation, and so he
proposed a radical new solution.
accompanied by an as yet undetected unknown neutrd particle, not yet observe& which was
carrying away the missing energy,
Pauli made this suggestion public at a June 1931 symposium on "The present status of
Pauli spoke on "Problems of hyperfine
the problem of nuclear structure" in ~asadena~
structure", and it was clear that he intended his neutron t o be a simultaneous solution to the
problems of both nuclear consitution and 8-decay. Since the new particle weighed very littie,
and it bound to nuclear matter, Pauli envisioned it combining with protons and electrom
within the nucleus. The electrons would still compensate the charge of the nudeus, wnile the
neutrons would compensate the nuclear spin. In October of that same year, the Comegno di
FisrCa nucleure took place in Rome, At the meeting, Samuel Goudsmit gave a talk on
hyperfine structure in which he rnentioned Pauli's neutron,
difficulties in nuclear structure and, at the same tirne, in the explanabon of the /3-
spe~tnim."'~Enrico Fermi was in the audience, and after the paper, he spoke to Pauli
privately. Fermi was impressed with the idea of the new particle, and in a discussion with bis
colleagues h m Rome, it was Fermi who suggested the Itaiian name neuh.ino.' He wanted to
differentiate Pauli's light neutral particle fiom the idea of a heavy neutral particle-"
In February 1932, James Chadwick published a letter in Nature makig a daim for the
possible existence of a neutron. This was a heavy particle that Chadwick had been looking
for since 1920, when Rutherford had suggested the possibility of a tightly bound combination
of a proton and an electron which might be a component in the structure of heavier n u ~ l e i - ' ~
In 1930, Bothe and Becker showed that when bombarded with a-particIes, boron and
beryllium gave off a powemil radiation, whch they had assumed initidy to be -y-rayd3 In
late 1931, however, Mme, Curie-Joliot and Mt JoIiot showed this radiation to be even more
penetrating than y-radiation, capable of ejecting protons fiom matter containing hydrogen at
speeds up to nearly 3x10' cm,sec. The Curie-Joliots had interpreted this as a process
analogous to the Compton effect, concIuding that the beryllium radiation was a quantum of
Chadwick did not fike this conclusion, because it was difficult to reconde with the
conservation of energy and momentum, and so he offered an alternative,
It is evident that we must either rehquish the application of the conservation of
energy and momentum in these collision or adopt another hypothesis about the nature
of the radiation. If we suppose that the radiation is not a quantum radiation, but
consists of particles of mass very nearly equal to that of the proton, all the difficulties
c o ~ e c t e dwith the collisions disappear-"
He reasoned that &ce the new radiation was extremely penerating, it must have a charge
very smaii compared with that of an eiectron- He then offered what he thought was the
simplest hypothesis, that the radiation conssted o f new particle consisting o f a proton and
electron in close combination, givhg a net charge of zero, and a mass slightly Iess than that
of the hydrogen atom. He concluded, "...the neutron hypothesis gives an immediate and
simple explmation of the experimental facts; it is consistent in itself and it throws new light
laws of quantum mechanics, in tenns of the interaction between protons and neutrons. In the
paper, he assumes fiom the outset that nuclei are made up of protons and neutrons, and that
they do not contain electrons. This le& to a considerable simplification of nuclear theory.
By taking the neutron to obey Fermi statistics, with a spin %fi, Heisenberg bypasses the
diffcuties of the statistics of NI4 and the uncertahty o f nuclear moments, and he is able to
reduce the difficulties of the theory of 6-decay to a single question, under what circumstances
Heisenberg treats the attractive force between the proton and neutron as a position exchange
(Platmechsel), in analogy with the van der Waals force binding a hydrogen atom and a
positive hydrogen ion together within the Hc ion by electron exchange. Similady, he
postulates an attractive force given by the function K(r) between two neutrons, in analogy
widi the attractive force between two hydrogen atoms in an H, rnolecule. Note that apart
from these two forces, neutron-proton and neutron-neutron, there is no other nuclear force.
Between protons, Heisenberg assumed the only the standard Coulomb repulsion e 2 f d 9
Heisenberg had noticed that in the kinetic energy and.the exchange energy ternis of the
nuclear Hamiltonian, there was a symmetry between protons and neutrons.20 To make it
easier to write down the Hamiltonian frmction, Heisenberg introduced a new variable
into $=- 1,
exchange, there are transitional elements in the Hamiltonian that change pE=+l
which are embodied in the following matrices:
Although these matrices resemble the Pauli matrices, the coordinates t, 9,and
refer to a
different space. In analogy to the mathematics of spin, this vansable was later referred to as
p-spin, although it came to be lcnown as isotopic spin, and later still, as isospin Initially,
some physicists felt that this formalism was too complicated. For exampie, Ettore Majorana,
in his paper of 1933, was pleased that he could avoid those "troublesome p-spin
coordinates"."
With respect to kinetic energy and exchange energy, the Hamiltonian is symmetric
with respect to protons and neutrons." Heisenberg's Harniltonian term for the exchange
interaction is,
J(rJ ( p f * p r + p ; p ; )
(2)
where r 3 1 r,-r,l, and J(r,) is the strength of the exchange interaction. For either two protons
or two neutrons, this operator is zero, but if one of the pair is a proton, and the other is a
neutron, this term exchanges their isotopic spin. Heisenberg goes on to consider the stability
of the nucleus. He pictures it as a structure containhg generally a few more neutrons than
protons and in which pairs of protons can bind together with pairs of neutrons to form
particuiady stable particles, that is, oiparti~les.~~
He reasons that if a nucleus contains only
neutrons, then these neutrons will transform to protons by fl-ray emission until the energy
gained by adding a proton is exactly the same as the energy used to remove the neutron. In
other words, a nucleus is unstable against 8-disintegration if the replacement of a neutron in
102
the nucleus by a proton makes the energy of the correspondhg atom smder?
The first published statement of the Pauli's neutrino hypothesis appeared in the
discussion of Heisenberg's thrd papa on nuclear structure at the Solvay conference, in
Brussels on October 1933.'~ M e r the paper, Pauli commented that b o t . spin and statistics
conservation would be provided by the neutrino if it had spin % and Fermi statistics, and of
course, energy and momentum conservation would aiso be g~aranteed.~'As to the m a s of
the object, Pauli lefi this question open Enrico Fermi was present at this confrence, and
although he restricted his comments to problems of nuclear forces he must have thought
about the 8-decay theory, because his e
s
t paper on the subject was published only two
months d e r this conference.
Until Fermi's theory of @-decay,the idea of the neutrino had been a vague hypothesis,
and a formal theory had not yet been constructed. When Pauli fmt suggested the particle in
1930, electrons were believed to exist in the nucleus, and the hypothetical neutral particle was
considered as another nuclear constituent, with a smaU but finite rest m a s , Fermi had been
thoroughly studying Dirac's quantum theory, and he combined it with Heisenberg's spin
formalism to constmct a theory in which electrons did not exist in the nucleus before the
decay at all, but were created, together with neutrinos, in the process of e m i ~ s i o n . ~ ~
Fermi intended to announce the results of his 6-decay theory in a letter to Nature, but
he was rejected! The editor thought that his theory contained, "...abstract speculations too
In December 1933, he
submited a five-page paper, containhg all of his essential results to Ricerca Scientrfca,
where it was promptly published. Longer papers were sent to Nuovo Cimento and Zeitschrr3
like the number of photons in the theory of radiation, is not necessarily constant. During j3decay, eIectrons and neutrinos can be created in emission fiom the micleus, just as photons
can be created in emission fiom an excited atom, accorduig to Dirac's theory."
Fermi
recognized that it was difficult to explain how electrons or neutrinos could be bound in orbits
of nuclear dimensions, so following Heisenberg, he assumed that only heavy particles, protons
two quantum states of a single heavy partide, corresponding to the two values of the "inner
-1 for a proton.
such that each transition fiom neutron to proton would be associated with the creation of an
electron and a neutrino, while the transition of a proton to a neutron would be associated wth
where asand a,' are creation and annihilation operators for electrons, and,
where b, and bu' are creation and annihilation operators for neutrinos.
The total energy of the system cm be written in a Hamiltonian, with terms for the
energies of the heavy partides, the energies for the light particles, and the interaction between
heavy and light particles. For transitions, it is the interaction Hamiltonian which is important
with the creation of an electron and a neutrino, and dso tems Qa-4,which can couple the
are quantities which can depend upon the coordinates and momenta of the
heavy particles.
To specify the interaction Hamiltonian U , more precisely, Fermi uses the criterion of
simplicity. He notes that the choice of Hamiltonian is limited by the invariance under rotation
or translation of the space coordinates. Disregardhg corrections due to relativity and spin for
the moment, he writes down the simplest choice available,
Hi-gk2$
(7)
where g is a constant with the dimensions L ' M P , where L,.M,T are length, m a s , and tirne.
Fermi points out explictly that there are many possible forms of an interaction
Harnikonian which c m lead to 8-decay pro cesse^?^ Any scalar expression Like,
be just as good. Yet since expression (7) is in agreement with experiment, Fermi is happy
with it. Ali that is Ieft for him to do is to generalize it so that electrons and neutrinos can be
treated relativistidy. He replaces the $ and
t$
with the interactions of charges and currents with the eiectromagnetic field, he chooses vector
(9)
where $ and 6 are the conjugate spinors, and y is a 4x4 matrix, as required by the Dirac
where v,, and un correspond to wave-fiinctions for protons and neutrons bound in nuclear
the electron into the state s is then proportional to the square of this matrix element,
where W is the dinerence in energy of the neutron- and proton-states, and H, is the energy of
the electrod9 Fermi noticed that the matrix element was either close to 1, when the initial
and h a I nuclear states had the same anguIar momentum, or it was zero, when the angular
momenta were different Fermi interpreted the vanishhg of the ma& element as indicating
forbidden (verboten) transitions, not strictl'y forbidden, but with a much slower rate of decay
than that of allowed transitions.'
of the emitted 8-rays, and his results were in good agreement with experiment, so good in'
fact, that they soon convinced Bohr that Pauli's neutrino hypothesis was correct, and that
the charge exchange was somehow related to the B-decay process, but he did not yet have a
rnechanism for it. This rnechanism, however, had been provided by Fermi! Tamm and
Iwanenko noted that Fermi's theory provides the possibility of deducing the exchange forces
between neutrons and protons nrggested by Heisenberg. They argued that the creation and
subsequent annihilation of an electron and neutrino in the field of a proton and a neutron
wodd Iead to an exchange interaction between the two particies, in the same way as the
Coulomb interaction between two electrons is caused by the exchange of a photon between
where g is Fermi's couphg constant. As this is far too small to account for the interactiom
between neutrons and protons, Tamm had concluded,
Our negative resuit indicates that either the Fenni theory needs substantial modification
(no simple one seems to alter the results matenally), or that the origin of the forces
between neutrons and protons does not lie, as would appear fkom the original
suggestion of Heisenberg, in their transmutations, as considered in detail by Fermi.'3
Although Tamm and Iwanenko were unsuccessfid, they provided Yukawa with an important
Later that same year, Hideki Yukawa synthesized the theories of Heisenberg and
Fermi. The problem witt Fermi's theory, as shown by Tamm and Iwanenko, was that the
107
neutron-proton interaction was not nearly enough to account for the binding energies of the
neutron and the proton within the nucleus. In order to avoid this diffTcuIty, Yukawa proposed
that a neutron could change into a proton by a process different than Fermi's B-decay
mechanism: the negative charge could be transfened by a new heavy particle to a nearby
proton, thus transfomiing it into a neutron.
The transition of a heavy particle fkom a neutron state is not always accompanied by
the emission of light particles, that is, a neutrino and an electron, but the energy
liberated by the trausition is taken up sometlmes by another heavy particle, whch in
turn wilI be transfomed fkom proton state into neutron state?
If this process occrrrred with a large enough probabiIity, it couid account for the intense
interaction between neutrons and protons without affecting the srnail probability of 8-decay
Yukawa decided that this interaction between neutrons and protons could be described
by a new field of force, just as the interaction between charged particles is described by the
just as the electromagnetic field is associated with the photon, but with the special
requirement that it needed to interact more strongly with heavier particles than with light
particles, in order to explain why the interaction between neutrons and protons was so intense.
Yukawa attempted to describe this field between the neutron and the proton with a scalar
field U. He wanted an equation in analogy to the wave-fiinction for the electromagnetic
potential,
C
except that instead of the Coulomb potential, he wanted a potential which decreased more
rapidly with distance, expressed as,
where g is a constant with the dimensions of electric charge, and A is a constant with the
In the presence of heavy particles, however, the U-tield causes transitions nom neutron state
to proton state. Yukawa borrows Heisenberg's matrices, c a h g them r-matrices,
where the neutron and proton States are denoted by r,=+l and z,=- 1 respectively- The
equahon descnbing a &mition fiom proton to neutron is then given by,"
where iI. and $ are the wave functions of the neutcon and proton. The conjugate function U is
used to describe the inverse transition fiom proton to neutron,
Yukawa says that a similar equation would hoid for a vector function describing the U-field,
in analogy to the electromagnetic vector potentiai, but since there is no correct relativistic
theory for the neutron and proton wave-functions, he is usng non-relativistic wave-fnctions
for them and neglecting spin anyway, and so using a Pfield scaiar is adequate?
Under these conditions, he is able to k t e down a symmetric Harniltonian exchange
interaction,
where pu and
excepting, of course, the interactions between neutrons and the electrostatic repulsion between
protons which are described by other Hamiltonian terms.
Yukawa then discusses the nature of the quanta of the U-field- Since the neutron and
proton both obey Fermi statistics, the new quantum must obey Bose satistics, and so it can be
quantized Like the electromagnetic field. Accordingly, the Law of conservation of charge
parameter X as the Compton wavelength, Yukawa is able to calculate its masr, rn&dih.
Assuming h=5xlO-' cm-', he gets a value of about 200 electron masses. Yukawa is not
womed that such a Iarge particle has never been observed, because,
... in the ordinary nuclear transformation, such a quantum cannot be emitted into outer
space.... The reason why such massive quanta, if they ever exist, are not yet
discovered may be ascnbed to the fact that the m a s mu is so large that the condition
1 En-E,,, 1 >mU2is not fulfilled in ordinary nuclear transformation-"
Yukawa moves on to discuss 6-decay. The LI-particle can interact not only with heavy
particles (neutrons and protons), but also with light particles (electrons and neutrinos). M e r
it is emitted by a neutron, the U-particle, instead of being absorbed by a proton, may be
absorbed by a neutrino state of negative energy and then jump to an electron state of positive
energy? In diis case, an anti-neutrino and an electron would then be emitted
simultaneously fiom the nucleus. Therefore, Yukawa's theory matches Fermi's theory,
Fermi considered that an electron and a neutrino are emitted simultaneously from the
radioactive nucleus, but this is fonndy equivalent to the assumption that a Iight
particle moves fiom a neutrino state of negative energy to an electron state of positive
e11ergy4~
Furthemore, Yukawa's theory ais0 explains why the Fermi interaction is 1W8h e s weaker
than this new interaction. Because the U-field interacts with Iight particles, Yukawa needs an
where $ and 6 are the eigenfunctions of the electron and neutrino, respectively, and g' is a
new constant with the same dimensions as g- Yukawa obtains the matrix element,
corresponding to the double process: a heavy particle falIs nom the neutron state with the
eigenfunction u(r) into the proton state with the eigenfbnction v(r) and simultaneously a iight
particle jumps from the neutrino state q5k(r) of negative energy to the electron state &(r) of
positive energy. Since X is much larger than the wave numbers of the electron and neutrino
Inserting a
Yukawa obtains,
which matches Fermi's renilt corresponding to the emission of a neutrino and an eIectron of
positive energy states, except that the f-or
This means that the interaction between the neutrino and the electron is much smaller
than that between the neutron and the proton so that the neutrino wiiI be far more
p e n e t r a ~ gthan the neutron and consequently more difncult to observe?
111
Yukawa concluded that his arguments were only specdative, but thaf "...the massive quanta
may also have some bearing on the shower produwd by cosmic rays."'
In his first nuclear physics paper of 1932, Heisenberg had demonstrated nuclear forces
of short range between neutrons and protons and between neutrons and neutrons. Although
he did not introduce any proton-proton force, aside fiom electrostatic repdsion, he realized
that the approxirnate equality between the number of protons and neutrons in nuclei implied
that neutron-proton short-range forces could not be very dinerent fiom proton-proton forces.
Therefore, he was forced to assume the neutron-neutron force to be weak compared to the
neutron-proton force. In August 1936, however, it became clear that there was a short range
attractive proton-proton interaction comparable in strength wth the neutron-proton nteraction-
Three papers received by Physical Review within two days in that month mark major strides
analysis of these data and a cornparison with available proton-neutron scattering information.
They found that the interactions between protons was found to be nearly equal to the proton-
neutron force in the corresponding condition of relative spin orientation and angular
T,
In this paper, we show how the use of a coordinate having two proper vaiues which
tells whether a particle is a proton or a neutron, together with the assumption of the
Pauli exclusion principle for dl the particles, gives a unified description of the various
types of exchange forces."
Cassen and Condon suppose that each heavy partide (proton or neutron) is descrbed by five
coordinates: three for the space coordinates, one for the spin, and a fifth coordinate
7,which
takes on the values kl, that is, +I if the particle is a proton, and -1 if it is a neutron.58
They cal1 T the character coordinate, and i d e n t e it with the component of a character vector
dong a chosen preferred direction, and although they emphasize the analogy with spin, they
make it explicit that the components of r do not refer to directions in space. Under
Cassen and Condon then apply this formalism to four d i f r e n t types of exchange forces
which had been proposed, not just Heisenberg's exchange of spin and charge, but also Wigner
forces (1932) which exchanged neither spin m r charge, Majorana forces (1933) which
exchanged charge but not spin, and Bartlett forces (1936) which exchanged spin but not
charge?
nius, although Cassen and Condon relied upon the r symmetry, their main
charge independence means that nuclear forces between all pairs of heavy particIes are equal.
Wigner starts off his paper, "On the consequences of the symmetry of the nuclear
Harniltonian. .,"as follows,
Recent investigations appear to show that the forces between ail pairs of comituents
of the nucleus are approxknately equal. This makes it desirable to treat the protons
and neutrons on an equal footing.... Heisenberg introduced a variable r which we
shall c d the isotopic spin... The assumption that the forces between al1 pairs of
particles are equal is equivaient, then, to the assumption that they do not depend on 7
or that the Hamiltonian does not involve the isotopic spim6'
More precisely, this means that to take advantage of the isotopic spin symmetry, the
Hamiltonian operator may be wxitten in terms of space and spui variables alone, so that it
does not differentiate between neutrons and protons. This does not mean, however, that the
Harniltonian cannot be written another way. Alternatively, to highlight the spin symmetry
instead, the Hamiltonian can be written in terms of space and isotopic spin coordinates
done?
Wigner makes it clear that he is using group theory. He shows how charge
independence implies that nuclear forces are invariant under the group of isotopic spin
rotations, the group SU(^)? This allows Wigner to characterize multiplet systems of
nuclear spectra If a group of nuclear states has a total isotopic spin T,then there are 2T+l
states with the same energy, but with
-2' to
(24)
Note that an isospin multiplet of levels is a set of levels, not within a single nucleus, but
within a set of isobark nuclei- This is why isotopic spin is often called isobark spin. AU
states within the ame multiplet have the same anguia.momentum, parity, and energy.
Wigner realizes that the energies are only approximately equal, because as Breit, Condon, and
Present noticed, the levels are split by the Coulomb repulnon between protons.
Wigner takes the symmetry even M e r . By treating the spin coordinates and isotopic
spin coordinates on an equal footing, he introduces the group SU(4).67 This combined
symmetry applies to the extent that nuclear forces depend neither upon the orientation of spin
nor upon the orientation of isotopic spin. Wigner uses the representations of these groups to
characterize the multiplet structure of nuclear spectra, similar to how he applied group theory
to atornic spectra years earlier.
In March 1937, Neddermeyer and Anderson presented a paper, 'Note on the nature of
cosmic ray particles", in which their analysis of cosmic ray data suggested the presence of
There exist particles of unit charge but with a mass (which may not have a unique
value) larger than that of a normal fiee electron and much smaller than that of a
proton.. .68
They observed that these penetrating particles occurred with both positive and negative
charges, leading them to believe that,
Three months later, Oppenheimer and Serber sent a letter to the editor of PhysicaZ
Review suggesting that these recentiy discovered particles might in fact, be Yukawa's
... it has been suggested by Yukawa that the possibility of exchanging such particles of
intermediate mass would offer a more natural explmation of the range and magnitude
of the exchange forces between proton and neutron that the Fermi theory of the
electron-neutrino field?
This was the first mention of Yukawa's particle in the Western literature? In December
1937, Nicholas Kemmer sent a letter to Nature in which he acknowledged that it was
suggestive that this particle, which he called a "heavy electron" did give nuclear forces of the
correct range, but that the these forces had the wrong spin-dependence." Since Yukawa's
scalar field was insufficient, Kemmer suggested instead a vector field,
... a more satisfactory result can be obtained if one admits a vector wave function for
the new particle, such as was used by Proca Cl9361 in a different connection. Proca's
equations c m be quantized on lines completely analogous to the Pauli-Weisskopf
method Cl9341 for the scalar wave equation, and the resdting neutron-proton potential
can e a d y be determined."
On the following page of the same journal, a letter by Homi Bhabha appeared in which he
... the U-particles being charged, they m o t explain the close-range proton-proton
interaction. To formulate this, we would have to introduce a neutral particle N of
about the sarne mass Mu which obeys similar equations to those for the U-particles and
can be absorbed and ernitted when a proton jumps fiom one energy state to another.
The introduction of such a particle may not seem very arbitrary when we consider that
it would give us a symmetrical state of &S."
Thus, the neutral U-particle was the Grst particle predicted upon the grounds of symmetry.
Kemmer had already been thinking dong similar h e s . Earlier that year, before he
had learned of Yukawa theory, Kemmer had written about charge independence in comection
with a common mechanimi for Heisenberg's exchange theory and Fermi's theory?
Kemmer argued that in second order pemirbation theory and beyond, a charge-bearing field
... the emission of two electrons of opposite sign or of two neusinos by a proton or
neutron should be possiibe, in addition to the originalIy assumed em-ssionof one
electron together with one neutrino."
In fact, Kemmer went on to postulate that perhaps isotopic spin could be extended to the light
particles in such a way that the positron and neutrino would form a doublet, that is,
73
would
be +1 for the positron, and -1 for the neutrino. This remarkable idea was very prernahire,
and the magnitude of the forces did not work out correctly, but Kemmer's idea was
In his origial paper, Yukawa had described the U-field as a scalar particle,
positive parity. Kemmer drew out the consequences of a Yukawa particle that behaved iike a
pseudoscaiar, which dso has zero spin, but with negative partytYFinally, Kemmer considered
the two cases correspondhg to n=l, a vector part..dewith spin one and positive party, and a
Yukawa particle came in both charged and mcharged states." The biggest
hurdle he faced, in hs estimation, was to show that the neutral Yukawa particle was not its
own antiparticle,
... no relativistically invariant expression for the interaction is possible which would,
for instance, d o w the ernission of the "particle" only by neutrons, the "antiparticle"
only by protons. It is satisfactory, that it can be proved that the neutral antiparticIe
cm be completely eliminated fkom the theory, so that only a positive, a negative, and
one kind of neutral particle need be assurned to exist. In spite of the apparent
asymmetry of this procedure, the charge-independence d l ho1ds..."
This new idea holds for both the vector case and the scaiar cases? Up to smaU
electromagnetic corrections, a i i three of the particles would have the same mass. Hans Bethe
named this the "symmetrical theory"."
None of the propertes of the meson had been experimentally tested yet In 1940,
Wiarns and Roberts observed, in a cloud chamber, the decay of a negative meson into an
electrod8 This was to be expected according to Yukawa's prediction that the negatively
charged meson could take part in &de~ay.'~ The correspondin&decay of the positive
meson had not yet been observed, but based on these two types of decay, Tomonaga and
Araki predicted that positive and negative Yukawa mesons should produce very different
118
prefer to decay rather than be absorbed by a nucleus, since Coulomb repulsion hinders the
meson reaching the nucleus, while negative Yukawa mesons, on the other hand, should
strongly prefer absorption to decay. If this were tme, then practically all the decay processes
mesons by dense materiais, using iron and carbon as absorbas. In iron, the mesons behaved
as expected. In carbon, however, negative mesons were not absorbed nearly as much as
predicted.g' An andysis of this result by Fermi, Teiler and Weisskopf showed that the
interaction of the negative cosmic ray mesons with nuclei was twelve orders of magnitude
weaker than that of the positive r n e ~ o n .Even
~
by modifLing the spin of the particle, as
Kemmer had suggested, the disagreement codd only be reduced to ten orders of
magnitude?
Durhg the Shelter Island discussion of this problem in 1947, Robert Marshak
? ~ are two
proposed a solution, which he and Ham Bethe later elaborated in a ~ a ~ e rThere
different kinds of mesons which exia in nature, they said, possessing different masses: the
heavy meson is produced in the upper atmosphere and is responsible for nuclear forces, while
the light meson is a decay product of the heavy meson, and is observed at sea level where it
interacts weakly with matter?'
existence of these two types of meson with dinerent mas came fiom Lattes, Occhialini, and
precision?
Poweil and his group named the two types of meson, x-mesons and p-mesons. The nmeson, later called the pion, was identified with the Yukawa meson, and is much heavier than
its decay producc the p-meson, Iater caiIed the muon. The pion couples strongly to nucleons,
and is produced copiously, while its daughter, the muon, interacts mainly electromagnetically,
giving it a small scattering cross-section, and therefore negligible absorption."
This resolved
the earlier difficulties. Three years later, in 1950, the neutrai a-meson (no)was discovered
confimiing the preciiction of Kemmer based upon charge independence?
A number of
experiments established the spin of both the charged and neutlal x-mesons to be zero, and in
1951, the parity was found to be negative."
After the discovery of the p-meson, it was found that both its decay and its absorption
codd be described by an interaction quite snilar to the one introduced by Fenni for P-decay.
In June 1947, Bruno Pontecorvo realized that the Conversi-Pancini-Piccioni results indicated
that the rate of negative meson capture was very similar to K-capture, allowing for the
dBerence in mass. He therefore suggested that there might be, "...a fiindamentai analogy
between &processes and processes of emission or absorption of charged rnes~ns."'~
The
A-
similar rnagnit~de.'~' In addition, the similarty of the 0-decay coupling constant to p-decay
... it is remarkable that the three independent experiments: the B-decay of the nucIeons
and the p-mesons, and the interaction of the nucleons with the p-mesons lead to
coupling constants of the same order of
In addition, Lee, Rosenbluth, and Yang noted a merence in order of magnitude of processes
involving the p-meson, and processes invo1vng the n-meson,
..-if we assume the sc-mesons to have integral spin and assume direct couphgs for the
processes,
with coupling constants determined fiom the lifetime of r-mesons and the strength of
nucIear forces, the interactions between the nucleons can be quantitativel-y explained as
a second-order interaction through the vimial creation and annihilation of irmesons. 'O3
Even afier the distinction between the r and p had been cleared up, meson theory was
still in trouble. Meson field theory calculations could be trusted only to the frst nonvanishing order of perturbation theory, because when taken M e r , they ran into
infinitie~.'~
The numerous successes of quantum electrodynamics rely upon the fact that
multiple vimial photon effects shouid be small, and this is made possible by the fact that the
coupling constant a is smali, only 1/137. In Yukawa theory, however, the coupling constant
... one c m seldom manage to make a cdculation that is really right because the theory
is so complicated, and if one tries, more as a nile than an exception, one encounters
divergent infinite ternis which one usually attempts to eliminate by not perfectly
orthodox procedures. Perhaps at the root of the trouble is the fact that the theory
attempts to oversimplify a situation which may in fact be quite complicated. When the
Yukawa theory first was proposed there was a legitunate hope that the particles
involved, protons, neutrons, and n-mesons could be legitimately considered as
elementary particles. This hope loses more and more [ofl its foundation as new
elementary particles are king disc~vered.'~~
With the problems in meson field theory, theorists turned to oher methods, Their aim
was to try to make limited predictions concerning interactions of the new particles which did
not depend upon the magnitude of coupling constants. Sice isotopic spin had nothing to do
with perturbation theory, it could serve as a reliable guide. Back in the late thimes, Kemmer
had insisted that the consequences of isotopic spin invariance hold regardless of dynamical
highest pion energies (-140 MeV), the scattering appeared to proceed predominantly in the
isotopic spin H / 2 state. From the results of the experiment, the cross-sections for the three
processes,
?r++p-+r-+p
~-+~+lrO+n
n-+p+.rr-+p
stood in ratios close to 9:2:l. These numbers follow fiom isospin considerations alone as
long as 1=3/2 domina te^.'^' This is because since a a-meson has I=1, and a nucleon (N) has
1=1/2, a rrtN system has I=1/2 or 312 by combination of anguiar momentum- Since isospin is
In December 1947, Rochester aud Butler published their paper, "Evidence for the
existence of new unstable particles".'"
had discovered two unusuai events in a cloud chamberr one showed a forked track wiirch
they interpreted as the spontaneous decay of a neutrd particle into a pair of charged particIes,
and the other showed a track with a marked kink, m o a probably the decay of a charged
particle into another charged particle plus one or more neutrals. In both cases, the mass of
the parent part..de lay somewhere between 770 and 1600 me."' The Manchester V-particles,
as the particles were caiIed, did not attract much attention iriitially. Rochester later wrote that
the M O years following 1947 were embarrassing to the Manchester group because they did
not fmd any more V-particles, and other groups did not reexamine earlier cloud chamber
pictures right away, although they were making their own disc~veries.~"Leprince-Ringuet,
with his group at the cole Polytechnique, announced the discovery of very heavy mesons
group at Bristol found a charged particle with mass between 870 and 985 me decaying into
three particles, al1 believed to be
suffered fkom low statistics. They staaed to attract attention o d y afier Anderson's group at
Caltech published a papa in which they reported thirty-four cloud-chamber events: thirty
forks, and four kinks. They came to the same temarkable conclusion as had Rochester and
Butler, that, "... these two types of events represent, respectively, the spontaneous decay of
neutral and unstable particles of a new type.''114
Brookhaven National Laboratory was providing pion beams that quickly confkmed the
presence of ~-~articles."' This marked the end of an era-data on K-mesons and hyperons
soon began to pour out of the accelerator laboratores at a rate with which the low cosmic ray
intensities could not compete.
The V-particles were puPling, in that they were produced copiously, but they decayed
relatively s l o ~ l ~ If
. ~the
' ~mechanism responsible for their production were responsible for
.
their decay, they would be expected to have lifetimes on the order of 10-" seconds, but their
lifetimes are greater than IO-'' seconds. The first step in the resolution of this problem was
taken by Abraham Pais, who decided to look for new selection d e s , which would hold for Vparticle formation, but not de~ay.'~~
His first step was to divide their interactions into two
distinct groups,
The fist group ... comprises the nucleon-r-meson interaction and certain others of
comparable strength. The second group comprises very weak interactions b e ~ e e n
these same particles. The order of magnitude of the couplings is indeed keminiscent of
those that were introduced by Yukawa to descnie 8-decay through the intermediary of
bos~ns.,.'~'
This is the first clear distinction that we have seen made between strong and weak
interactions. Pais Iooks for selection d e s which would hold for strong and electromagnetic
interactions, but not for weak processes, By weak processes, he means reactions simila. to
neutrino processes in which there is a very small coupling constant-
Pais proposed an "even-odd rule, which in its simples fom, can be stated as follows:
assign a number O to all "old particles (x,N,y,e,~)and a nurnber 1 to the new
V-particles.'"
In any process, add these numbers for initial-state particles, and then for final-
electromagnetic processes, ni and nf m u t both be even or odd, but in weak decays of the new
particles, one mm shall be even, the other odd. Following these instructions,
s-+p4L0+lr0
is strongly forbidden, while
tP+~O+Q
is strongly allowed.'"
~'+p+nand
P+2n
proceed by weak interactions, although, "... such very weak couplings have so far only been
125
1rl2S
considered in neutrino processes, such as 13, p, a-decay--.
... this invariance can only exist if we disregard electromagnetic phenornena For the
electromagnetic field is precisely the agent which enables us to distinguish between
protons and neutrons and between the three charge States of the a-mesons. It should
be added that in what foilows the proton-neutron and the charged-neutral a mass
differences are considered as secondary electromagnetic effects.'"
Pais wanted to group aIi the new particles into isotopic spin multiplets, and then relate the
isospin to a new quantum number, a-party, which is even or odd. Isosph is then to be
... classification in "event' (strong) and "odd" (weak) interactions just corresponds to a
distinction between those interactions which conserve a-parity, and those which do
not. Thus the present picture seems to involve a hierarchy of interaction
corresponding to the symmetry classes of the entopic variabled2'
Pais realimd that this attempt was not flly developed, and he only intended it as, "... a fist
stage aUning at a Mler comprehension of the various particles and field^."'^
The next step in the reconciliation of strong production and weak decay was made in
August 1953 by Murray Gell-Mann with his paper in PhysicaI Review, "Isotopic spin and new
unstable partic~es".'~~
Gell-Mann starts off by giving a hierarchy of interactions,
126
... let us suppose that both "ordinary" particles (nucleons and pions) and "new unstable
particles" (V-particles) have interactions of three kinds:
Interactions that rigorously conserve isotopic spin, (We assume these to be
(i)
strong.)
Electromagnehc interactions, (Let us nclude mass m e r e n c e effects in this
(ii)
category.)
(iii) Other charge-dependent interactions, which we take to be very weakJ3'
Gell-Mann adds to this the ingenious proposai of assigning integer isospin to hyperons, and
half-integer isotopic spin to particl cl es.'^* Later that year, this same isotopic spin
assignment was dso suggested by Nakano and ~ i s h i j i m a 'Gell-Mann
~~
set the charged and
(Kmand their
In the presence of only strong interactions, the Z particles are mass degenerate, as are the K
mesons. (Note that ail multiplets are slightly split, and this splitting is ataibuted to
electromagnetic effects.) Under this scheme, the possible decay,
does not conserve isotopic spin, and so it is forbidden- Therefore, A must decay by weak
interactions. Although every process allowed by Gell-Mann complies with associated
and
?r'+p-+c'+Kare forbidden by Gell-Mann, although,
n-+p+C-+X"
is allowed. in this paper, Gell-Mann stresses that the third component of isospin I, needs to
be conserved by strong and eIectromagnetic interactions, but that it can be violated in weak
interactions.
GeU-Mann had trouble getting the paper published.'35 The editors of Physical
Review objected to the K-mesons being placed in isotopic spin doublets (K+,A!) and (@&-).
They could not understand, at first, how a 1Y" couid be different i*m p. They did not like
the fact that Gell-Mann was requiring that a neutrd meson not be its own particle, but this
had already been done by Kemmer in 1938.1M When Gell-Mann Sonned the editors of
thiq they brought up another problem-they thought, initidy, that the Pauli exclusion
principle implied that fermions had to corne in isotopic doublets, like nucleons, and that
bosons had to be isotriplets, Iike a-mesons. Gell-Mannpointed out that this was not m e , and
they fmally relented, but they refiised to let him use the title he wanted, which was "Isotopic
spin and curious particles",
Physical Review rejected "curious particles". 1 tried "strange particles", and they
rejected that too. They insisted on "new unstabIe particles". That was the only phrase
suniciently pompous for the editors of the PhysicaZ ~eview."'
A month later, in September 1953, Gell-Mann circulated a preprinf "On the
classification of particles". lu
multiplet structure, and he discussed the newly discovered cascade particle, which decayed
into A+L, later understood to be,
y-+A+rGell-Mann suggested that to explain the cascade decay in two steps via A, weak non-leptonic
decays shodd obey the
de,
\Al3\=%
difference between the charged V, particles (C',C-) and the uncharged V,' (A) being too great
for them to be an isotopic triplet, and so he decided to put A in a singlet lnstead, and
consequentiy, he predicted another new particle E0to complete the triplet, which he expected
Gell-Mann and Kazuhiko Nishijima both had the idea that this relationship could be extended
with a new quantity, which GelI-Mann called strangeness S, and Nishijima cded
?-charge.'"
where B is the number of baryons, and S is the a quantum nimiber which is conerved in
strong and electromagnetic interactions. This is a codification of Gell-Mann's earlier work.
It works because under strong interactions, Q,I', and B are al1 conserved, so S is conserved as
~ e l l . ' ~Under
'
weak interactions, however, 1, is not conserve& in facf 1 M31=%, and
therefore S is not conserved either.
Detailed streamiined accounts of this scheme, including more consequences, were
given in 1955 by Nishijima and in 1956 by Gell-Mann. Weak interactions not only obey the
d e , 1 LU,( =%, but they also obey the even more restrictive d e ,
pq=%
dong with
pl=%
The assignments for S were: +1 for K yI?; O for p. n; -1 for A, C,
p,K-;and -2
for I. AU
in ail, the scheme predicted three new particies: the go,the 5,and the @ which was
required to be different from the K? Remarkably, al1 of these particles were discovered
f a i W guide. In fac& even after additional quantum numbers were added, it continued to be
useful, and remains so even to this day.
P (S=-
(-1)
and
1) are non-identical particles, how can they be distinguished in the laboratory? These
two particles differ only in their strangeness, and they both decay through the weak
interaction, which does not conserve strangeness! Gell-Mann and Pais published a paper in
1955 in which they offered a concept of p a r t i ~ l e - ~ e .Consider
' ~ ~ the decay,
KO+7r++rwith amplitude A. Under charge conjugation, the final state goes into itself. Note that since
I(O has zero spin, the two .n-mesons have zero angular momentum so that the state is
unchanged when 7r'P.n-. The charge conjugate of the process,
*+u-+T+
also has amplitude A, (up to a phase which we are fiee to choose to be +1).
fmal state is the same in both processes, but the Initial state is not- It has S+I
in the frst
process, but S=- 1 in the second- How can the operation C transform the final state into itseIf,
but not the initiai state into itseif! It cannot, as long as S is conserved, which it is for the
strong interactions, but it can, and does, when S is not conserved, as in the weak interactions.
ES~+-W&J
Gell-Mann and Pais suggested the novel idea of introducing the combinations of one-partide
states,
Accordig to this combination, the state K,c m decay hto rr'+n- (with amplitude ~ h / 2 )but
,
& cannot decay at al1 that way (amphde zero). More generally, K,c m oniy decay into
even states under C, and
K -can o d y decay into odd states. These two states will therefore
... our picture of the [e-meson] implies that it is a particle mixture exhibiting two
distinct lifetimes, that each lZetime is associated with a different set of decay modes,
and that not more fhmt halfof aZZ [p-mesons] c m undergo the familiar decay into two
pions. lu
Gell-Mann and Pais redized at once that K2should live much longer than K,,because the
rates of decay into a
'
*
-
alternative decays. The fxst evidence for such a long-lived neutral K-particle was obtained
two years later, in 1956, at the Brookhaven ~osrnotron.'~'
The year 1956 marked a crisis in particle physics centred on the properties of the
mesons. Recall that at the Bagnres conference of 1953, the name K-mesons was given to
particles with mass intermediate between the r-meson, and the proton. At the rime, this
designation icluded two mesons, the r and the 8, which decayed by different modes,
9+?r+lr
7+rr+rrt?r
During the conference, Richard Dalitz presented his anaiysis of the r that was designed to
detennine its spin and parity through its decay into three pions."6 The three individuai
pions produced in i meson decay do not emerge with a unique energy, as is characteristic of
a three-body decay, and so Dalitz presented a convenient way of mapping possible energy
ranges in a finite two-dimensional region such that each point on the graph uniquely marked
the energy configuration of the pions.'47 From his anaiysis, Dalitz concluded that the r has
determined that the parity of a r-meson is odd. Neglectng, for the moment, the effects due
tu the relative motion of the n-mesons, if parity is to be conserved under decay, the B
have the product parity of two T-mesons, which is even. Similarly, the
must
parity of three T-mesons, which is odd- Including the relative motion of the r-mesons,
however, does matter, and so to make the argument conclusive and dennitive, it was
necessary to study experimentally the momentum and anguiar distribution of the umesons."'
that
and O do not have the same parity, and therefore are not the same particle- At the same
masses were almost identical, and their l i f e h e s were just as indistinguishable. Since a
particle cannot have a party which is now even, and then odd, there initially appeared to be
three
options. The first two were straightforwad First, perhaps the r and 0 are two distinct
particles rather than alternative decay modes of one particle, although they have nearly
identical properties. Second, perhaps the spin was not zero. For example, spin 2, with even
parity, would permt both 27r- and 3wdecay. This option, however, was contradicted by the
DaIitz plot which became increasingly convincing as more evidence was gathered. These two
ways out, which were abandoned shortly afienvard, represented attempts to save the situation
by conventional means. The third option was radical. Maybe there are not MO particles at
dl, but only one. Maybe parity is not conserved in the decay! Perhaps there is only a single
K-meson with spin zero, and both the 2.n- and 37r- decays are alternative modes of decay of
the same particle.149
Pursuhg this clue, Tsung Dao Lee and Chen Ning Yang searched the fiterature, and
they found that parity conservation had never been tested in weak interaction^,'^^
'"
Thus, Lee and Yang drew attention to a whole class of phenornena, instead o f just an exciting
but rather isolated puzzle- Further, they proposed possible experhentai tests of parity
conservation in 0-decay, and in meson and hyperon decay. To test the possible violation of
parity, they concluded that since panty reverses momenta and positions, but not anguiar
momentum (or spins), it is necessary to meanire a dependence of a decay rate (or crosssection) on a tem that changes sign under the parity operation.
aligned nuclei. IR Afier carefuliy aligning the spins of radioactive cobalt-60 nuclei with a
magnetic field, the direction of emission of 6-decay electrons was monitored, and it was
found that most of the electrons came out in the direction of the nuclear spin. That's all there
is to it, but this simple observation implies parity violation: under space-reversai, the
direction of nuclear spin reverses, but the direction of electron emission does not. Word of
this astonishing resdt spread rapidly even before the resuits appeared in print, and
Garwin,
Lederman and Weinrich, and independently, Friedman and Telegdi demonstrated parity
Rather than beginning with a polarized beam, these experiments exploited Lee and Yang's
prediction that muons should be polarkd dong the line of fiight in the decay,
r++/4++v
and in,
pC+e++2v
as they occur in the chah"'
P and C are violated in weak proceses wherever one looked. Moreover, the effects were
large.
Even before the experimental evidence had shown that P and C are violated, however,
another question had been posed by Lee and Yang, and by Lev Landau.'"
Assuming that
these ~ r symmeeies
o
are violated individually, is the combination of the two conserved?
We assume that in weak interactions these two invariance propeaies do not hold
separately. But we c m suppose that we still have invariance with respect to the
product of the two operations, which we c d combined inversion. Combined inversion
consists of space reflection with interchange of particles and antiparti~les.'~
This idea lasted until 1964, when the K-system also showed CP-violati~n.~~'
Theorists had
to
fall back M e r upon CPT conservation. Although each of C,P, and T, have a l l been
individually violated, the combination CPT has held up under scrutiny, which is a relief, since
any violation of it wodd require the revision of general principles of quantum field
theory. ISg
substitution, so that,
vv~"--~"k
while other fields ... remain un~hanged.'~'
Note the presence of the y, matrix, often referred to as the chiraiity operator, which can be
chosen as being diagonal with eigenvalues I l .'" Similar conclusions were reached by Lee
and ~ang.'" Under the parity operation, spin acts like an &al
... we wish to examine here a possible theory of the neutrino different n o m the
conventionally accepted one. In this theory, for a given momentum p the neutrino has
only one spin state, the spin being always pardel top. The spin and momentum of
the neutrino together therefore automatically define the sense of the screw. In this
theory, the mass of the neutrino must be zero, and its wave-fiinction need only have
two cornponents instead of the usud four.'@
This No-component theory of the neutrino is equivdent to the two-component theory of the
neutrino postulated by Weyl in 1929, which back in 1933, was heavily criticized by Pauli
the weak interactions. In his original theory of 1933, Fenni treated @-decayas the
transformation of a neutron into a proton, dong with the emission of an electron-neutrino
pair, which leaves the spin of the proton unchanged. In addition, the anguiar-momennimand
parity are unchanged. In 1936, Gamow and Teller had shown, by introducing spins of the
heavy particles into the Hamiltonian, that not al1 8-decays occur between nuclear States with
identical angular momenta, M=O,and so they introduced a generazation which could
The near equdity of the effective couphg constants in the processes of P-decay,
p-
decay, and p-capture led to the postulation of a universal Fenni interaction in 1949.'"
These interactions can all be treated as interaction between the pairs of spinor fields np, pv,
ev.IS9 In 1950, Louis Michel studied the most general coupling between four fermions
which involved the five different kinds of coupling possible: scalar, pseudoscalar, vector,
axial-vector, and ten~or."~He found that the aii five different coupling constants may be
lumped together into a single dimensionless constant p, ever since known as the Michel
para~neter.'~' Following Michel, in 1953, Konopinski and Mahmoud noted an ambiguity in
p-+e-+2v
p-+e-tz+"
It tumed out that o d y the second process is possible. They had the idea to look for a
principle which wouid tel1 them which decays were possible, and this led to the principle of
conservution of lepton number. Ascnbe by convention, a lepton number +1 to e-, p-, v; -1
for e+,
Armed with the conservation of lepton number and the known violation of p-,
Sudarshan and Marshak were able to determine the correct coupling of the weak interactions
such as,
they codd infer that there m u t be at least one type of S or V , and at least one of T and
A.'"
The combinations ST and VA both have the added merit that the neutrino is always
emitted with the same chirdity in both Fermi and Gamow-Teller transitions.'"
Anaiysis of
pion and kaon decay indicated that there is a dominant A interaction, and furthemore,
measurement of the angle between the electron and neutrino directions pointed towards the VA
coupiing, making it the best candidate. Sudarshan and Marshak concludeci,
Whiie it is ciear that a mixture of vector and axial vector is the oniy universai fourfennion int=raction which is possible and possesses many elegant features, it appears
that one published and several unpubfished results cannot be reconciled with this
hypothesis.'76
It tumed out that there was a problem with the ~e~ experimenf but it was soon resolved, and
in January of 1958, Sudarshan and Marshak published their definitive paper.
'"
Even before the experimentai issue had been settled, in September 1957, Feynman and
Gell-Mann proposed their own V-A theory on theoretical grounds of simplicity. They were
very impressed by the two-component neutrino, and they took it is a bais for intmducing
parity violation into weak decays for fermions as weil."'
This is equivalent to letting Nature pick out only one of the two chiral spinors, because the
function x has only two c~mponents.'~~
As for the conjugate wave-fiuiction, since,
Using these representations, Feynman and Gell-Mann were able to take advantage of the
attractive feature,
(1-ys) ( l + y 5 )-0
(28)
and so the o d y bilinear form constnicted out of two dinerent spinors tunis out to be,
(29)
(3 0)
This is similar to the expression proposed by Sudarshan and Marshak (1957), and also by Jun
IVLY,
JC-VY,Y&
J:=vY,#
Feynman and Gell-Mann proposed that the weak current is conserved just like the
f
electromagnetic current (another vector quantity). This came to be known as the conserved
vector curent (CVC) hypothesis, and it explained why the universality of the strengths of 6-
and p-decays appear to be the same Although the weak axial vector current is not exact&
conserved, it is almost. Two years later, Gell-Mann and L e proposed the partiaily
conserved axial current (PCAC) hypothesis, which became an important adjunct to the
peaks strongIy around 180 MeV, where the scattering is nearly pare I=3/2, &er which it
d r o p ~ . 'This
~ is a sure sign of the failme of perturbation theory, which cannot accomt for
such peaks.'" The Chicago group had noted that the distribution should be isotropie or
behave as 1+3cos% if an angular momentum &l/2 or 312 state dominates. Experiment
is a sharp peak in the cross-section, where the pion and nucleon join to form a short fived
resonance state. This was the first resonance discovered, and it came to be known as the 3-3
As higher pion energies became available at the Brookhaven Cosmotron, more and
more resonances were found. The full importance and the widespread nature of resonances,
however, only became clear in 1960 when Luis Alvarez and his team began their work with
separated R bearns in hydrogen bubble chambers exposed to the ~evatron.'" The first
resonance observed was the I=1 peak of A+r scattering, which became known as the
C(1 38S).lg5
The following year, in 1961, three meson resonances, the p, a, and 7,were
discovered in rapid succession, and many others were soon to follow.lU It took a few years
before physicists became cornfortable with the idea that thqe is no real Merence between a
resonance and an UIlSfabIe particle, but even before they did, theorists Wre Gell-Mann and
Ne'eman were ready to embrace the newly discovered states with clever systems of
cla~sification,'~~
4. Concluding remarks
Pauli proposed the neutrino as a smultaneous solution to the problems of both the
continuous 0-ray spectnim, and the spin-statistcs of nuclei. Fermi then incorporated the
neutrino hypothesis, and the p-spin associated with Heisenberg's exchange symmetry, into his
theory of B-decay. This was a quantum field theory, and although it gave good results for 8decay, as intended, it could not explain the intense attraction between the neutron and proton.
Combining the ideas of Heisenberg and Fermi, Yukawa poshilated a new particle which could
explain both p-decay and exchange symmetry through its interaction with both heavy and
Iight particles.
M e r charge independence was proven, a cosmic ray partide was found which
appeared to have the characteristics required of Yukawa's meson. Meson field theory was
appealing, and Kemmer postulated that the meson codd be a vector field, instead of a scalar.
The meson did not behave as expected in the lab, however, and in 1947, it was determined
that there were actudly two mesons, the n and the p, with decay coupling constants of smilar
magnitude. Meson field theory struggled, however, and isotopic spin served as a guide for
experiment.
Unstable V-cles
combination of properties as they were produced in abundance, but they decayed rehtively
slowly. Pais dserentiated between strong interactions, responsible for KparticIe formation,
and weak interactions responsible for thek decay. He suggested a new selection rule, the
even-odd d e , which came to be known as associated production. Gell-Mann (and Nishijima)
generalized the d e to a conservation Law of strangeness which was obeyed by the strong and
electromagnetk interactions, but was violated by the weak interactions. The T-8 puzzle
resulted in violation of both parity and charge conjugation invariance, which in tum led to a
clarification of the weak interactions, Soon, accelerators used in conjmction with bubble
chambers were the source of many new particfes. Theoretical physics was ready for the rise
of group theory.
2.
"It is generally supposed t h a t the nucleus of a heavy elernent
consists mainly of ar-particles with an admixture of a few free
protons and electrons, but the exact division between these
constituents is unknown." [Rutherford (19321, p. 737.1
3.
This was first pointed out by Heitler and Rerzberg (1929).
has a problem with
Shortly thereafter, it was also noticed that
its statistics4.
6This was not the first time that Bohr had doubted the
conservation of energy. In 1 9 2 4 , together with Kramers and Slater,
Bohr had postulated that energy conservation shouid only hold
statistically in quantum transitions. [Bohr, Kramers, and Slater
(1924)1 Early in 1925, however, Compton and Simon performed an
experiment in which in which they scattered X-rays from electrons
within a cloud chamber, and their results showed t h a t energy and
momentum conservation do hold true in individual events . [Compton
and Simon (1925)1 The following year, Dirac showed t h a t his
theory
of
relativistic
quantum mechanics
implies
strict
conservation of energy and momentum, and doubts about energy
conservation disappeared, until resurfacing again in the context of
B-decay.
In November 1 9 2 9 , however, Oskar Klein derived an
apparently disquieting consequence of the Dirac equation: a slow
electron can pass through a steep potential wall higher than 2 mc'
and emerge with a negative energy. Although this result, known as
the Klein paradox, was to become harmless in the positron theory,
Bohr was impressed with it, and he wrote to Dirac, asking whether
he thought that, .
t h e difficulties i n relativistic quantum
mechanics might perhaps be connected with the apparently
fundamental difficulties as regards conservation of energy in P-ray
disintegrations and the interior of stars-" Dirac replied that he
preferred to keep rigorous conservation of energy at al1
costs, [Pais (1986), pp. 312-131
8.
9. Pais (1986), p - 317, The next day, 17 June 1931, t h e news made
t h e New Y o r k Times, "A new inhabitant 015 the heart of the atom wa
introduced t o the world of physics today when Dr- W - P a u l i of the
I n s t i t u t e of Technology i n Z i i r i c h , witzerland, postulated the
13 -
14.
15.
17-
18.
See
o r b i t interaction.
20.
There are three other terms in the Hamiltonian:
the
attractive energy between neutrons, the Coulomb repulsion between
protons, and the mass-defects of t h e neutrons-
t o be +i
B r i n k (1965), p . 18.
24.
Heisenberg (1932a), p. 6 .
25 .
29 30
540
31.
At this stage. Fermi is calling the emitted particle a
neutrino. The theory of antimatter was still up in the air, and
Fermi w a s still using Dirac's hole theory. In J u l y of 1934, Wolfe
and Uhlenbeck gave the following notation for P-decay,
NiP+e'+n
Fermi (1934a). p . 2.
34. Apparently, Fermi had some difficulty with the Dirac-JordanKlein method of second quantization of fields, at first, but he
eventually mastered the technique, and put it to good use,[Fermi
(1962), Volume 1, p. 5391
35.
Fermi uses Q* for the transpose conjugate- Today, we would
mite this instead as Q t c
36.
Fermi (1934a), p. 6 ,
37.
Fermi (1934a), p - 7. It was later shown by Wigner that the
possible relativistically invariant interactions bilinear in the
electron and neutrino wave functions (and not containing the
gradients of these functions) are not restricted to vector
transformations. There are five types : calar, pseudoscalar,
vector, axial vector, and tensor, plus linear combination of these
ive types. We shall discuss this in more detail later.
38.
46 -
Y u k a w a (l93S), p.
47-
48.
49.
Y u k a w a (1935), p.
51-
50
h d e theory.
54.
54Note that there was not yet any direct experirnental evidence
of neutron-neutron scattering.
55 -
56.
Cassen and Condon note that this formalism w a s also used by
Fermi, and by Konopinski and Uhlenbeck (1935 .
57,
60.
These types of exchange interactions are discussed in the
review article by Bethe and Bacher (19361 .
61.
62,
63 . Wigaer does not use our modern notation SU(2) and 0 (3) , but he
does refer explicitly t o the two-dimensional u n i t a r y group (while
restricting himself to unimodular transformations), and to the
three-dimensional r o t a t i o n group-
Bhabha thinks that, with a neutral Ual1 known-particles w o u l d fa11 into three groups w i t h
masses of the order 4, Mu, and me, with positive, neutral, and
negative particles in each group75.
particle,
76.
Kemmer (1937)
77. Kemmer (19371, p . 908. Kemmer says that he got this idea for
the exchange of electron-positron and neutrino-antineutrino pairs
from his teachex, Gregor Wentzel, but that initially, Wentzel did
not think this would lead to charge independence. [Kemmer (1982), p
376.1
78.
Decay of the negatively charged Yukawa particle into an
electron plus neutrino could be inferred from Y u k a w a ' s o r i g i n a l
paper. T h e first to mention that Yukawa's mechanism would lead to
the decay of the positive Y u k a w a p a r t i c l e t o a positron p l u s a
neutrino w a s Bhabha (1938)79. Y u k a w a described his U-field by a one-component complex field
which was supposed to be the f o u r t h component of a vector field, i n
analogy to the electrodynamic scalar potential.
This is not
completely identical to a Pauli-Weisskopf (1934) scalar, which a l s o
Kemmer (1938b) -
81,
Kemmer (1938~)
.
Kemmer (1938~)
, p. 355.
85.
The vector case corresponds to work of Proca (1936), and the
scalar case to the work of Pauli and Weisskopf (1934).
89.
90.
91
92.
95 .
96.
97,
In 1941, Christian M d l e r invented the collective name nucleon
for the proton and nucleon [Pais (1986), p. 4501
98.
99.
100.
Puppi (1948)-
104 -
6
Fermi (1952), pp. 933-34. F e r m i , together with Yang in 1949,
had questioned whether n-mesons were elementary particles. They
suggested that these particles m i g h t instead be composites formed
by the*association of a nucleon with an anti-nucleon. [Fermi and
Yang (1949)l We shall mention this iater.
107,
108,
Pais (1986), p. 485-6.
Analyzing reaction by isospin
methods was a program initiated by Watson and Brueckner in 1951109.
110. The mass range was very wide, not only because the identity
of the decay products was unknown, but because the masses of the
pion and muon, both likely candidates, were not at al1 wellestablished.
Remember that it was only a few months after the
discovery of n - p decay111.
117.
118,
122.
Note that following a suggestion of Oppenheimer made at a
Rochester conference, Pais generalizes t h e rule of conservation of
the number of nucleons ta include the heavier V - p a r t i c l e s -
vL0,and
KO as
V.O.
Gell-Mann (1953).
Gell-Mann (1953), p . 8.33 -
136-
Gell-Mann
Kemmer ( 1 9 3 8 ~ ) .
140.
Nishi j ima a c t u a l l y published it first , i n N i s h i j ima (1955),
and he was close t o it a l r e a d y i n Nishi jima C1954) . Gell-Mann
waited u n t i l 1 9 5 6 t o p u t t h i s i d e a i n t o p r i n t , although he t a l k e d
about it with his c o l l e a g u e s a t l e n g t h .
because the positron has the same mass as the electron, while the
proton is much heavier,
A new conservation law for heavier particles, including the
neutron, was formulated by E r n e s t Stckelberg (1938). Stckelberg
knew that neutrons interacted with protons, but he noted t h a t no
transformations of heavy particles to light particles, such as
electrons and muons, had been observed. Therefore, he proposed a
consenration law of a new quantity which he referred to as schwere
Ladung, or heavy charge.
independently, Eugene Wigner (1949)
suggested that a conservation law for the number of heavy particles
could be responsible for the stability of the proton, just as, he
claimed, the conservation law for charges was responsible for the
stability of the electron. Wigner (1952) treats the conservation
law for hoavy particles on a par with the conservation law for
electric charges, assuming that the two conservation l a w s had
similar causes, and thus similar consequences142.
144.
145.
Lande, et al . (1956).
6
Dalitz (1953), He later published the analysis, including
more data, in Dalitz (1954).
Yang (1957), p.
149.
Lee and Yang (1956a, 1957) Aithough Lee and Yang got the
credit (and the Nobel prize) for the idea of parity-nonconservation
in K-decay, Pais thinks that it might have occurred to several
people independently. He rernembers that the issue was raised, more
as a logical possibility than a favourable option, at least as
early as 1954, at a Kyoto conference. According to Pais, Brueckner
said that, "If one assigns spin higher than zero to these
particles-. t h e n the same particle can decay into two or three
pions without any violation of parity conservation. [Pais (1986),
p * 5251
150. Lee and Yang (1956a). In a footnote at the end of the paper.
they also noted that charge-conjugation invariance had no
experimental proof either .
151 -
153.
155.
156.
157.
Landau (19571, p. 3 3 6 .
158.
159.
CPT-invariance is the minimal sufficient ground for t h e
correspondence of particles to anti-particles, and for the
equalities of masses and lifetimes for these pairs of o b j e c t s .
Pauli (1955), pp. 30-51 was the first to cal1 attention to CPT, but
it was also discussed by Lee, Oehme and Yang (1957), Lders (1957).
and Jost (1957). Jost gave the most general proof, based upon the
a x i o r n a t i c formulation of q u a n t u m field theory.
See Sakurai
(1964), Chapter 6, for a good discussion.
160 -
Salam (1957).
161
164.
165.
leave
the
parity
170 -
Michel (1950),
171.
172. Konopinski and Mahrnoud did not use t h i texminology, and they
got the number assignrnents wrong, b u t the essential idea w a
t h e r e - [ G r i f f i t h s (1987) p . 261
173.
Note that lepton conservation i m p l i e s
neutrinoless double P-decay,
( A , Z b ( A , Z k Z ) +Se'
[Pais (1986), p - 530 - 1
the absence
of
170 These letters stand f o r scalar, vector, tensor, and axialv e c t o r - -In this context, P stands for pseudoscalar.
175-
177,
179.
See
180.
Sudarshan and Marshak (1957, L9S81 w e r e led to this
expression by chirality invariance. Sakurai (1958) was led to this
fo-m instead by rewiring invariance of t h e four-fermion
~amiltonianunder separate reversal of the sign of t h e mass of each
fermion in the ~ i r a cequation, but t h i s route turned out to be
equivalent. [Sudarshan and Marshak ( 1 9 5 8 1 , p . 18611
181- Gell-Mann and Lvy (1960)- See Adler and Dashen (19681,
Chapter 2
182- Ashkin, Blaser, Feiner, and Stern (19561 - The reason that it
took so long to confirm the resonance w a s that phase shift analysis
was not unique- This w a s s h o w by Yang, who was then a tudent of
Fermi. Kahn and Goidhaber (1989), p - 1051
184-
186
F i r s t , the p meson (I=J=l)
was observed as a peak w i t h mass
770 MeV and width 150 MeV in the T+T- distribution produced in the
reaction -rr+&S.rr+N[Erwin et al. (1961)1
Then followed the w meson (I=0, &=l) with mass 783 MeV, width 10
MeV, first seen as a 37r peak in p p 5 n . [Maglic et al. (1961)1
Finally, there was the 7 (l=J=O)with mass 549 MeV and width 1 kev,
which was a 37r peak i n n++d+p+p+lr++~~+a-[ P e v s n e r et a l . (1961)1
187- Particles with e x t r e m e l y short lifetirnes can be identified
with resonances as measuwed through the uncertainty relation
AEAt-hThe energy uncertainty is reflected in the width of the
resonance, usually 10 to 200 MeV, which implies lifetimes of the
order of 1 0 -S~- ~ The term resonance is applied when the produced
state decays strongly as in the p or K'- Many of these, like the
K r are identified as higher-energy, or excited states, of
previously known particles. In contrast, states like the A , which
decay weakly, are termed particles - In the early sixties, Geoffrey
Chew criticized this distinction as artificial, and he proposed
that al1 particles and resonance be put on an equal footing. This
view has survived to the present day. [Cahn and Goldhaber (1989),
pp. 107-91
Chen Ning Yang and Robert MilIs were very impressed with the relation between
conservation of charge and phase invariance, and in 1954, they proposed that the conservation
of isotopic spin shodd also be related to an invariance law.' Accordingly, they attempted to
generaIize the concept of gauge invariance to the case of the strong mteractions,
We have tried to generalize ibis concept of gauge invariance to apply to isotopic spin
conservation. It tums out that a very natutal generalization is possible. The field that
plays ILe role of the electromagnetic field is here a vector field that satisfes a
nodinea.equation even in the absence of other fields?
The field must be nonlinear, because unlike the electromagnetic field which carries no electrc
charge, the field in question carries with it an isotopic spin, and so it acts as a source for
In June 1954, Yang and Mills presented their full theory.' They formulated a
principle of isotopic gauge invariance, associated with the existence of a b field, just as phase
invariance is associated with the electromagnetic field. For Yang and MUS, the conservation
of isotopic spin is identical with the requirement of invariance of ail interactions under
equations-they are requiring aii interactions to be invariant under independent rotations of the
isotopic spin at al1 space time-points, which impIies that the relative orientation of the isotopic
spin at two space-time points becomes irnmeasucable, giving a local gauge invariance. This
gauge invariance is imposed by the introduction of the b field. The b field satisfies nonlinear
differentid equations, and its quanta are particles with spin 1, isotopic spin 1, and electnc
charge k e or zeroYang and Mills were inspired by the gauge invariance present in quazltum
electrodynamics, as is clear nom their reference to Pauli's well-laiown 1941 paper in Reviews
of Modern Physics, 'Relaavistic field theores of elementary particles".s In this paper, Pauli
pays ample attention to gauge invariance. He shows how the electromagnetic field can be
introduced into the Lagrangian by making the replacement,
where
is the charge in unis (tic)"? For Yang and Mills, this was a well-known feahne of
It is supererogatory to observe that the photon was not discovered by requiring local
gauge invariance. Rather, gauge transformations were discovered as a useN property
of Maxwell's equations?
It was in his 1941 paper that Pauli introduced the concepts of gauge transformations of the
first and second End, which correspond to global and local gauge invariance. Pauli noted
that, "... the potentids [of the electromagnetic field] undergo gauge transformations of the
second type.'"'
In an entirely similar manner, we introduce a B field in the case of the isotopic gauge
transformation to comteract the dependence of S [the isotopic spin rotation] on x, y, z,
and t. It will be seen that this natural generalization d o w s for very little arbitrariness.
The field equations satisfied by the twelve independent components of the B fie14
which we shalI cal1 the b field, and their interaction with any field having an isotopic
spin are essentially fixed, in much the same way that the fiee electromagnetic field and
its interaction with charged fields are essentially determined by the requinment of
gauge invarian~e.~
'
Yang and Mills consider a two-component wave fimction
JI-w
where S is a 2x2 unitary matrix with deteminant 1. This is the group
(4)
su(2)-I2To maintain
invariance of the Lagrangian, they require that aIf derivatives of >G appear in the following
combination,
( 3 , - i o B & J,
where the B, are 2x2 matrices. After a little algebra, they fhd the isotopic gauge
transformation is,
This is tncky. The Iast term of this equation is a linear combinaion of the isotopic spin
(5)
matrices T, so the B, field must itself contain a linear combination of the these matrices c* It
tums out that the relevant part of the B, fieId takes the f ~ r m , ' ~
BK-2b; T
(7)
To obtain the interaction between any field $ of arbitrary isotopic spin with the b field, Yang
and MiUs replace the denvatives of
+ with,
@,-2i~b,-T)~i
(8)
which give,
The field equations for the b field can then be derived fkom the total Lagangian density,I4
where
ab, ab,
f,Lv-
a ; ~-q
; -2 ~b,xb,
The next step is to quantize the b field. Yang and MUS follow the canonical method
of quantization, and once they are done, the quanta clearty have spin 1, and isotopic spin 1.''
From the conservation of electric charge, it is also clear that the b quanta come in three
charge States, ie, and zero. There is a problem, however, with the mass of the b quantum,
We next come to the question of the mass of the b quantum, to which we do not have
a satisfactory ansver. One may argue that without a nucleon field, the Lagrangian
would contain no quantity of the dimension of a mas, and therefore the mass of the b
quantum in such a case is zero. This argument is, however, nibject to the criticism
that, Iike al1 field theories, the b field is beset with divergences, and dimensional
arguments are not satisfactory
In electrodynamics, by the requirement of electric
charge conservation, it is argued that the mass of the photon vanishes. Corresponding
arguments in the b field case do not exist, even though the conservation of isotopic
spin still holds. We have therefore not been able to conclude anything about the mass
of the b quantum.16
....
Obviously, this is a very serious p r o b h . Local gauge invariance requires that the m a s of
the gauge potential field be identicdy zero for any gauge theory.17 Therefore, if there were
any mass term for the b field in the Yang-Mills Lagrangian, it would spoil the gauge
invariance, yet on the other hand, if the b field were of zero mass, then it would be unabIe to
reproduce the observed short range of the nuclear force.''
view, Yang and Mills remark that it would be inconsistent for the b field quantum to have
any mass less than that of the pion anyway, because if it did, it wodd be expected to be
created abundantiy at high energies, meaning that the charged b quanta should be
This zero-mass problem of the theory was raised quite forcefully by Pauli during
Pauli asked, "What is the mass of this field B,?" 1 said we did not know. Then 1
resumed my presentation, but soon Pauli asked the same question again. 1said
somerhing to the effect that that was a very complicated problem, we had worked on it
and had come to no definite conclusions. 1 d
lremember his repartee: "That is not
sdficient excuse." 1 was so taken aback that 1 decided, after a few moments
hesitation, to sit dom. There was general embarrassment. Finally Oppenheimer said,
"We should let Fang] proceed." 1 then resumed, and Pauli did not ask any questions
during the seminar?
Afier the presentation, Pauli left a note for Yang, and the two of them met later. Pauli
explained that his reaction had been so sharp because he had already been workuig on simila.
Iines, and he had been fniseated by this mass difficulty for over six months.
Pauli had first raised the idea of applying the principle of local gauge invariance to
the concept of isospin at the June 1953 Lorentz-Kamerlingh Onnes conference in Leiden.
During the question period following Pais' paper, "Isotopic spin and mass quantizati~n'~,
Pauli
commented,
1 am very much. in favour of the general principle to bring empirical conservation Iaws
and invariance properies in connection with mathematical groups of transformations of
the Iaws of nature- If besides the consemation of energy-momentum and of charge,
the conservation of the property defhed as number of nucleons and of the charge
independence of the nuclear forces are weli estabfished, they have indeed, as Pais tred
now to express mathematically, also to be connected with group theoretical properties
of the law of nature.... 1 would like to ask in this connection whether the
transformation group with constant phases can be amplified in such a way that the
meson-nucleon interaction is connected with the amplined group. The main problem
hereby seems to me the proper incorporation of the c o u p h g constant into the
group?
Pais answered that the possibility of an amplified gauge group had been on his mind as well,
but that so far, he had not yet achieved any results. According to the conference proceedings,
Wouthuyzen then brought up the fact 9 a t years earlier, he had seen an attempted unincation
This comment must have stirred somethhg within Pauli, because a month after the
conference, he sent Pais a rough manuscript dong these lines, entitied, "Meson-nucleon
interaction and dserential geometry" that begins, "Wn'tten d o m July 21 till25 [1953] to sec
how it is looking."') In this paper, Pauli used a Riemannian geometry similar to that which
Kaluza (1921) and then Oskar Klein (1926), but whereas Kaluza and Klein
interactions, and he interpreted the mathematicaI structure differently than Kaluza and Klein
had. Whereas they had identified the electromagnetic potential with components of the metric
tensor, Pauli identified it M e a d with the Christoffel connection, As for the field strengths of
the meson-nucleon interactions, PauIi associated them with components of the curvature
tensor."
In this long letter to Pais, Pauli noted that local isospin gauge invariance demands the
introduction of a triplet of vector particles, and he managed to fkd as his "mainresult", the
correct expression for the corresponding field strengths. He did not, however, give the
associated dynaDaicd field equations? In December 1953, stl a few months before Yang's
talk, Pauli became frustraated with his inability to solve the zero-mass problem, and he wrote a
If one tries to formulate field equatiom... one wiU always obtain vector mesom wirh
rest mass zero. One could try to get other meson fields-pseudoscalars with positive
rest mass... but I feel that is too artifi~ial.~~
Pauli's enthusiasm was wanig, and he never published his r e s u l t ~ . ~
Many years earlier, in 1938, Oskar Klein had -en
also applied the Kaluza-Klein theory to nuclear interactions, entitled, "On the theory of
charged fields." Klein had presented his work at the Conference on New Theones in Physics
in Poland, where it lay in obscurity mtil 1981, when it was finaily noticed by Cecilia
163
Klein's theory was developed in the fnime of the five-dimensional approach, and it
attempted a unification of the gravitationd, electromagnetic, and nucIear interactions. Klein
wrote this paper only three years afier Yukawa's suggestion that nuclear forces might be
mediated by massive mesons, whkh were d l referred to as heavy electrons or mesotons.
Proceeding 60m the assumption that the nuclear interactions are mediated by vector particles,
Klein extended his 1926 work on quantum theory and ive-dimensional relativity to take the
meson-nucleon interaction into account As before, he related periodicity in the nfth
coordlnate x0 to Planck's constant, but this time, not all of the fields were independent of the
fi& co~rdinate.'~Although neutral particles were still independent of xO, charged particles
were not,
...just as the field quantities contain parts depending upon x0 representing charged
fields, and parts without x0 representing the ordinary electromagnetic and gravitational
fields, we shdl assume that also the spinors con& of xO-f.keecomponents representing
neutral particles (neutron, neutrino) and of x0 components representing charged
particles (proton, electron)?
Whereas n 1954, Yang and Mills were attempting ta describe the strong force, back in 1938,
the distinction between strong and weak interactions was not yet clear, and so KIein does not
distinguish between the two types of nuclear interaction. His Lagrangian includes interactions
between the heavy particles, the proton and neutron, and between heavy and light particles,
The Lagrangian L0may belong either to the pair neutron-proton or the pair neutrinoelectron.... it is worth while to notice that the complete Lagrangian will imply an
164
interaction of heavy and light spinor particles not only through the intermediate of the
electromagnetic field, but also through the B-field, an interaction which wiIl entail the
occurrence of /3-processes, the probability of which may be calcdated on the basis of
the theory developed in this report?
Klein placed the two heavy particles in one isotopic spin doublet, and the two light particles
in the other?
Mediating between either two heavy doublets, or a heavy doublet and a Iight
doublet, were the A field, representing the eIectromagnetic potentiai, and the B and B fields,
representing positively and negatively charged mesons, respectively. Note that there is no
neutral meson in the theory, although in response to a question fiom &ler d e r the
presentation, Klein showed that an uncharged meson can be easily incorporated into the
formalim~'~Ail AUee of the mediating particles, (or four if the neutral meson is included),
enter the equations within the same set of matrices, and consequently, ali of the interactions
share a single couphg constant: both the nuclear interactions and the electromagnetic
As to the rest mass of the new particle which does not appear in the ordinary field
equations, it might be introduced by the addition of a term to the Lagrangian without
disturbing the invariance. But it is not impossible that a M e r development of the
theory will make this somewhat arbibary addition superfluous, the mass appearing as
some sort of self energy determined by the other lengths entering in the theory?
Note that Klein says that the introduction of the mass term does not diturb the invariance; he
is clearfy referring to elech.omagnetic gauge invariance. The addition of a mass term
unquestionably breaks the SU(2) gauge invariance inherent up to this point, but since Klein
was nof demanding SU(2) invariance, he was not concemed with its violatiodS At the
conference, Klein's work did not generate a great deal of dixussion, and his ideas were not
165
fully appreciated. Since the paper was never published in a reguiar journal, and perhaps
because the Second World War broke out soon afterward, the link that Klein had estabiished
between gauge theory and nuclear interactions went Iargely unnbticed for over f
was left to
i years. It
Shortly after the discovery of Yang and Milis, however, RonaId Shaw reached a
similar renilt by a different route?
when he developed an SU(2) gauge theory in early 1954, but he did not subrnit his
the extended Lorentz group and their applications to particle physics, and Part II contains
three chapters dealing with the interactions of particle physics. It is in the final chapter that
Shaw's SU(2) construction appears, but a foobote indicates that this section was nnished in
The work described in this chapter was completed except for its extension [to the fourdimensional case] in Section 3, in Ianuary 1954, but was not pubiished. In October
1954, Yang and Mdls adopted independently the same postdate and denved nmilar consequences."
In his derivation, Shaw ran into the usual problems conceming the mass of the B-field,
Zero mass would appear to be d e d out since otherwise the neutron would have a
rapid decay N+P+B-. In fact, one would have expected the B-particles to have been
observed uniess their mass was quite large, or unless the coupling constant q was very
~ma11,3~
Shaw daims that initally, he thought that the B particles were massiess, and so he put the
idea aside since it clearly did not match up to observation.39 He showed it to his supervisor
later that year,
1 showed it to Salam but 1 do not think that he much appreciated it at the tirne. 1 do
not at dl blame him for this-1 probably told him about it in a very dismissive way,
since the relevant particles, surely PO, I thought, did not exist Much later ... he toId
me that Yang and MiIls had had the same idea and told me to write mine up (which 1
did not)?'
Shaw does not blame Salam for the oversight, and he maintains that SaIam fiequently
announced the contribution at conferences. In fact, in his 1979 Nobel Prize talk, Salam
referred to non-Abelian gauge structure as the Yang-Mius-Shaw theory?
Utiyama in Japan. In early 1954, Utiyama was working on the tetrad formulation of
gravitation in preparation for a visit to Princeton, when he noticed that gravitation and
electromagnetism had a mathematical property in common, the connecrion."
He considered
that dus might be a common feature of al1 fundamental interactions at some level, and so he
constructed the gauge theory which applied to dl Lie groups. When he arrived in Princeton
in April, however, he read a preprint of the Yang-MiIls theory, and he was stunned by the
Six months later, Utiyama retumed to Yang's paper, which had since been published in
October 1954." When he read it, Utiyama reaiized that Yang and Milis had found only an
example of non-Abelian gauge theory, whereas he had developed a general idea of gauge-
Yang-Mills theory as an example of his own. He included this section, he said, partly
because Yang and MUS had aIready publisbed, and partly because he thought that it wodd
not be fair to them if he did not refer to their paper after he had read it4'
interaction". In the introduction, he starts with a Lagtangian whch is invariant under constant
phase transformations, and he then replaces this transformation with a gauge transformation
with a varying phase fictnr Ee carniders the geneat case,
Let us consider a system of fields @(x), which is invariant under some transformation
group G depending on parameters ,,a
...,+. Suppose that the aforementioned
parameter-group G is replaced by a wider group G', deriveci by replacing the
parameters E'S by a set of arbitrary fiinctions c(x)'s, and that the system considered is
invariant under the wider group Gr."
Utiyama is interested in what kind of field needs to be introduced to maintain the wider
invariance of the local symmetry group, how this field transforms, and whether its interaction
with the original field can be detennined uniquely. He goes on to apply his method to derive
Utiyama derives it for gauge theory in general, fiom symmetry arguments, and he does not
168
In the usual textbooks of general relatvity. the covariant derivative of any tensor is
introduced by using the concept of paralle1 displacement On the other han& we shalI
see ... that the covariant derivative of any tensor or spinor can be derived fkom the
postdate of invariance under the "generaiized Lorentz transformations" denved nom
the six panuneters of the usual Lorentz group with a set of six arbitracy functions of x.
In deriving such covariant derivatives, it is unnecessary to use explicitly the notion of
pardiel displacement"
Utiyama's originai contribution was thus his work on gauge theory in general, and its
application to gra~ity-*~
Yang was not aware of this mathematicai ~imiIarity:~
--.when MUS and 1worked on nomAbefian gauge felds, our motivation was
completely divorced fiom general relativity, and we did not apprecizte that gauge
fields and general relativity are somehow related Only in the late 1960s did 1
recognize the structurai similarity mathematicdy of non-Abelian gauge fields with
generai relativity.. ."
In his 1956 paper, Utiyama also expressed an interest in applying his method to the
interactions between mesons and nucleons as soon as the appropriate transformation group
was found. When this actually happened in the 1960s, Utiyama must have been as pieased as
anybody, even though he aiways regretted not having published his work sooner.
3. Concluding remarks
Impressed by the conservation of isotopic spin, Yang and MilIs extended globd SU(2)
to local SU(2) by forcing the Lagrangian to remain invariant through the introduction of a b
field. Although local gauge invariance was an obvious symmetry of
took the bold step of imposing it upon an internai symmetry group. The idea looked
promising, but there was a problem. Local gauge invariance of electromagnetism impiies that
the photon has
zero mass, and similady, local gauge invariance of isotopic spin implies that
169
the b field shouid also have zero m a s The nature of the strong interaction,however, impiies
that the b quanta shodd be heavy, to account for the shoa range of the interaction. Various
strategies wodd soon be adopted to deal with this problem, as we shaU see, yet it took years
before it was discovered that the gauge theory discovered by Yang and M a s is suitable not
only for the strong interaction, but aiso for the weak interaction.
I.
Y a n g and Mills (1954a)This is the first of two papers
published in 1954 by Yang and Mills . It is only one page long, and
it began as an abstract for the A p r i l 1954 meeting of the American
Physical Society in Washington- According to Yang, however, most
of the work had been already completed by February 1954 - [Yang
(19831, p . 191
2.
3.
7,
The replacement of the derivative which Pauli outlined
appeared, among other places, in Weyl (1931a), p. 100. In 1953, a
year before the Yang-Mills paper, J u l i a n Schwinger published the
second paper of his series entitled "The theory of the quantized
fieldN- Schwinger spoke of the gauge nature of the electromagnetic
i e l d as a generally known interpxetation, "The postulate of
general
[that is,
local]
gauge invariance motivates
the
introduction of the electromagnetic fieldIf a l 1 f i e l d s and
sources are subjected to the general gauge transformation
Ix=exp ( - iX ( x ) E 1 x
the Lagrange function we have been considering alters in the
following marner:
I P = ~ ?j,a,h
+
The addition of the electromagnetic field Lagrange function . - .
provides a compensating quantity through the associated gauge
transformation
'A,=A,-a,A.
"
(1994)
for
the
development
of
quantum
&4,W E I i (aa/axk)
This distinction is manifested through the fact that only
expressions which are bilinear in U and d are associated with
measurable quantities even when the associated field is
-physically
qyantized-accordhgto the Bose tatistics. . . . it follow that, in
principle, only gauge invariant quantities can be obtained by
direct measurement [Pauli (1941), p - 2071
11- Yang and Miils (1954b1,p. 192.
12. Although group theory played little part in the training of
most physicists working in particle physics at the tirne, Yang was
an exception. His B .Sc. thesis, completed in China, was titled,
"Group theory and rnolecular spectranJPickering (198 4 ) , p. 1611
13 - Yang and Mill (1954b), p . 193 - Note that the nine components
of b,, for p=1,2,3 are real, while the three components of b, are
pure imaginary.
14. Note that this Lagrangian includes terms representing the
self-couplings of three and four b quanta,
15.
16.
17. This was shown for quantum electrodynamics by Pauli in 1941He showed that, V - . the rest mass of the photons must be exactly
zero for [gauge] transformations of the second type," and, " . .- an
assumption to the effect that the photons have a very s m a l l but
finite rest mass seems to be physically unsatisfactory. [Pauli
(1941),p. 207, 2191 Note that the QED Lagrangian has no mass term
for the electromagnetic field, and being based upon a
generalization of QED, the Yang-Mills Lagrangian does not have a
mass term for the b field. Of course, both Lagrangians have mass
tenns for the spinor fields, but these do not ruin gauge
invariance.
18As Wick (1938) showed, because of the uncertainty prin.ciplel
range is inversely proportional to mass . If the b field had a mass
zero, like the photon, it would i m p l y that the strong force should
have an infinite range, j u s t like electromagnetism-
19.
20.
22.
P a i s (19531, p. 8.87-
23.
P a i s (19861, pp. 5 8 4 - 8 5 A copy of - t h e l e t t e r appears in
OrRaifeartaigh (1997), pp, 171-75.
24.
OrRaifeartaigh (1997). p - 167, Pauli relates the gaugepotentials of the meson-nucleon interaction t o the curvature tensor
through a procedure now known as dimensional reduction.
Dimensional reduction is a technique by which higher-dimensional
systems can be reduced to lower-dimensional ones.
ft is very
powerful in the context of gauge theory because it converts
coordinate transformations in the full space, six dimensions in
this case, into gauge transformations in the space-tirne subspaceThis allows certain components of the Riemann tensor, along with
the covariant derivative, to be interpreted as components of
curvature in the full space and as field strengtns in spacetime.[OrRaifeartaigh (1997), pp. 50-52, 167-1691 In xecent years,
there has been a revival in interest of dimensional reduction in
the wake of the advent of string theory, which has suggested that
the use of 10- or 26- dimensional spaces might lead to a unified
theory of interactions, or a quantum theory of gravity. See, for
example, Appelquist , Chodos, and Freund (1987).
25.
28.
Jarlskog (1981).
Jarlskog notes in a reference that she
herself brought up Klein's 1 9 3 8 earlier in a 1979 conference.
Klein's paper was reprinted in 1989, again in the proceedings of a
Polish conference29-
30-
K l e i n (19381, p . x i x .
31.
32.
K l e i n (1938), p. xx. Recall that Kemmer (1937) also placed
the electron and neutrino in an isotopic spin doublet,
33.
34.
35.
36.
37,
38,
41-
Salam ( 1 9 8 0 ) ~
p. 528.
46 .
47.
50.
If Yang had been a w a r e of the link between his theory and
gravity, he could have discussed it with Weyl, w h o spent the last
six years of his life at Princeton- Unfortunately, t h e y never
spoke about it, "1 had m e t Weyl in 1949 when 1 w e n t to t h e
Institute f o r Advanced Study in Princeton as a young 'member'
1
saw hirn r o m time to time in the n e x t years 1949-55. He was very
approachable, but 1 don't remember having discussed physics or
mathematics with him at any time. His continucd interest in t h e
idea of gauge fields was not known among the physicists. Neither
Oppenheimer nor Pauli ever mentioned it. 1 suspect they also did
not t e l l Weyl of t h e 1954 papers of MiIlsr and mine. Had they done
that, or had Weyl somehow corne across Our paper, 1 imagine he would
have been pleased and excited, for we had put t o g e t h e r two things
that w e r e very close to his heart:
gauge invariance and nonAbelian Lie groups. " [Yang (1980). pp- 19-20 - 1
These diniculties can be overcome by representing the weak interactions, instead, as a partide
exchange. In order to provide the vector and axial-vector interactions, the exchanged
particles, often refemd to as intermediate vector bosons (IVBs), must be spin-l (vector)
mesons. Only two of these particles are necessary for the V-A theory, one positively charged,
and the other negatively charged Since the aesthetics of the Yang-Mlls theory were widely
recognized at the time, it was proposed that this theory might apply to the weak interaction as
well as the strong interaction, in which case the M3s of the weak interaction might be
identified as gauge particles. Furthemore, since gauge theory grew out of an extension of
quantum electrodynamics, it was tempting to try to relate the weak interaction to
electromagnetism.
Still in 1957, Julia.Schwinger presented his anbitious paper, "A theory of the
a family of three Z particles? This idea related the electromagnetic and weak interactions,
since dl three of the mediating particles were in the same family, but there were diniculties.
Ernpirically, it was well-known that the electromagnetic interactions were a few orders of
magnitude stronger than the weak. Not only that, but electromagnetic interactions have
infinite range and they conserve party, while weak interactions are of very short range and
they violate parity! To solve the difficulties with range, Schwinger proposed that the M 3 s
were very massive, while the photon was massless. This idea addressed the problem of
d i f f e ~ gcoupling strengths a s well, since at energies which were low compared to the masses
of the IVBs, the weak interactions would be highly suppressed relative to the electromagnetic
interactions. Even so, it was still puzzling how the massive IVBs could be seen as members
of the same farniIy as the massless photon.
A year later, Sidney Bludman offered a different interpretation. Bludman ignored the
electromagnetic interactions altogether, but he agreed that the IVBs of the weak interaction
could be precisely identifieci with the gauge particles of an SU(2) Yang-MiUs gauge theory.
He achieved this by postulathg that there were not only two M s , however, correspondhg to
raising and l o w e ~ g
operators in isotopic spin space which exchange charge, but that there
was also a thkd operator which wodd not exchange charge, namely, a neutral IVB. If this
neutrd mediator existed, it would be involved in weak nucleon-nucleon processes, weak
Thus, Bludman introduced a neutral current. Although he reasoned that aU three of the M3s,
the neutrai and the charged, should be massive, he was forced to introduce these masses into
the Lagrangian by band.' As expected, this posed a problem for renormalization.
In 1961, Sidney Glashow, a former student of Schwinger, was impressed by the idea
of a parallel between eiectromagnetic and weak interactions, but he thought that a triplet of
IVBs was not enough. To expiain the experimentally observed seledon d e s of iisoopic spin
and strangeness, he postulated the need for a second neutral intermediary, which he cailed
22
Thus, his mode1 incorporated the previous triplet of W s , (Z,, &, G), one negative, one
neutral, and one positive, plus another, a singlet 2,. By a clever choice of the mass temis in
the Lagrangian, Glashow allowed these two neutral particles to mix,
The reader may wonder what has been gained by the introduction of another neutral
rneson. Neither Z, nor 2, interacts with the electrical current so that neither interaction
may be identified with electromagnetism.... However, both the neutrd fields have the
same CP-property, so that linear combinations of the two fields may correspond to
"particles".'
The key was that although neither of the neutral mediators could be identified with
electromagnetism before mxing, both the photon and a neutral weak mediator could be
identified after
hg
Glashow
. gives the Lagrangian mass term,
(2)
descnbe spin-1 particles with masses MA and MB. By setting M f l , Glashow was able to
i d e n e A, with the massless electromagnetic field, and B, with a massive neutral particle.
The strength of the neutral current was not prescnbed, but Glashow reasoned that since
experiment seemed to indicate that strangeness-changing neutral currents should be either
strongly suppressed or absent, he would simply enforce this condition by asniming that the Z
was much heavier than the two charged IVBs which we shall refer to as W and
w".'
At the
currents, but all he did, in reality, was suppress al2 neutral currents. The mode1 had other
problems as well: although Glashow hoped that it would be renormalizable, it was not, and
furthennore, the model applied to leptons only. By a shrewd choice of lepton assignrnents to
group representations, however, Glashow showed how both the parity conservation of
electromagnetic interactions and the parity violation of the weak interactions could be
combined in a single d e d model of both. In the language of group theory, Glashow had
exploited the local gauge group SU(2)xU(I).
In 1960, Sakurai turneci the Yang-Mius theory back upon the strong interactions. He
was very impressed by the fact that local gauge transformations associated with isospin
conservation demanded a vector field, and in fact, he was puzzled that this idea had received
so little attention? Sakurai noticed that the strong interaction not only exhibits conservation
of isotopic spin, but it also exhibits conservation of hypercharge and baryon number-Io
Consequently, he offered an extension of the Yang-Mas tiieory, in which, in addition to the
original triplet of spin-l gauge particles coupled to isotopic spin, theTe were also two other
gauge p h c l e s , one coupled to strangeness, and one to baryon number." Even though the
hypercharge and isotopic spin currents are broken by the presence of the electromagnetic and
weak interactions, Sakurai felt that these were exact symmetries of the strong interaction,
... these three couplings, which are the only couplings deeply rooted in the exact
intemal symmetnes of strong interactions, are the on& "fbndamental"couplings of
strong interactions.'2
By identifj6ng these symmetries as exact, he recognized that he faced the zero-mass problem
of Yang and Mills, but he brushed it aside, hoping that later authors would solve it,
By this time an intelligent reader rnust have made' the following objection: The B
fields cannot be massive because the mass term m'~: in the Lagrangian certainly does
not satisfy your gauge principle. This is a valid objection, perhaps the mot serious
objection to our theory. We wodd like to believe that the mass terms do vanish for
the bare Lagrangian, and that the empirical mass ternis ... reflect, in a certain sense,
our failure of our present-day field the0ry... the author hopes that this publication may
prompt some ciever ideas-.. 13
Sakurai was forced to insert the masses of the five vector mesons into the Lagrangian by
hand. Although these five particles did not trrm out to be the gauge particles which Sakurai
had hoped, his theory still enjoyed some success, as it effectively predicted the properties of
five vector mesons, a triplet and the two single& aU of which were soon discovered in
resonances. They became known as the p, a,and 4.l4
In 1961, Gell-Mann and Glashow produced a catalogue of the various groups upon
which gauge-invariant field-theories could be based. They recognized that in the case of
Yang-Mills theory, this gauge invariance could only be only partial, however, since some of
the relevant currents are not conserved. The isotopic spin and hypercharge currents of the
strong interaction, for instance, are not conserved by the electromagnetic and weak
interactions, while the weak axial-vector current is not consemed by the electromagnetic and
strong interactions.15 In addition, they suggested that partial gauge invariance may apply to
either the strong or weak interactions, but not to both at once,
In general, the weak and strong gauge symmetries WU not be mutually compatible.
There will also be conflicts with the electromagnetic gauge symmetry, conflicts that
must be resolved in favour of electromagnetism, since its gauge invariance is exact
We have not attempted here to describe the three kinds of interaction together, but
only to speculate about what the symmetry of each one might look like in an ideal
limit where symmetry-breaking effects disappear?
Ln this paper, Glashow and Gell-Mann explored theories of vector bosons invariant under
conrinuous groups of coordinate-dependent Iinear trandonnations, and upon analysis of their
differing syrnmetry properties, they concluded that aiI these theories could be expressed with
Lie algebras." The various types of symmetry group included: the orthogonal groups O(n),
the speciai unitary groups SU(n), the symplectic groups Sp(2n), and the Lie groups
They reasoned that within generalized Yang-Miustheory, the field particles which couple to
interactions of higher order than the SU(2) group of isotopic spin. A candidate for this global
had already been suggested independentiy by Murray Gell-Mann and Yuval
the group
su(3).19
Gell-Mann was trying to relate the strong interactions to the concept of currents. This
approach had met with success in both quantum electrodynamics, where electromagnetic
interactions take place between electromagnetic currents, and in the V-A theory, where weak
interactions take place between weak currents. Application to the strong interactions was
tricky, however, because
was not at al1 clear how to constnict currents fkom hadrons.20 Sakurai had already
suggested that vector mesons be coupled to the isotopic spin current and hypercharge current,
and Gell-Mann extended this idea in 1961, by proposing a larger symmetcy group which
would incorporate both isospin and hypercharge into a symmetry of a new property which he
named unitary spin, or ~ - s p i d 'Aside nom any violation effects due to electromagnetism
and weak interactions, Gell-Mann treated aii vector mesons as completely invariant under Fspin. He viewed this as a generalization of Yang-Mills theory,
Now the vector mesons themselves carry F-spin and therefore contribute to the cunent
which is their source. The problem of comtruciing a nonlinear theory of this Iand has
been completely solved in the case of Yang and Mills and by Shaw. We have only to
generaiize their resuIt (for three vector mesons) to the case of F-spin and eight vector
meson3
In Gell-Mann's scheme, strong interactions conserve not only the three components of isospin
(Il,IJ3) and the hypercharge Y , but they also approximately conserve four more operators F,,
F,, F,, F,. Defining FI=&, F2=1-,F3=4 and ~ ~ 4 3 Y,
1 Gell-Mann
2
offered an dgebraic
system of eight operators F,..-&- Because there were eight operators, Gell-Mann named his
These operators are related to each other by the mathematical structure of the group
SU(3). Just as II, 4, and I, form the algebra of SU(2) such that the components of 1Satie
the commutation d e s ,
[lJ,]=i~&where i.j,k=l,..,3
(3
where e,, is totally antisymmetric in ij,k, the components of F-spin satisfy the commutation
[F,Fi]=i&&
where i j , k 1 , . . , 8
(4)
The form of these commutation d e s aliow the F-spin operators to be identined with the eight
Therefore this symmetry group incorporates isotopic spin and hypercharge, while it has new
operators F, ...F, which are chosen such that they obey the interaction rules 1 AI=%
1,l AY=l
In this way, SU(3) ties together strongly interacting particle multiplets with different values of
I and Y @ut the same spin and panty), within approximately mas-degenerate supermultiplets.
Although the SU(3) symmetry is not exact, it can be used to clas@ the hadrons,
We attempt ... to treat the eight known baryons as a super-multiplet, degenerate in the
limit of a certain symmetry but split into isotopic spin multiplets by a symmetrybreaking term.... The symmetry is called unitary symmetry and corresponds to the
"unitary group" in three dimensions in the same way that charge independence
corresponds to the "unitary group" in two dimensions. The eight innnitesimal
generators of the group form a simple Lie dgebra, just like the three cornponents of
isotopic spin. In this important sense, unitary symmetry is the simplest g e n e r b t i o n
of charge independen~e.~'
Just as the isotopic spin has a three-dimensional representation, the unitary spin group has an
perfect, a l l the baryons would have the same mass, which they clearly do not. Therefore, the
symmeetry is broken, and so the four components of uni-
the isospin 1and the hypercharge Y me conserved, and consequently, the SU(3) octet c m be
broken down into SU(2) multiplets: a S doublet, a C triplet, a A singlet, and an N doub~et?
Gell-Mann also assigned the pseudoscalar mesons to an octet, made up of the K doublet, the
K doubIet, the
.K
triplet, and a new SingIet with isotopic spin zero which Gell-Mann predicted
and named x. Ne'eman also predicted this same meson, which he called #', but later, it
came to be known as 7 -
The vector mesons are introduced in a very natural way, by an extension of the gauge
principle of Yang and MlIs- Here too we have a super-multiplet of eight mesons,
corresponduig to the representation 8. In the limit of unitary symmetry and with the
m a s of these vector rnesons "tunied offr, we have a completely gauge-invariant and
minimal tbeory, just like electromagneism. When the mass is tumed on, the gauge
invariance is reduced (the gauge h c t i o n may no longer be space-thne dependent) but
the conservation of unitary spin remains exact. The sources of the vector mesons are
the conserved currents of the eight components of the UIiltary spin?
m e n the symmetry is reduced, the octet of vector mesons breaks up into a triplet p coupled
to the dll-conserved isotopic spin current, a singlet o coupled to the still-conserved
K', -
and he predicted a ninth vector meson corresponding to a singlet cailed B", which evenhialIy
came to be known as 4.
exact, for instance, the proton and neutron only have a nemly identical mas, SU(3) shows
much greater deviations wbich show up as large mass ciifferences between isotopic spin
multiplets. These differences are not just a mail percentage, rather, they can be of
comparable size to the particle masses themselves. Gell-Mann reassured, however, that this
mas-splitting, even though it was large, exhibited a simple mathematid foxm. Gell-Mann
assumed that the baryon mass differences would transform like a component of the octet, and
he reasoned that since the isotopic spin and strangeness were conserveci, the mass ciifference
should t r d o r m like the eighth component of the F-spin. In this way, he derived the sum
de,29
and this result was quite well-satisfied. In 1962, Susumu Okubo found a generalization of
this formula to the lowest order of perturbation for the case of a general super-muitiplet,
which related the masses of different multiplets of isospin and hypercharge?O
This easily reduces to Gell-Mann's result for baryons. The Gell-Mann-Okubom a s formula,
as it came to be cailed also f i t the mass clifferences in the pseudovector meson octet very
well, and it fit the merences in the pseudoscalar meson octet decently?
For the spin 312 baryon decuplet, the formula reduced to an equd spacing mie. It
predicted that the m a s merences between A-E', C ' - a , and ll-Q- would be equal. In
1961, however, only the A had been observed. The e s t evidence for the existence of the CD
and E' was only reported at a CERN confierence in 1962, but the mass-spacuig confirmeci the
prediction. At this same conference, Gell-Mann made a bold prediction about the mass and
properties of the unseen il-?
O-, and its assignment in the SU(3) decuplet required it to carry three units of strangeness.
Mer the conference, bubble chamber physicists went home and started looking for the W.
Two years later, the Brookhaven goup found a particle wth a mass 1686k12 MeV, and a
lifetime characteristic of a weak decay, indicating the desired amount of strangene~s?~
After
this discovery, there was littie argument within the particle physics community that SU(3) was
the appropnate classification system for hadronsY
RecaIl, however, that neither Gell-Mann nor Ne'eman had been aimig at just a simple
classification of hadrons! Both were attempting, instead, to set up a quantum field theory of
the strong interactions derking from gauge invariance. As Gell-Mann remarked, "The most
attractive feature of the scheme is that it perrnits the description of eight vector mesons by a
unified theory of the Yang-Mills type (with a mass term).''35Ne'eman had sirniIar
sentiments, "The following treatment is an attempt to formulate a unified gauge, while
reducing the number of vector bosons. It does, indeed, generate a set of eight mediaing
fields..."j6
As Gell-Mann explained, he was led to this symmetry by a generabition of the
conserved vector current (CVC)hypothesis?
conserving (AY=O) portion of the vector weak current of mongly interacting particles is the
current of the isotopic spin. Gell-Mann generalized this by equating the entire vector weak
current of the strongly interacting partic1es with the curent of the F-spin. This curent has
two terms: the strangeness-preserving isotopic spin current with AY=O and [A.F=l[, and the
equal-time commutation relations of the F-spin, and no matter how badly the super-multiplets
of the Eightfold W a y are broken, the vector current octet stdi applies to the current of the Fspin." Thus, it was through his consideration of weak currents that Gell-Mann was led to
an SU(3) algebra which applied to the strong interactions. In fact, this symmetry c m be
applied to both the vector current and the axial vector current individualiy, since the unitary
spins cornmute with each other, and the combined symmetry is then SU(~)XSU(~)."
T'+~+V,
(7)
processes, and it suppresses the dS= 8-decay rates by tad8. Cabibbo's work also Mplied
that al1 1 &S'=ll @-processessais@ the d e 1 AT=% 1, and through the Gell-Mann Nishijima
relation? this gives the nile M/AQ=+l for &processes, which is in agreement with the resuit
octets, the underlying symmetry group would be SU(8) instead of SU(3). Since he wanted to
remain with the group SU(3), he decided to look for a group of particles corresponding to the
undamental representation of SU(3), namely, a triplet,
A unitary triplet t consists of an isotopic singlet s of electric charge z (in units of e)
and an isotopic doublet (u,d) with charges rtl and z respectively. The anti-triplet 7
has, of course, the opposite sign of the charges. Complete symmetry among the
members of the triplet gives the exact eigMoId way, while a mass Merence, for
example, between the isotopic doublet and singlet gives the first-order violatiod3
By assigning fiactional charges, ?h,-W, -% for u, d, and s respectively, dong with baryon
number %, Gell-Mann was able to build up baryons and mesons from the quarkxu Although
he used them in construction, however, Gell-Manndid not regard these quarks as physical
particles,
It is fun to speculate about the way quarks would behave if they were physical
particles of =te mass (instead of purely mathematid entities as they would be in the
Iimit of infinite mass). Since charge and baryon number are exactly consenred, one of
the quarks (presumably uH or d-%) would be absolutely stable... Ordinary matter near
the earth's surface wouid be contaminated by stable quarks as a r e d t of high energy
cosmic ray events throughout the earth's history, but the contamination is estimated to
be so small that it wodd never have been detected. A search for stable quarks-.would help to reassure us of the non-existence of reai quarks?
Although Gell-Mann deduced these fundamental particles fiom considerations of symmetry in
the context of curent dgebra, the quark concept rapidly proved to be have far-reaching
consequences.
Gell-Mann and Ne'eman were not the first not to introduce SU(3) into particle physics.
Back in 1956, Shoichi Sakata was looking for the meaning behind the GeU-Mann-Nishijima
188
rule?
Sakata drew a paraIlel with nuclear structure-just as nuclei are cornposed of neutrons
In our model, the new particles are considered to be composed of four kinds of
fndamental particles in the tme sense, that is, nucleon, antinucleon, A' and anti-A'. If
we assume that A" has such intrinsic propertes as were assigned by Nishijima and
Gell-Mann we can easily get their ...nile for the composite particies as the renilt of
the addition laws for the ordinary spin, the isotopic spin, and the ~trangeness-~'
Sakata's model therefore has six constitaents: the neutron, the proton, the A, and theh three
antiparticles. It is curous that although Sakata considers the pion as a combination of N and
s,he gives no credit to Fermi and Yang, who first had this idea in 1949"
Although Sakata
himself may not have realized that his work could be fhmed in tems of group theory, his
coileagues at the University of Nagoya did.
In 1959, Ikeda, Ogawa, and Ohnuki formulated Sakata's model in group-theoretical
They fomdated an SU(3) theory, in which the fiuidamental triplet consisted of the
Ikeda, Ogawa, and Ohnuki s~iccessfidiyplaced the three pions and four kaons in an
8-dimemional representation, experirnent soon showed that their baryon octet was in error.
The incorrect assignments of C and ican be blamed upon the fact that n, p, and A cannot
form a triplef because they possess different spins' Experiment nilcd in favour of Geil-
down quarks are in an isotopic spin doublet, while the strange quark is in a isotopic spin
singlet, although it aiso carries an eigenstate of a new quantum number caiied strangeness.
qq,while
baryons c m be treated as bound states of three quarks qqq. In this theory, a- is identified
with a d state, a proton is identified with a uud state, and a neutron is identified with a udd
state.
giving a singlet and an octet, and it aiso offers the baryon representations,
3x3~3=1+8+8+10
giving a sioglet, two octets, and a decuplet Additionally, the breakuig of the SU(3)
symmetry can be accommodated by postulathg a mass difference between the paired (u, d)
and the s quark.
In his second paper OF 1964, Gell-Mann presented an extension of the SU(3) group to
SU(6)?
arrange,
rnonths later, Grsey and Radicati pointed out that SU(6) could be viewed as an enlargement
of the SU(4) symmeetry which had been proposed by Wigner in 1937 in the realm of nuclear
physics,
... the group SU(4) introduced by Wigner to classi@ nuclear states can be extended to
the relativistic domain and it is, therefore, relevant for particle physics. We wili next
show that when strangeness is taken into account the group SU(4) becomes enlarged to
SU(@ which .., embodies SU(3) and the ordinary spin in the same way as Wigner's
SU(4) embodies isotopic spin and ordinary spin...''
Whereas Wigner had combined the symmetries of isotopic spin with spin to give
SU(2)xSU(2) within nuclear physics, the group SU(6) is relevant within the realm of the
strong interactions to the extent that the strong interactions are invariant under the unitary spin
group SU(3), dong with the ordinary spin group SU(2). Grsey and Radicati showed how
SU(6) offered a higher classification system for hadrons. Their papa included the meson
representations,
35=(8,3)+(8,l)+(i ,3)
SU(3)xSU(2)
which contains an octet of spin-zero mesons, and anouier octet of spin4 mesons, as expected.
It also offered the baryon representations,
56=C10*4)+(8-2)
SU(3)xSU(2)
which included the baryon octet with spin 112, and the baryon decuplet, with spin 312. This
was a clear advance over the pure SU(3) approach, because SU(3) cannot furnish any
Symbolically,
6=(3,2)
and the sk-dimensional representation contains an SU(3) triplet of spin doublets (which are
dl States of the same parity). This gives, for mesons,
6x6'=3 5+l
6~6~6=20tS6+70+70
There were many efforts to find theories simultaneously invariant under SU(6) and the
Lorentz group, but none of them met with s u c c e ~ s . ~According
~
to Dyson in 1966, "...no
physicdly interesting marriage of the Lorentz group with intemal syrnmetries is possible, and
[sol the failure of the relativstic SU(6) thories was mathematicdly inevtabIee"6' SU@) Ied
to a few decent results, but the most important consequence of static SU(6), historicaliy, is
that it pointed the way to a new dynamitai degree of fieedom, now known as colour.
SU(6) could ody be applied in cases where orbital angular momentum was effectiveLy
neglected (no spin-orbit couplings), and this implied that the thtee-quarkwave hction was
symmetrc in orbital variabIes- In fact, in the 56-representation, chosen for baryons, the
wave-bction is t o t d y symmetrc in all variabies. This was in apparent conflict with Fermi
statistics (the exclusion principle), which had k e n well-established for over three decades.
To remedy this problem, Oscar Greenberg suggested, in 1964, that quarks do not obey Fermi
statistics, but rather, hey obey a parastatistics in which a single state can be occupied by
three particles.
... we retuni to the question of pfacing three spin-%quarks in S -tes [I=O] in the
baryon 56. This can be done ifthe quarks are pardennions of order p=3...?
Greenberg then gave the rules for the creation and annihilation operators for his t h e ~ r y . ~ ~
Parastatistics was outside of the usual fermion-boson dichotomy, and its field-theoreticai
formulation was alien to many physicists. Its obscurity was soon overcome, however, with
the aid of a more transparent, but essentiaily equivaient, formulation offered by Han and
Na~nbu.~~
In early 1965, Han and Nambu also propoxd a solution which solved the natistics
problem. (At the same time, it aliowed them to funiish quarks with integral charges, which
they preferred to fiactionai charges.) They suggested that there was not one triplet of quarks,
but three, and that these three triplets would manifes a double SU(3) symmetry,
Thus, they replaced each member of the fundamental triplet of SU(3) flavour with a triplet of
another SU(3) group of a new quantum number c. This SU(3), group came to be laiown as
SU(3) c01our.~ The quark colours provided an extra degree of fkeedom which could tum an
othenvse symmetrc wave-hction into a symmetrk one, thus s a t i m g the Pauli principle,
which only applies to completeIy identical particles.
In 1966, Greenberg and Zwanziger noted that the parastatistic model and the
tricoloured triplet rnodel are equivalent, but only if the three triplets carry the same charges,
that is, if there are three (u, d, s) sets each with the old charges (Ys,-%,
-1h).
Greenberg and
Zwanziger also discussed the sauration of paaicle states. Interquark forces seemed to show a
saturation property-they appeared only in the bound states
6 and qqq.
The colour
hypothesis gave a natural explanation of this condition which had previously been imposed by
fiat. Al1 observed states were colour-neutral, or colo~~rless.6~
No problems arose in
assigning coloured quarks the charges which had been initially given to them by Gell-Mann
and Zweig, and in fact, when the idea of coloured quarks nnally came into vogue in the
1970s within the context of gauge theory, it was as fiactionally charged objects.
Gauge theory had fden upon hard tunes because of the zero mass-problem. if the
gauge bosons in Yang-Mills theory are massless, they cannot explain the short range of the
194
nuc1ea.r interactions, whiIe if they are massive, they niin the gauge invariance of the
Lagrangian. The answer to this problem came fiom an unexpected source. Although most of
Yoichiro Nambu's work was in high energy physics, he was also interested in solid state
physics. In 196 1, he published a paper together with a post-doctoral feiiow, Jona-Lasinio,
entitled, "A dynamical mode1 of elemeatary particles based upon an analogy with
hctional dependence of the phase is independent of any gauge fields. AIthough the
superconducting
invariant, although the gauge invariance is hidden-'O This Ied Nambu to the conclusion that
the Lagrangian of a quantum field theory cari possess an exact symmetry, even though that
symrnetry is not manifest in the physical states which that theory describes. In particular,
even though a Lagrangian may possess a certain symmetry, the ground or vacuum state does
This concept can be illustrated with a simple example. Consider a thi rod placed
verticdly on a table, and then pushed down with a ce&
critical value, the rod bends, in a certain direction, more or less at random. Note that the
unbent configuration is completely symmetric, but the bent state is not. In facf there are
inflinitely many possible new (degenerate) ground states, which are related by a rotational
symmetry, but the rod only chooses one of them!
parameter, in this case, the force F, assumes a critical value, and beyond that value, the
symmetric configuration becomes unstable, and so the ground States becorne degenerate.
Jefiey Goldstone pointed out a problem with this concept in 1961. He found that the
Whenever the Lagrangian has a continuous symmetry group, the new solutions have a
reduced symmetry and contain massless bosons. One consequence is that this kind of
theory cannot be applied to a vector partide without losing Lorentz invariance. A
method of losing symmetry is, of course, highly desirable in eIementary pacticle theory
but these theories wiil not do this without introducing non-existent masdess
bosons-..72
This result was derived more rigorously by Goldstone, Salam, and ~einberg." They proved
that this symmetry breakdown must be accompanied by the appearance of massless, spin-zero
particles, as physical states of the theory. These particles became known as Goldstone
bosons.
Goldstone bosons led to a parad& however, because the inspiration for this symmetry
breakdown had corne nom superconductivity, in which there are no massless particles. In
fact, as P. W. Anderson pointed out in 1963, the gauge particles in superconductivity, the
plasmons, act as if they are massive." This is because the plasmons can have both
transverse and longitudinal polarizations, but below a certain cutoff energy, the longitudinal
plasmon is indistinguishable from a massive vector boson. Anderson reasoned that in a
sirnilar manner, Gotdstone bosons could be converted into massive particles by interaction
with the gauge field?
argued that a Yang-Mills vector boson, impiied by the association of a generdized gauge
transformation with a conservation law, would not necessarily have zero m a s . These papers
inspired other theorists to show how symmetry breakdown using spin-zero fields could
generate vector-meson masses, particularly in the case of Yang--Mills theones. The frst to
appear a s zero helicity states of the vector gauge particles, which thereby*acquire a mas,
When the symmetry group of the Lagrangian is extended fiom global to local U(l)
transformations by the introduction of coupling with a vector gauge field, the
Goldstone boson becomes the longitudinal state of a massive vector boson whose
transverse states are the quanta of the transverse gauge field"
Higgs' mode1 conssts of the standard QED Lagrangian, augmented by a pair of scalar fields,
later called Higgs fields, which are coupled to the photon and to one another in such a way
that gauge invariance is preserved. The scalars obey a potential energy fnction which allow
them to break gauge symmetry by playing the role of a degenerate ground state."
Higgs
found that by giving the scalar fields a negative mass term in the Lagrangian, then instead of
yieldng a massless photon and two negative-mass scalar particles, the physical spectrum of
the theory contained a massive photon and a single massive d a .particle, later referred to as
a Higgs particle. Although the mechanism is ingenious, spontaneous symmetry breaking is
mt pite
a miracle-the mass term for the Higgs particle is contained in the kinetic energy of
the Higgs ground state fiom the beginning, and it is simply hidden."
insert the masses of the IVBs into the Lagrangian by hand, thus ruining the gauge invariance
of their theories. The success of spontaneous symmetry breaking, however, made it possible
for Weinberg in 1967, and independently, Salam in 1968, to replace the IVB mass terms with
masses generated by the Higgs mechanism- This aiIowed the vector mesons to have mass,
while preserving gauge invariance.
In this treatment, weak interactions can be viewed as being mediated by gauge bosons
which are massless before symmetry breakhg. The Lagrangian for the theory ais0 contains
tenns for rnassless electrons, muons, and neutrinos, and it is invariant under an intemal
syrnmetry group, which is a gauge symmetry. A scaiar field (the Higgs field) is then
Weinberg had been working upon broken symmetry since he first heard of the 1961
Nambu and Lasinio paper in the mid-si~ties.8~This paper, aside fkom pointing out the
analogy with superconductivity, had given an interesting interpretation of the pion, treating it
as arising fom chirality invariance,
In 1964, Adler and Weisberger each derived sum d e s which gave the ratio g
&
of axial
From the point of view of curent algebra, these d e s codd be regarded as a complete set of
hadronic states inserted in the commutation relations of the axial vector currents, but
Weinberg decided to interpret them dsfferently. He saw them as a consequence of the strong
interactions showing an approximate symmetry, based on the group SU(2)xSU(2), which was
spontaneously broken, giving rise to the nucleon masses?
Aithough the SU(2)xSU(2) symmeetxy is spontaneously broken, it still has a great deai
of predictive power. These predictions take the fom of approximate formulas, which give the
mark elements for low energy pion processes. in these cdculations, one is redy using not
only the fact that the strong interactions have a spontaneously broken approximate
SU(2)xSU(2) symmetry, but aiso that the currents of this symmetry group are, up to an
overall constant, to be identified with the vector and axial vector currents of 8-decay." In
1967, Weinberg thought of extending this broken SU(2)xSU(2) symmetry of the strong
interactions fiom a global symmetry to a local symmetry- He hoped that strong interactions
wodd then be descnbed by something like a Yang-Mills theory, but in addition to the vector
p
mesons predicted by Sakurai, he predicted that there would also be axial vector A l
rnesons.'* To give the p meson a mas, he needed to insert a common p and A l mass term
in the Lagrangian, and then through a spontaneous breakdown of the SU(2)xSU(2) symmetry,
he could split the p and A I masses. Since the theory was not gauge invariant, however, the
pions remained physical Goldstone bosons, and Weinberg could not renormalize itg9
In the fall of 1967, Weinberg reaIized that he had been applying his ideas to the wrong
problem. It is not the p which is massless-it is the photon! And its partner is not the Al,
but the set of massive NBs, the proposed mediaors of the weak interaction. Weinberg
understood, W y , that weak and electromagnetic interactions can be described in terms of an
exact but spontaneously broken gauge symmetry. At the tirne, however, Weinberg admits that
he had little confidence in his understanding of the strong interaction, so he concentrated on
ieptons only?O
In 1967, he presented his paper, "A mode1 of leptons". He reasoned that there are two
left-handed electron-type Leptons (e, and
U(2)xU(1). He postulated that ail unitary 2x2 matrices wodd operate on the left-handed
lepton doublet, and the 1x1 matrices wodd operate on the Bght-handed singlet,"
This group could be reduced, however, because U(2) breaks up into unimodular
transformations and phase tran&ormations, giving SU(2)xU(l)xU(I), and since one of the
U(1) symmetries can be idenfined with lepton number which is conserved, Weinberg decided
~ left the four-parameter group SU(2)xU(l).
to exclude it from the ~ o u p ?This
Weinberg constnicted his Lagrangian out of the lefi-handed doublet and the righthanded singlet, plus the gauge fields A, and B, which couple to an SU(2) group of weak
isospin T,and a U(1) group of weak hypercharge Y, plus a spin zero doublet,g3
This Higgs doublet has a vacuum expectatioa value which breaks T and Y,thus giving mass
to the electron, and to the weak vector bosons. M e r spontaneous symmetry breakdown
200
SU(2)xU(I) to the U(1) of ordinary electromagnetic gauge invariance, three of the four vector
bosons, the two charged bosons
boson, the photon, remains massless. AU four of these fields, however, are mixtures of the A,
and B, fields. The massive fields
Note the presence of the two coupling constants, g and g', whkh combine to give the two
physical coupling constants, e and Gw,which are related to the boson masses,
Saiam achieved similar redts, although he included muons in his theory, In addition, Salam
related the h
g of the gauge bosons not with the two coupling constants, but with an
angle?4 Weinberg soon adopted this idea hrmself, defining this mixing angle by tan&g'/g,
coupling of the neutral boson 2. Neutrd currents had already been predicted by Bludman
(1X8), by Glashow (1961), and by Saiam a d Ward (1964), but none of these previous
theories, however, had offered a definite prediction of the strength of the neutral currents.%
The theones of Weinberg anci Salam, however, included a prediction for the mass of the 2,
which le4 in tum, to a d e f i t e prediction of the strength of the neutral currentg7 Late in
201
processes?
scattering, but he had difnculty with the predictions involving strange particles,
... the data on strangeness-consenlng neutral current interactions are not yet good
enough to d e out the existence of such currents. In contrast, the upper limits on
strangeness non-conserving processes such as lf'-+Cct+p-,
X$_+~~++V+T
[and so on] are
very stringent, and show that no strangeneu-nonconservhg neutml current can have
any appreciable coupling to leptons. Thus some way must be found to introduce the
strangeness-changing charged currents, without ntroducing any strangeness-changing
neutraI current?
IIiopoulos, and Maiani. Deep inelastic scattering experiments in 1969 had suggested that
quarks were point-like entities, dowing theorsts to incorporate hadrons into the W-S model
by appIying the model to quarks as weii as to l e p t ~ n s . ' ~
Glashow, Iliopoulos, and Maiani
We wish to propose a simple model in which the divergences are properly ordered.
Our model is founded in a quark model, but one involving four, not three, fundamental
fermiom... The extra quark is the simplest modification of the usual model leading to
the proper ordering of divergences.... In contradistinction to the conventional (threequark) model, the coupling of the neutd-intennediary-now hypercharge conservingcauses no embarrasment-'O'
The fourth quark was nameci c h , &er a 1964 paper by Bjerken and Glashow, who had
suggested a fourth quantum number chmm as part of their proposed extension of the
Eightfold Way fkom SU(3) to SU(^).'" Although no particles containhg charm were
known in 1970, the idea of a charmed quark was appealing to Glashow, Iliopoulos, and
Maiani not only because it fked the strangeness-changing curent, but also because it burnped
up the number of quarks to four, and at the tirne, four leptons were hown, e,p,v,,v,.
202
This
theory was politeiy ignored, however, by the majority of the physics community, as oniy a
small group of theorists were interested in the K-decay an~maly.'~~
There was not much
interest in unified theories at all, in fact, because there was no proof of renormalizability, but
that would soon change.IM
Both Weinberg and Salam hoped that their theory was renormaIizable, but neither o f
them codd prove it. The Feynman d e s for Yang-MilIs theories had been worked out by
Feynman (1963), Faddeev and Popov (1967), and Taylor (L971)among others, who had
s h o w that unbroken gauge theories are renomalizable, but this proof held only if the gauge
pa&cIes were massless. There was no guarantee that renormalizability was not spoiled by
e~pre-~~ons.'"
Gell-Mannsuggested to Feynman that he try his slcls first on Yang-Mills
theory, since technically it was much easier, while it is aIso a non-linear theory, like gravity,
so that its gauge invariance resembles general coordinate invariance. Feynman did just
thaf
... the Yang-Mills theory is enormously easier to cornpute with than the gravity theory,
and therefore 1 continued most of my investigations on the Yang-MiUs theory with the
idea, if 1 ever cure that one, I'Ll nirn around and cure the other.... the Yang-MiIis case
took me about a day; to calculate the diagram in the case of gravitation 1 tried again
and again and was never able to do it..."'
Feynman's method involved the use of ghost-fields, which are fictitious scalar particles which
203
appear in closed loops in the Feynman diagrams, but which do not appear as physicai
particles. In 1967, Faddeev and Popov extended these rents fiom single-Ioop calculations to
the case of arbitrary diagrams involving Yang-MillsfeId~.'~*
In 1966, Martinus Veltman had presented a paper on the divergences of vector and
axial-vector hadronic current~.'~~
The paper showed that these divergences wuid be
expressed as products of the currents with vector fields representing the photon and the IVBs
of the weak interaction. At a conference tbat year, Veltman ran into Feynman, who explained
to
him that his equations were of the Yang-Mills type.IIO Veltman decided to investigate
case of the f i t e mass theory, and there is a discrete ciiffierence behveen the theory
with zero-mas and a theory with h i t e mas, no matter how s m d as compared to aLi
extemai momenta The reason is that a finite mass spin4 particle has three dinerent
states of poiarktion while a zero-mass particle has only two such states (with
helicities kl).'l2
Veltman discovered that the extra degree of freedom associated with the spin of a massive
vector particle was responsible for niining the renormalizability. He soon began to e n t e 6
the idea that scalar fields could be introduced into the Yang-Mills Lagrangian in such a way
as to cancel the two-loop divergences, and in 1971, the problem was finally solved by one of
204
In the summer of 1970, 't Hooft had attended a m e r school in Corsica where he
leamed about the o-model."'
Goldstone bosons. At m e of the seminars, 't Hoofi leamed that tbis theory couid be
renormakized without spoiling SSB. He asked the presenters, Lee and Symannk, if the resuIts
could be applied to Yang-Mills fields, but they both told him the same thing, to ask his
tacher, Veltman. For his Ph.D. under Veltman, 't Hooft decided to work on the
renormalization of rnassless gauge theory. Adopting many of Veltmads techniques, 't Hoofi
was able to devise a regulator method which allowed hn to prove the renormalizabilitj. of
Hooft realized that adding masses by hand would not preserve the delicate cancellations
necessary in the proof, but giving masses to gauge bosons by spontaneous symmetry breaking
mi&!
Five rnonths later, 't Hoofi presented his second paper, in which he proved the
canonical quantization. This was soon supplie& however, in a series of papers by Benjamin
Lee and Zinn-~ustin.'" Backed by Lee's authority, dong with the more conventional
formaism, theoreticai physicists soon fell in line with the view that gauge theones in which
broken to U(1), m
g mass to
all the vector bosons except the photon, and the theory is
renormalizable. Even so, it was not yet clear that the Weinberg-Salam model was adequate
describe Nature. That was a matter to be detenmined by experiment.
Although the search for neutral cunents had been a low priority at both CERN and
F e d a b , the renormalizability proof of the Weinberg-Salam theory raised interest
considerably. Experimenters placed new emphasis upon neutrino experiments, and they soon
got results. Neutrai currents of the correct form and magnitude as predicted by the W-S
model were found at the Gargamelle bubble chamber at CERN in 1973, and in a counter
experiment at Fennilab in 1974.'19 These were difflcdt experiments to perform, however,
and so they faced dispute initialIy.'20 Neutrino experiments face amajor problem with
neutron background, since both neutrinos and neutrons are electrically neutral, and so neither
leave any tracks. This makes it hard to distinguish between actual neutral current events, and
background events involving secondary neutrons generated by the neutrinos outside the active
chamber. Very impressive evidence came nom a different source in 1978, however, which
also allowed the mDang angle to be calcuiated. The results conkned the Weinberg-Salam
theory.
'"
The success of gauge theory also extended to the strong interactions, although in a
different fom. In 1973, Hugh Politzer, and independently, Gross and WiIczek discovered a
remarkable property of the Yang-Miils fields which came to be known as asymptotic fieedom:
as the energy of a process goes to inf?nity, the effective coupling constant goes to zero?
This was exciting, as it could explain scaling, which had been confirmed by deep inelastic
~cattering."~At the same time, however, it precluded the use o f the Higgs mechanisrn,
because the presence of any strongly interacing scdar particles in the theory would destroy
with distance can be relied upon to explain why quarks as weli as the massless gluons are not
seen in the laboratory-12' This property came to be known as confinement The theory of
quantum chromodynamics (QCD) was buiIt upon the foundation of an exact SU(3) group of
quark colour, and once again, the constraints of gauge invariance and renormalizability guided
the discovery of a Nitable Lagrangian, one which ensured the conservation of strangeness,
Ml three fundamental interactions were now represented by local gauge groups, but
already in late 1973, theorists were already looking for M e r unification. Pati and Salam,
and soon d e r , Georgi and Glashow, each attempted to nd a single unifying gauge group
which could represent all of the fiindamental interactions using an extended group
s m i c t ~ u e . ' Each
~ ~ of these two models assigned quarks and leptons to a single group, which
after spontaneous symmetry breaking, broke down to a product of the SU(2)xU(I) of the
electroweak interaction, plus the SU(3) of the strong interaction-"* Georgi and Glashow
explained that this has the advantage of predictng a single coupling constant7
... we see how attractive it is for strong, we& and electromagnetic interactions to
spring fiom a gauge theory based on the group F=SU(3)xSU(2)xU(l). Alas, this
theory is defective in one important respect: It does not trdy un* weak and
electromagnetic interactions. The SU(Z)xU(l) gauge coupling describe two
interactions with two independent coupling constants; a true unif5cation would involve
o d y one.'"
Although Pati and SaIam favoured SU(4)xSU(4), and Georgi and Glashow favoured SU(5),
there are many candidates for a unified gauge group. Any group is eligible as long as it
fulfils several mathematical requirements, the most obvious requirements k i n g that it has to
contain the gauge symmetry groups SU(3)xSU(2)xU(1), and it has to allow for the correct
reproduction of the particle content of the observed fermion ~pectnim."~
The interestkg part of ail this, of course, is not the agreement with old results, but the
new physics that a single uniQing group would predict! What new, perhaps hitherto
forbidden transitions are possible under the new symmetry group? What new kinds of
particles shouid experirnentalists be on the lookout for? Most grand unified theories share a
spectacular feature: they predict proton decay. The fist to predict this were Pati and Salam,
'"
Proton decay soon became a hallmark of unified theones- Aithough technically, it is possible
to construct unified theones in which baryons are absolutely stable, it is doubtEl that such a
theory would exist- If it did, it would look awkward because it would require a seemhgly
artincial arrangement of both local and global symrnetries, such that in the symmetry limit,
the baryon number for known matter would result In Pati and Salam's theory, for instance,
instead of baryon number and Iepton number being individually conserved,
(B+L) is conserved.'"
O&
their sum
The SU(5) group of Georgi and Glashow is a particdarly elegant example. This
group includes the regular leptons and quarks, and in addition, it predicts new, superheavy
coIoured vector bosons which can mediate lepton-quark transitions, making decays such as
p+?rO+e+ not only possible, but possibly accessible.
Weinberg used the group SU(5) to predict a proton lifetime of 6x10" years."'
A few
months earlier, Reines and Crouch had published a lower limit of 2x10~' years using a
scintillation detector developed for neutrino se arche^.'^^ This made theorists optimistic that
proton decay might be observed at this order of magnitude.
Experimentalists soon began sexching for proton decay deep underground, in mines
originally outftted for neutrino searches. By the mid-1980s, there was a whole series of these
SU(5) was conclusively d e d out by experimentL3' Although theorists mahtained hope for
proton decay, and they extended their efforts to other symmetries like S0(10), experimental
interest in the topic eventually waned. Many of the proton decay experiments were
themselves pirated, and wound up again as neutrino detectors. Whether or not evidence might
be found for proton decay in the future, the lifetime of the proton is certady Iong enough
1961,
and six years later, Salam and Weinberg had shown how to generate the masses of the gauge
particles through spontaneous symmetry breaking- As we have seen, the electroweak theory
then laid dormant for a few years, untiI 't Hooft proved its renormali~ability.~~~
In the
theory has provided a fiamework for future endeavour- Since its inception, the theory has
been able to incorporate al1 new resuits, and discoveries, including evidence of the c quark in
successfi that it becarne known as the Standard Model of electroweak interactions. Today,
together with quantum chromodynamics, it is referred to simply the Standard Model, and
although this theory is far from the final word on the subject, it rem&
reachhg theory based on just a single general principle, the principle of local gauge
invariance.
8. Concluding remarks
Although Yang and Mills had developed their non-Abelian gauge theory in an effort to
describe the strong interaction, it was so appealing that many theorists tried to apply it to the
weak interaction as weli. Aithough it was tempting to i d e n a
with intermediate vector bosons in the V-A theory, the masses of the particles spoiled the
gauge invariance of the Lagrangian. Schwinger attempted to place charged, weak IVBs in an
isotopic ple et dong with the photon Bludman, d e a d , ignored the electromagnetic photon,
and he attempted to place a neutral IVB in the triplet Glashow then combined the IVB
isotopic triplet and the photon into a unified SU(2)xU(l) theory, but like his predecessors he
had no a m e r to the zero-mass problem. Sakurai ignored this problem altogether, and he
identined the gauge vector bosons as the mediators of the strong interaction, and although he
predicted the properties of three physical vector mesons which were soon discovered, they
tunied out not to be gauge bosons at dl.
As more and more particles were discovered, theorists looked for a global symmetry
which couid organze them. The SU(2) isospin group is also a global symmetry, and it
NOTE TO USERS
212
UMI
the baryon octet to the decuplet, it had a problem, in that the baryon super-super-multiplet
needed to give the Yang-Mills gauge particles m a s to ensure the short range of the weak
interaction, while preseming gauge invariance. The electroweak unincation SU(2)xU(1) of
Glashow breathed new life in the hands of SaIam and Weinberg, who generated the mass of
the WZ and 2' bosons using the Higgs mechanism, while keeping the photon massless. A few
years later, 't Hooft showed that the theory was renormalizable. At long last, Yang-Mills
gauge theories codd have massive bosons while preserving gauge invariance! h e strong
interaction also proved to be amenable to gauge theory, but it did not need spontaneous
symmetry breaking. It was treated instead with an unbroken symmetry, so its eight gauge
particles are massless.
The appropriate group for the strong interactions turned out to be not
the broken symmetry of SU@), but the exact symmetry group of SU(3) colour. Ail three of
the fundamental particles are now associated with a gauge group, and efforts continue to find
higher symmetries. Consequently, the language of group theory has become unavoidable
within theoretical particle physics, at Ieast for the present.
-_
interactions. In this respect, symmetry is the glue that holds the Standard Mode1 together.
Of course, there is much work yet to be done- The Standard Mode1 is far from
complete, as it Ieaves many questions unanswered. Why, for instance, do quarks and leptons
corne in three different generations, ;nth different masses? Why do the electromagnetic and
weak interactions have separate coupling constants, instead of a single constant? How are
these forces related to the strong interaction? Why does gravity appear to be so diffierent?
Why are there more than twenq parameters in the Standard Mode1 that need to be
experimenlally determined? Why isn't there just a single constant? Although symmetry may
not prove to be the key to answering these questions, it W, at least, continue to provide a
context in which they may be asked.
1.
W a r d (1959)-
or equivalently,
that is, S is unitary- See Pais (19861, p. 498.
3Schwinger (1957), p. 424 T h e symmetry that exits between
the heavy bosons and fermions in their isotopic space properties
prompts us to ask: 1s there also a family of bosons that realizes
the T=l representation of the three-dimensional rotation group?
4.
6.
Glashow (1961) p. 583. The S stands for strangeness, since
this neutral boson is responsible for a strangeness-changing
neutral current. Glashow did not refer to Bludman for this idea,
but he did refer to a private communication with Gell-Mann.
7.
8.
( M A )4 .
9.
Sakurai (19601, p - 9.
10.
Hypercharge is equal to S for mesons, and S+i for baryonsSpecif ically, Y=S+B. [Geli-Marin and Nereman (1964) pp. 7-8 - ]
11. Gauge symmetry of baryon number was first considered by Lee
and Yang (1955).
12-
Sakurai (1960), p, 9 -
13.
14 -
The
Chapter 6 15.
6
and w
438-
17-
18.
19-
Gell-Mann 6
, p
36 -
23.
Gell-Mann was so pleaed with his theory, that he gave it a
name which alludes to the Eightfold Path to Nirvana in Buddhism.
This consists of right belief, right resolve, right speech, right
conduct , right living, right effort, right contemplation, and right
ecstasy. See Capra (19831, Chapter 6 24.
Gell-Mann (1961),p. 1 5 -
25.
26.
27
Gell-Mann (1961), p - 16 -
28.
These pseudovector mesons were referred to as V, X, 2, and
by Nereman.
34.
For a couple of years, there had been a problem, but it was
resolved earlier, in 1963, Early experiments had indicated that
the A and Z baryons had opposite parity, making the SU(3)
assignment of the two particles to the same state impossible, but
Courant et al (1963) showed, however, that the h and C did have
the s a m e parity35.
Gell-Mann (19611, p - 1 -
37,
Gell-Mann (1964b), p - 64.
T h e same idea is presented,
although not as clearly explained, in Gell-Mann (1962)-
38.
Gell-Mann (l964b), p. 6 4 .
results r o m
40.
41. As Pickering and Pais both point out, in 1962-63, there were
a few events recorded which showed AS=-AQ,
but after 1964, these
experiments were regarded to be in e r r o r . See Pickering (1984), p.
123, and Pais (1986), p. 5 6 4 .
42.
44.
45-
Gell-Mann ( 1 9 6 4 a 1 , p . 169.
46.
Sakata (1956).
47.
Sakata ( E E 6 ) , p - 6 8 7 .
48-
49.
50.
They actually used
U(l)xSU(3).
51 -
U(3).
.
but
this group
factors i n t o
52-
(1960),
Zweig ( 1 9 6 4 ) -
53.
Pickering ( 1 9 8 4 1 , Chapter 4 gives a lengthy cornparison of the
differing approaches of Gell-Mann and Zweig.
55.
Gell-Mann (1964b)-
56.
Gell-Mann (1964b1, p. 7 5 .
57.
58. Sakita (1964) explored the m a s s relations between SU ( 3 ) supermultiplets in d e t a i l . He was also a b l e to calculate the magnetic
moments of the baryons.
59 -
Pais
(1986), p - 560.
61.
62-
Greenbexg ( 1 9 6 4 1 , p - 600.
65.
68-
69.
70. See Moriyasu (1983) pp. 570-74, and Brown and Cao (1991) for
a more complete description of the gauge superconductor.
71.
72.
73.
74.
75.
76.
Besides Higgs (1964a, 1964br 19661, other papers on the
subject include Brout and Englert (1964), Englert , Brout and Thiry
(1966), and Kibble (1967). Incidentally, Salam gives credit to
Kibble for tutoring him on the H i g g s mechanism at Imperia1 College.
77.
78-
Kibble (1967).
79 -
In l a t e r years,
l e p t o n as well.
82.
T
85-
86.
87. Through this assumption, g,/g, gets into the picture through
the Goldberger-Treiman relation, which gives this ratio i n terms of
t h e pion decay constant and t h e pion nucleon coupling.
This
relation shows a cannection b e t w e e n the w e a k interactions and the
strong interactions, but this was not really understood f o r alrnost
a decade.
W e i n b e r g ( 1 9 8 0 1 , p. 517.
88.
91.
92.
as
94.
(#+,#O)
95.
Weinberg (19321, p. 1414,
the Weinberg angle.
100 -
1.07.
Feynman (1963).
108.
this on
has the
of the
totally
Veltman (1970)-
113
(1971a)-
t'Hooft (1971b), ' t gooft got his Ph-D- the following year,
in 1972-[Pickering ( 1 9 8 4 ) , p. 2011
115-
116.
The use of path integrals in gauge theory only became
standard in the 1970s.[Pickering (19841, p . 2011
117.
118 -
1x9
122 .
If the neutral currents had demonstrated different
propert ies , other spontaneously broken gauge theories would have
been possible, and even if neutral currents had not been
discovered, G e o r g i and Glashow presented an electroweak theory in
Politzer was
125 -
126
127.
128. These conservation laws are ensured apart from some so-called
instantons, which are related to the Bell-Jackiw anomaly, as
explained by 't Hooft (1976).
For a thorough history of the
development of QCD, see Pickering ( 1 9 8 4 1 , Chapter 7.
129-
130. In Pati and SaLam's model, the strong interaction group they
choose is actually S U ( 3 ) x S U ( 3 ) .
131 -
132 Another requirement is that the group must be of rank 4 Together, these requirements limit possible candidates to S U b ) for
1225, SO (4ni.2 for nr2, and E6 .
133 -
3 6
Reines and Crouch (1974), Twenty years earlier, Reines,
Cowan, and Goldhaber (1954) had performed the first explicit
experimental tests of the tability of the proton- At that time,
they did not doubt its stability, but they ha& read Wigner (19491,
which discusses a conservation law of heavy particles, and so and
they were interested in invetigating the extent to which the
stability of nucleons could be experirnentally demonstrated. They
determined that the proton lifetime was at least greater than 102'
years, regardless of the decay mechanism, but they reasoned that
most of theiw count rate could be attributed to cosmic rays, and so
they realized that they could get better results by going deeper
underground, and that the proton lifetime should be even longer.
138-
140.
Bell and Jackiw (1969), and then Stephen Adler (1969),
discovered a triangle anomaly related to the axial vector
interaction. Bouchiat, Iliopoulos, and Meyer (1972) then pointed
out that this A B J anomaly destroys the renormalizability of certain
SU (2 x U (1) electroweak theories, specifically, it ruins the mode1
with three quarks. With the addition of a fourth quark, however,
renormalizability is restored (as long as t h e r z re three
flavours) . Since the GIM rnechanisrn dernands a fourtn quark, its
reptation was enhanced even before the discovery of charm.
141.
142. The W and Z bosons were discovere. shortly after, in 1983 See Rubbia (1985)-
M. Conclusions
In 19 18, Weyl discovered gauge invariance within the context of general relativity.
Weyl was trying to mite gravitation and electromagnetim at a time when the o d y known
particles were the proton, neutron, and photon. His work was mathematicdy appeafing, and
his discovery of scaie invariance ailowed him to derive the conservation of charge.
Unfominately, his theory codicted with experiment, but nevertheless, it was inspiring to
other theorists through its association of geornetrical structure with electromagnetism.
Weyl mon tumed his attention tu the emerging quantum rnechanics Both he and
Wigner pioneered the application of group theory to the new physics. Although they found
the permutation and rotation groups were very useful, however, their work was not widely
quantum mechanics grew into quantum field theory, other symmetnes emerged: parity,
charge-conjugation, and time-reversal. Of these, party was the most important, as it was
that it be applied innead to the wave-hction. Weyl took up this idea, and he elevated gauge
invariance to a principle. Gauge symmetry was soon recognked as a symmetry of quantum
electrodynamics, as applying gauge invariance to the Lagranpian Leads to the introduction of
the electromagnetic field. ALthough gauge theory was mathematidy attractive, however, it
e electrornagnetimi,
short range.
charge independence. A few years later, Fermi offered a new model of fl-decay which
mproposed
,
a
which he hoped couid explain both the work of Heisenberg and Fermi.
Meson physics became quite popular, and using SU(2), the neutrd meson was
proposed by Kemmer on the grounds of symmetry! Meson physics soon ran into trouble,
however, as there was a conflict between theory and experiment It was not until 1947 that it
was realized that there are two mesons in cosmic rays, the p and the K.
soon discovered in cosmic rays as weL They were unusual in that they were produced in
abundance, yet they decayed slowly. This was explained by Pais, who proposed a new
selection nile which accounted for strong production, while ailowing for weak decay, The
strong and weak nuclear forces had been distinguished at I a s t
interactions. Study of the strange K-particles soon revealed that weak interactions vioIated
more than strangeness, they violated party! rUthough parity conservation had been tnisted
for y=,
it came crashing down in 1957, and party violation rapidly became the hallmark of
Gauge invariance was revived by Yang and Milis in 1954. They were very impressed
by the link between charge conservation and the introduction of the eiectromagnetic field, and
so they decided to extend the idea to isotopic spin conservation. They boldy imposed
isotopic gauge invariance upon the strong interaction Lagrangian, forcing the introduction of
three new vector fieids.
symmetry! Yang and Miils realized that these particles needed to be massive to explain the
shoa range of the nuclear interaction, but they recognized, at the same time, that addhg masterms to the Lagrangian would ruin gauge invariance. In spite of this zero-mass problem,
Sakurai forged ahead in 1960,and predicted that there were not only three vector particles
coupling to the isotopic spin current, but there should also be two more, one coupling to the
hypercharge current, and the other to the baryon number current.
Gell-Mann and Ne'eman built upon this work, increasing the number of vector
particles to eight. These eight gauge bosons were associated with unitary spin, a new quantity
which incorporated conservation of isotopic spin and hypercharge. Their scheme, the
Eightfold Way, not only related the eight vector mesons to each other, but it dso d o w e d the
together dinerent isotopic spin multiplets in which dl the particles have idenacal parity and
spin, and in addition, they allowed Gell-Mann and Ne'eman to predict new particles, which
Both Gell-Mann and Zweig suggested that the fiindamental triplet of SU(3) could be
identified with a trio of particles, known as quarks. Gell-Mann did not view them as physicai
particles at frst, but they rapidiy became indispensable within partcle physics. U h g quarks,
ali
Furthemore, although previously, it had not been clear how to make weak currents fiom
hadrons, weak currents codd easily be constructed fiom quarks.
Efforts were being made to apply Yang-Mills theory to weak interactions as well, and
indeed, it was tempting to identify weak IVBs as gauge particles. Schwinger attempted to
place the two charged weak M3s and the electromagnetic photon in an isotopic triplet, and
following him, Bludman also used an isotopic triplet, although he repIaced the photon in the
representation with a neutral weak IVB. In 1961, Glashow decided to include ali four of
these mediating particles
other efforts, it faced the zero-mass difnculty. This long-standing problem was finally solved
using spontaneous symmetry breaking. In 1967-68, Weinberg and Salam showed how to
apply the ingenious Higgs mechanism to generate vector meson masses while preserving
gauge invariance. At last, there was a gauge theory of the eiectroweak interactions, but
neither Weinberg and Salam could prove that it was renomalizable. This was nnally
achieved in 1971 by a Dutch graduae student, Gerard 't Hooft. Following this achievement,
in 1973, it was shown that the strong interactions could also be treated using the local gauge
group SU(3) of colour, which predicts the existence ofeight massless gluons.
symmetry allowed theonsts to knpose symrnetries upon the Lagrangian, thus generating
dynamics, and predicting the existence of physicai particles. As aiways, these theones are
subject to experimental verification, but the combination of SU(3)xSU(2)xU(I) bas passed all
the tests.
To this day, the Standard Mode1 serves as a fiamework for future practice, and at
the same time, it serves as a constant reminder of the power of group theoq, the formal
mathematics of symmetry.
References
M. Alston et al- (1960). "Resonance in the AT system", Physical Review Letters, 5, 520-24D. Amati, H. Bacry, J. Nuyts, and J: Prentki (1964). SU(4) and strong interactions", Physr'cs
Letters, 11, 190-92.
H.L. Anderson, E. Fenni, E. A. Long, and D. E. Nagle (1952). "Total cross sections of
positive pions in hydrogen", Physical Review, 85, 936.
P. W. Anderson (1963). "Plamions, gauge invariance, and mass", Physical Review, 130,
43 9-42.
T. Appelquist, A. Chodos, and P. Freund (1987). Modern ffiluza-Klein Theories, New York:
Addison Wesley
J- Ashkin, J. P.
204-6.
C. Becchi, A. Rouef and R Stora (1976). "Renorrnalzationof gauge theoresm,A n m k of
Physics, 98, 287-321.
R. E. Behrends, J. Dreitlein, C. Fronsdal, and W. Lee (1962). "Simple groups and strong
interaction symmetres", Reviews of Modern Physics, 34, 1-40.
R. E. Behrends and A. Sirlin (1961). "Weak-couplingcurrents and symmetries of strong
interactions", PhysicaL Review, 121, 324-3 6.
J. S. Bell (1967). "Cunent algebra and gauge variance",Nuovo cimer& A, 50, 129-34.
J. S. BeIl and R. Jackiw (1969). "A PCAC puzzle: nu-. yv in the cr-model", Nuovo Ciniento
A, 60, 47-61.
A. Benvenuti et al. ( 1 974). "Observation of muonless neutrino-induced inelastic interactions",
PhysicaZ Review Letters, 32, 800-3.
H . A. Bethe and R F. Bacher (1936). 'NucIear physics. Stationary States of nuclei", Reviews
of Modem Physics, 8, 82-229.
H . A. Bethe and E. Fermi (1932). "ifber die Wechselwirkung von zwei Elekaonen",
Zeitsckfl fiir Phys& 77, 296-306.
H. J. Bhabha (1 938). "Nuclcar forces, heavy electrons, and the @-decay",Nature, 141,
117-18.
J. D.Bjorken (1 966). "Applicationsof the chiral U(6) x U(6) algebra of current densities",
PhysicaZ Review, 148, 1467-78.
J. D.Bjorken (1969). "Asymptotic sum d e s at innnite momenturn", Physicul Review, 179,
1547-53.
J. D. Bjorken and S. L. Glashow (1 964). "Elementary parcles and SU(4)lt, Physics Leners,
11, 255-57.
J. D. Bjorken and E. A. Paschos (1969). "Inelastic electron-proton and y-proton scattering
and the structure of the nudeon", Physical Revew, 185, 1975-82.
E. D.Bloom er aL (1 969). "Hi& energy inelastic e-p scattering at 6' and 10", P ~ s i c u l
Review Letrers, 23, 930-34.
D. Bloor ( 1 976). Knowledge and Social Imagery, Chicago: University of Chicago Press.
S. A. Bludman (1958). "On the univerd Fermi interaction", Nuovo Cimento, 9, 433-44.
N. Bohr (1 932). "Chemistry and the quantum theory of atomic constitution", Journal of the
Chernical Sociery, 135, 349-54N. Bohr (1936)- "Conse~ation
laws in quatlttm theory", Nature, 138, 25-26.
L. M. Brown (198 1). "Yukawa's prediction of the meson", Cenruum, 25, 71-132L. M. Brown and T. Y.Cao (1991). "Spontaneous breakdown of symmetry: its rediscovery
and integraiion into quantum field theory", Historicd Studies in the Physical and Biological
Sciences, 21, 211-3 5-
D,8, 4383-94.
T. Y. Cao (1997). Conceptual Developments of 20th Ccntuv Field nieories, Cambridge:
Cambridge University Press.
T. Y. Cao and S. S. Schweber (1993). "The concephial foundations and the philosophical
aspects of renormalization theory", Synthse, 97,33-108.
F- Capra (1983). Tao of Physics, Second edition, Boston: New Science Library,
M. Carmeli (1977).
18, 17-20.
B. Cassen and E. U. Condon (1 936). "Onnuclear forces", PhysicuI Review, 50, 846-49.
J. Chadwick (1932a). "Possible existence of a neutron", Nature, 129, 312.
J- Chadwick (1932b). "The existence of a neutron", Proceedings of the Royal Society A, 136,
692-708.
J. Chadwick and M- Goldhaber (1934). "A 'nuclear photo-effect': Disintegration of the
diplon by y-rays", N ~ t w e ,134, 237-3 8-
K Chandrasekharan, ed. (19 8 0). Hermann Weyl, Centenary Lectuqes, Berlin: Springer.
S. Coleman (1 965). "Trouble with relativistic SU@)", PhysicaZ ~eviewB, 138, 1262-67.
S. Coleman (1979). "The 1979 Nobel prize in physics", Science, 206, 1290-92.
S. Coleman and
31, 85 1-54.
S. Coleman and J. Mandula (1967). "All possible symmetries of the S-matrix", Physical
Review B, 159, 1251-56.
A. Compton (1923). "A quantum theory of the scattering of X-rays by Iight elements",
Physical Review, 21,483-502.
A. Compton and A. W. Simon (1925). "Drected quanta of scattered X-rays", P?zysicul
Rmiew, 26, 289-99.
R E. Cutkosky (1963). "A mechanism for the induction of symmetres among the strong
interactions", Physical Review, 131, 1888-90.
R E. Cutkosky and P. Tarjanne (1963). "Self-consistent deviations fkom unitary symmetry",
PhysicuZ Review, 132, 1354-61.
R H. Dalitz (1953). "On the analysis of r-meson data charge and the nature of the mneson",
Philosophical Magazine, 44, 1068-80.
R H. Da1it.z (1954). "Decay of 7-mesons of known charge", Physical Review, 94, 1046-51-
M. Daniel and C. M. Vialiet (1980). "The geometrical setting of gauge theories of the -YangMills type", Reviews of Modem Physics, 52, 175-97.
C. G. Darwin (1927). "The electron as a vector wave", Nature, 119, 282-84.
L-de Broglie (1 923a). "Ondes et quanta", Comptes Rendus de Z Acadmie des Sciences.
Parr's, 177, 507-10,
W. Deppert, K. Hbner, A. Obserschelp, and V. Weidemann, eds. (1988). Eiact Sciences und
rheir Philosophical Foundatiom, (VorWge des Internationalen Hermann-Weyl-Kongresses,
Kiel 1985)- Frankfkt: Peter Lang,
A. De Rijula, K Georgi, and S. L. Glashow (1975). "Hadron masses in a gauge theory",
Physical Review D,12, 147-62-
1, 9-18.
J- J. de Swart (1962). "The octet mode1 and its Ciebsch-Gordan coefficients", Reviews of
Modern Physics, 34, 9 16-3 9-
B. S. DeWin (1965). Dynmical Theory of Groups and Fiel&, New York: Gordon and
Breach.
B. S. DeWitt (1967b).
"Quantumtheory of gravity
P. A. M. Dirac (1926). "On the theory of quantum mechanics", Proceedings of the Royal
Society A, 112, 66 1-77.
P. A. M.Dirac (1927a). "The quantum theory of the emission and absorption o f radiation",
Proceedings of the Royal Sociery A, 114, 243-265. Reprinted in J. Schwinger, ed. (1958).
Ouantum Electrodynamics, 1-23.
P. A- M. Dirac (1928a). "The quantum theory of the electron", Proceedings of the Royal
Society A, 117, 610-24.
P. A. M. Dirac (1928b). "The quantum theory of the electron. Part II", Proceedings of the
Royal Society A, 118, 351-61.
P. A. M. Dirac (1 930). "A theory of electrons and protons", Proceedings of the Royal
Society A' 126, 360-65.
F. J. Dyson (1966). S'etry Groups in NucZear and Pmticle Physics, New York:
Benjamin
A. S. Eddington (192 1). "A generalization of Weyl's theory of the electromagnetic and
gravitational field", Proceedings of the Royal Society A, 99, 104-22.
P. Ehrenfest and J. Oppenheimer (1931). "Note on the statistics of nuclei", Physical Review,
37, 333-38.
A. Einstein (1905). "Zur Elektrodynamik bewegter KBrper", Annalen der Physik; 17, 89 1921. Translated by W. Pemett and G.B. Jeffery (1923, 1952) in Principle of Relativity, 37-65.
Sirnmgsberichre der
J. P. EIliott and P.
University Press.
C. D. Ellis and N. F. Mott (1934). "Energy relations in the 8-ray type of radioactive
disintegration",Proceedings of the Royal Society A, 141, 502-1 1.
F. Englert and R Brout (1964). "Broken symmetIy and the mass of gauge vector mesons",
Physical Review Letters, 13, 32 1-23.
L. D. Faddeev and V. N. Popov (1967). "Feynman diagrams for the Yang-Mills fieldtt,
Physics Letters B, 25, 29-30.
G. Feinberg and F. Grsey (1959). "Space-tirne properties and internai symrnetries of strong
interactions", Physical Review, 114, 1 153-70.
E. Fermi (1926). "Zur Quantelmg des idealen einatomigen Gasesl',Zeitschnifr Physik, 36,
902-12.
E.Fermi (1932). "Quantum theory of radiation", Reviews of Modern Physics, 4, 87-1 32.
E. Fenni (1934a). "Tentative di una teoria dei raggi p", Nuovo Cimento, 2, 1-19 . Translated
as "Attempt at a theory of 8-rays" in P. K. Kabir (1963). Ine Development of Wenk
Interaction Theov, 1-2 1.
E. Fermi (1934b). "Versuch einer Theorie der &Stra.len. In, ZitschriijijZr Physik, 88,
161-71.
E. Fexm and C. N. Yang (1949). "Are mesons elementary partides?", PhysicaI Review, 76,
1 739-43.
..
M. Fierz (1 939). "ber die relativistische Theorie kraftefieier Teilchen mit beliebigem Spin",
Helvetica P-ysica Acta, 12, 3-37.
D. Fine and A. Fine (1997). "Gauge theory, anomalies and global geometry: The interplay
of physics and mathematics", Studies in Hktory and Philosophy of Modern Physics, 28,
3 07-23.
G. N. Flerov, D. S. Klochkov-V-S. Skobkin, V.VI Terent'ev (1958). "Spontaneous fission
of Thx' and the stability of nucleons",Soviet Plysics Dokiady, 3, 79-80.
V. Fock (1926b). "ber die invariante Form der WelIen- und der Bewegungsgleichungen fr
einen geladenen Massenpuddl, Z e i t s c Physik
~ ~ 39, 226-32.
V. Fock and D. Iwanenko (1929a). "Gometrie quantique hare et dplacement parallle",
Compte Rendus de I'Acndmie des Sciences, 188, 1470-72.
V. Fock and D. Iwanenko (1929b). "ber eine mogliche geometrische Deutung der
relativistischen Quantentheorie", Zeitschrzji fiu Physis 54, 798-802.
V- Fock and D-Iwanenko (1929~)."Quantum geometry", Nature, 123, 838.
R. H. Fowler (1926). "General forms of statinical mechanics with special reference to the
requirements of the new quantum mechanics", Proceedings of the Royal Sociew A, 113, 43 2-49.
A. Franklin (1983). "The discovery and acceptance of CP-violation", HistoricuI Studies in the
Physicul Sciences, 13, 207-3 8.
I. L. Friedman and V. L. Telegdi (1957). Wuclear emulsion evidence for party nonconservation in the decay chah ~'+~*+e+", PhysrkaI Review, 105, 1681-82.
H. Fritzsch, M. Gell-Mann, and K LeutwyIer (1973). "Advantages of the colour octet gluon
picture", Physics L e m s B, 47, 365-68.
W. H.Furry (1937). "A symmetry theorem in the positron theory", PhysicuI Review, 51,
125-29.
W. H. Furry and J. R Oppenheimer (1934). "On the theory of the electron and positve",
Physical Review, 45, 245-262.
M. K. Gaillard and B. W- Lee (1974). "Rare decay modes of the K-mesons in gauge
theories", Pkysical Review D, 10, 897-9 16.
P. Galison (1983). "How the frst neutral-curent experiments ended", Reviews of Modern
Physics, 55, 477-509.
M. Gell-Mann (1953). ffIsotopicspin and new unstable particles", Physicatl Review, 92, 83334.
M. Gell-Mann (1960). "Conserved and partially conserved currents in the theory of weak
interactions", Proceedings of the Intemutional Conference on High Energy Physics
(Rochester, I96O), 508-13. Reprinted in M. Gell-Mann and Y.Ne'eman (1964). The
Eightji~ldW q ,21 0-15.
M. Gell-Mann (1962). "Symmetriesof baryons and mesonst', Ph.ysicuZ Review, 125, 1067-84.
M. Gell-Mann (1964a). "A schematic model of baryons and mesons", Physics Leners, 8,
214-15.
M. Gell-Mann(1964b). "The symmetry group of vector and axi-al vector currents", Physics,
1, 63-75.
10, 107-17.
S-L. Glashow (1980). "Towarda unified theory: threads in a tapestry", Reviews of Modem
R. J. Glauber (1953). "On the gauge invariance of the neutral vector meson theox-y", Progress
of Theoretical Physics, 9, 295-98.
M. L. Goldberger and S. B. Treiman (1958). "Decay of the r-meson", Physical ~evie;, 110,
1178-84.
M. Goldhaber, P. Langacker, and R Slansb (1980). "1s the proton stable?", Science, 210,
851-60.
J. Goldstone (196 1). "Field theones with niperconductor>> solutions", Nuovo Cimento, 19,
154-64.
D.J.
Griffiths (198T).
D. J. Gross and R.
Phpical Review D,6, 477-93D. J. Gross and F. ~
(1 973a)k "Ultravioletbehaviour of non-Abelian gauge thwres",
Physical Review Letters, 30, 1343-46.
D. J. Gross and F. Wiczek (1973b). "Asymptotidy frze gauge theories. I", Physical Review
LI, 8, 3633-52.
F. Grsey (1960). "On the symmetries of strong and weak interactions", Nuovo Cimento, 16,
230-40-
F. Grsey and L.A. Radicati (1964). "Spin and unitary spin independence", P?zysicd Review
Letfers, 13, 173-75.
M. Y.Han and Y. Nambu (1965). "Three-tripletmodel with double SU(3) symmetry",
Physical Review B, 139, 1006-10.
Y.Hara (1 964). "Unitary triplets and the Eighaold Way", Physical Review B, 134, 701-4.
F. J. Hasert et al. (1973). "Observationof neutrino-like interactions without muon or electron
in the Gargamelle neutrino experimentt',Physics Letters B, 46, 138-40.
J. Heilbron (1983). "Origins of the exclusion principle", Hisrorical Studies in the P h p i c d
Sciences, 13, 261-310.
W. Heisenberg (1 925). "ber quantentheoretische Umdeutung kinematischer und
mechanischer Beziehungen", Zitsch~fur
Physik, 33, 879-883. Translated as "Quantumtheoretical reinterpretation of kinematic and mechanid reiations", in B. L. van der Waerden
(1 968). Sources of Quantum Mechunics, 261-76,
W. Heisenberg (1 926a). "Mehrkorperproblem und Resonanz in der Quantenmechanik",
W.Heisenberg (1926b). "mer die Spektra von Atomsystemen mit zwei Elekaonen",
Zeitschrzy fur Physik, 39, 499-51 8.
W. Heisenberg (1933). "ber den Bau der Atomkeme", ZeitschriJtfr Physik, 80, 587-96.
Translated as "On the structure of atomic nuclei. III", in.DIM. Brnk (1965). Nuclear
Forces, 155- 160.
W. Heisenberg (1934). "Bemerkungnir Diraschen Theorie des Positrons", Zeitschrz3fiP
Physik, 90, 209-3 1,
W. Heisenberg (1936). "Zur Theorie der Schauer in der H6henstrahlungW,Zeitschn'fr
Physik, 101, 533-40.
W. Heisenberg (1943a). "Die 'Beobachtbaren Grossen' in der Theorie der ElementarteilschenI.", Zeitschrifrfiir Physik 120, 513-38.
W. Heisenberg (1943b). "Die'Beobachtbaren Grossen' in der Theone der
Elementarteilschen. II.", Zeitschrzjf fur Physik, 120, 673-702.
W. Heisenberg (1957). "Quantum theory of fields and elementary particles", Reviews of
Modern Physics, 29, 269-78.
W. Heisenberg and W. Pauli (1930). "Zur Quantendynamik der Wellenfelder. II", Zeitschrzp
fiir Physik, 59, 168-190.
W. Heitler (1944). Quantum Theov of Radiation, Second edition, Oxford: Clarendon Press.
P. W. Higgs (1964a), "Broken symmetries, massless particles, and gauge fields", Physics
Letters, 12, 1.32-33.
P. W.Higgs (1 964b), "Broken symmetries and the masses of gauge bosons", Physicul Reviav
Letters, 13, 508-9.
D. Hilbert (1915). 'Pie Grundlagen der Physik (Erste MitteiIung)", Nachncken von
KonigZichen der Gesellschnfi der WSsenschajien ru Gdningen, Mathematische-Physikolische
M m ,395-407.
D.Hilbert (1 917). "DieGrundlagen der Physik (Zweite Mitteilmg)", Nachrichteen von
Kiniglichen der Gesellschafi der W&senschafin ru Gdttingen, Mathemarische-Pbsikalische
M a ,53-76.
M. Ikeda, S. Ogawa, and Y. Ohnuki (1959). "Apossible symmetry in Sakata's mode1 for
bosons-baryons systems", Progress of Theoretical Physics, 22, 715-24.
D. Iwanenko (1932). "Sur le constitution des noyaux atomiques", Comptes Rendu de
2 'Acadmie des Sciences, Paris, 195,439-44 1.
Hill.
C. Jarlskog (1 98 1). "Quantum flavour dynamicsl',Physica Scripta, 24, 867-72.
P. K. Kabir (1963). The Development of Weak Interaction nieory, New York: Gordon and
Breach.
N.Kemmer (1982).
O. Klein (1926b). "The atornicity o f eiectricity as a quantum theoretid law", Nature, 118,
516.
Klein (1938). "Onthe theory of charged fields", in New Theories in Physics (Warsaw
Conference, 30 May-3 June 1938). Reprinted in 2. Ajduk, S. Pokorski, and A. Trautman,
eds- (1989). New Theories in Physics. Proceedings XT Warsaw Symposium on EZemenrary
O.
M. Kline (1972). Mathematical huglrlfiom Ancienf to Modern Times, New York: Oxford
University Press.
E. J. Konopinski and H.M. Mahmoud (1953). "The universai Fem interaction", PhysicaZ
Review, 92, 1045-49-
E. J. Konopinski and G.E. Uhlenbeck (1935). "On the Fermi theory of @-radioactivity",
Physical Review, 48, 7-12-
H.A. Kramers (1924). "The Iaw of dispersion and Bohr's theory of spectra", Nature, 113,
673-76.
et al.(1 9%).
1901-5.
- O. Laporte (1924). "Die Smikhir des Eisenspekaums", Zeischny ffu Physik, 23, 135-75.
T. D. Lee, M. Rosenbluth, and CN. Yang (1949). "Interaction of mesons with nucleons and
light particles", Physical Review, 75, 905.
T. D. Lee and C. N. Yang (1955). "Conservation of heavy particles and generalized gauge
transformations", Physical Review, 98, 150 2.
T. D. Lee and C. N. Yang (1956b). "Charge conjugation, a new quantum number G, and
selection niles concerning a nucleon-antinucleon system", Nuovo Cimento, 3, 749-53T. D. Lee and C. N. Yang (1957). " P m nonconservation and a two-component theory of
the neutrino", Physical Review, 105, 1671-75.
T. D.Lee and C. N. Yang (1960). "Implicaions of the intemediate boson bais of the weak
interactions: existence of a quartet of intermediate bosons and their dual isotopic spin
transformation properties", Physical Review, 119, 1410-1 9.
T.D. Lee and C. N. Yang (1962). "Theory of charged vector mesons interacting with the
electrornagnetic field", PhysicaZ Review, 128, 885-98,
T. Levi-Civit (1917). "Nozione di parallelisno in uno variet quaniunque e consequente
spezificazione geometrica deiia curvatuta Riemanniana", Rendico>ui del Circolo Matematico di
Pdermo, 42, 173-203.
L- F-Li (1974). "Group theory of the spontaneously broken gauge symmetries", Physical
Review D,9, 1723-39.
D. B. Lichtenberg (1969). "Baryon supermultiplets of SU(6)x0(3) in a quark-diquark model", Physical Review, 178, 2197-200.
S. Lie (1 893). Vorlesungen aber continuierliche Gruppen mir geomehischen und anderen
Anwendungen, bearbeitet und herausgegeben von G.Scheffers, Teubner: Leipzig. Reprinted
(1971), Bronx, New York: Chelsea
C. H. Llewellyn Smith (1973). "High energy behaviour and gauge symmery", Physics
Leners B, 46,233-36-
F. London (1 9276). "Quantenmechanische Deutung der Theorie von Weyl", Zeitschrzjl jiir
Physik,42,375-89.
K. G. Loos (1967). "Intemal holonomy groups of Yang-Miils fields", Journal of
Mathemutical Physics, 8, 2 114-24.
H. A. Lorentz (1904). "Electromagneticphenomena in a system moving with any velocity
less than that of light", Proceedings of the Section of Sciences. KoninWijke Akademie van
Wetenchappen te Amsterdam, 6, 809-30. Reprinted in W. Perrett and G.B. Jeffery
(1923, 1952). Principle of Relativity, 11-34.
J. M . LoSecco (1985). "Status report on the search for proton decay", Comments on Nucieat
and Particle Physics, 15, 23-3 4-
R. McConnmach (1970). "Einsteig Lorentz, and the electron theory", HiStorical Studies in
the Physical Sciences, 2, 41-87.
J. Mehra (1974)- Einstein, Hilbert, and the Theory of Gravitarion, Dordrecht: Reidel-
L. Michel (1950). "Interaction between four half-spin particles and the decay of the pmeson", Proceedings of the Physical Society of London A, 63, 514-32.
G. Mie (19124. "Die Gnindlagen eher Theorie der Matene", Annalen der Physik 37,
511-34,
G. Mie (1 9 12b). "Die Gnindlagen einer Theorie der Matene", A n d e n der Physik, 39, 1-40.
G. Mie (1913). "Die Gnindlagen einer Theorie der Materie", Annalen der Physik, 40, 1-66.
H. Minkowski (1908). "Raum und Zeit", lecture delivered at the Congress of Scientists,
Cologne, 21 September 1908). Published in H. Minkowski (1909), Physikalische Zeirschttji,
10, 104-11. Translated by W. Perrett and G-B.Jeffery (I 923, 1952) in Principle of ReZafivity,
75-91.
Y. Ne'ernan (1961). "Denvation of the strong interactions nom a gauge invariance", N'clear
Physics, 26, 222-29.
K. Nishijima (1954). "Some remarks on the even-odd d e " , Progtess of Theoretical Physics,
12, 107-8.
J. Norton (1984). "HowEinstein found his field equations, 1912-19 15", Historieai Shrdies in
the Physical Sciences, 14, 253-3 16.
A. Pais (1 9%).
A. Pais (1 961). "On symmetries shared by strong and weak interactions",Reviews of Modern
Physics, 33, 493-97.
A. Pais (1966). "Dynamicd symmetry in particle physcs", Reviews of Modern Physics, 38,
215-55.
A. Pais (1 982). SubtIe is the Lord, New York: Oxford University Press.
J. C. Pati and A. Salam (1973a). "Unifedlepton-hadron symmetry and a gauge theory of the
basic interactions" Physical Review D, 8, 1240-5 1.
J. C. Pati and A. Salam (1973b). "1s baryon number conserved?" Physical Review Letters, 31,
661-64,
W. Pauli (1921). "Relativitatstheorie", EncyHopdie der mthematischen Whsenclurfien,Band
V, Leipzig: Teubner. Translated by G. Field (1958, 1981) as Theov of Relativity, New
York: Dover.
W. Pauli (1925). "mer den Zusammenhang des Abschlusses der Elektronengruppen im Atom
mit der Komplexshvktur der Spektren", Zeitschrzjffir Physik, 31, 765-783.
W.Pauli (1940). "The connection between spinand statistics", Physcul Review, 58, 716-22.
W.Pauli (1941). "Relativistic field theories of elementas, particles", Reviews of Modern
Physics, 13, 203-32.
W.Pauli (1955). "Exclusion principle, Lorentz group, and reflection of space-time and
charge", Niels Bohr and the Development of Physics, New York: McGraw-Hal, 30-5 1.
W.Pauli (1957). "Onthe conservation of the lepton charge", Nuevo Cimenlo, 6, 204-14.
W.Pauli (1964). Collected Scientific Papers, Volume II, edited by R Kronig and V. F.
Weisskopf, New York: Interscience.
W. Pauli and F. Villars (1949). "On the invariant regularization in relativistic quantum
theory", Reviews of Modem Physics, 13, 434-44.
W. Pauli and V. F. Weisskopf (1934). "ber die Quantisienmg der skalaren relativistischen
Wellengleichung", Helveticu Physica Acta, 7 , 701-31.
D. H.Perkins (1984). "Proton decay experiments", AnnuaZ Review of NucIeur and Particle
Science,34, 1-52-
D.H. Perkins (1987). fntroduction to High Energy Physics, Medo Park: Addison-Wesley.
M. L. Perl et al. (1975). "Evidence for anomalous lepton production in e'e- aruiihiiation",
Physical Review Letiers, 35, 1489-92,
W. Perrett and G. B. Jeffery, eds. (1923, 1952)- Princple of ReZativity, Dover: New York.
A. Pevsner et al. (1961). "Evidence for a three-pion resonance near 550 MeV",Physical
Review Letters, 7, 421-23.
A. Pickering (1984). Commcting Qzm-ks: A Sociologic~IHistoty of Pmride Pkysics.
V. V. Raman and P. Forman (1969). "Why was it Schrodinger who deveIoped de Broglie's
ideas?, Hiscorieal &dies in the Physical Sciences, 1,291-314.
J. Reiff and M. Veltman (1969). "Massive Yang-MiUs fields", Nuclem Physics B, 13,
545-64.
G. D. Rochester and C. C. Butler (1947). "Evidence for the existence of new unstable
elementary particles", Nahue, 160, 855-57.
S. P. Rosen (1975). "Another method for searching for proton decay", PhysicuI Review
Leiters, 34, 774-76,
J. Rosen (1995). Symmeny in Science: An Introduction to the Generd Theory, New York:
Springer.
256
"On the composite model for the new particles", Progress of Theoreticul
A- Salam (1 980). "Gauge unification of fundamental forces", Reviews o f Modem Physics, 52,
525-3 8.
11, 568-77.
A. Salam and J. C. Ward (1 96la).
E-Schr6dinger (1926~). "Quantisienmg als Eigenwertproblem. III", Annalen der PhysiS 80,
43 7-90.
664-79.
J. Schwinger (1 951b). "The theory of quanzed fields. I", Physical Review, 82, 9 14-27.
J. Schwinger (1953). "The theory of quantzed fields. II", Pkysical Review, 91, 713-28.
J. Schwinger (1 957). "Aiheory of the hdarnental interactions", Annuls of Physics, 2,
407-34.
R Serber (1% 8). "On the dynaton theory of nuclear forces", Physicaf Review, 53, 2 11.
A. J. Senff, R B. Leighton, C. Hsiao, E. W. Cowan, and C. D. Anderson (1950). "Cloudchamber observations of the new unstable cosmic-ray particles", Physical Review, 78, 290-1.
R S. Shankand (1936). "An apparent failure of the photon theory of scattering", Physical
Rwiew, 49, 8-13.
R S. Shankiand (1937). "The Compton effect with y-rays", Physical Review, 52, 414- 18.
R Shaw (1 955). Problem of particle types and other conhibutions Io the theory of
elemenimy particles, Ph.D. dissertation, Cambridge University.
D. Speiser, and J. Tarski (1963). "Possible schemes for global symmetry", Journal of
Mathemutical Physics, 4, 588-6 12.
J. Steinberger, W. K. H.Panofsky, and J. Steller (1950). "Evidence for the production of
neutral mesons'by photons", Physical Review, 78, 802-5.
J. C. Street and E. C. Stevenson (1937). "New evidence for the existence of a particle of
mass intermediate between the proton and electron", PhysicaZ Review, 52, I O 0 3 4
Y- Takakhi (1957)- "On the generalized Ward identiiy", Nuovo Cimenfo, 6, 371-75.
1. Tamm (1934). "Exchange forces between neutrons and protons, and Fermi's theory",
Nature, 133, 981.
P . Tarjanne and V. L. Teplia (1963). "SU(4) assgnments for the vector resonances",
Physical Review Letters, 11, 447-48.
J. C. Taylor (1958). "&decay of the pion", P&sicaZ Review, 110, 1216.
J. C. Taylor (1971). "Ward identities and charge renomaization o f the Yang-Mills field",
Nuelem Physics B, 33, 43 6-44.
G.
't
173-99.
G.'t Hooft (1997). In Search of the Ultimate Building Blockr, Cambridge: Cambridge
University Press.
.
J. Tiomno (1957). "Onthe theory of hyperons and K-rnesons", Nuovo Cimento, 6, 69-83.
J. Tiomno and J. A. Wheeler (1949). "Energy spectrum of electrons fiom meson decay",
Reviews of Modern Physics, 21, 144-52.
S. B. Treiman (1960). "Weak global symmery", Nuovo Cimento, 15, 9 16-24.
B. L. van der Waerden (1968). Sources of Quantum Mechics, New York: Dover.
B. van Fraassen (1989). Lmvs and Symmetry, Mord: Odord University Press.
M. Veltman (1963). "Unitarity and causdity in a renormalizable field theory with unstable
particles", P?rysicrr, 29, 186-207.
M. VeInan (1966). "Divergence conditions and sum niles", Physical Review Leners, 17.
553-56.
V. P. Vizgin (1994). Unified Field Theories in the hefrst third of the 20th cenhoy, translated
by J. B. Barbour, Basel: Birkhher-
S. Weinberg (1964). "Derivation of gauge invariance and the equivalence principle fkoom
Lorentz invariance of the S-rnatrUcW,Physics Letrers, 9, 3 57-59.
S. Weinberg (1967). "A mode1 of leptons", Physicut Review Lerters, 19, 1264-66.
S. Weinberg (1977). "The search for unity: notes for a history of quantum field theory",
Duedalus, 106, 17-35.
S. Weinberg (1980). "Conceptuai foundations of the unined theory of weak and
electromagnetic interactions", Reviews of Modern Physics, 52, 5 15-23.
G e
l (1943). Ein#hmg in der Quanrentheorie der WelZenjielder, translated by C.
Houtermans and J. M. Jauch (1949) as Quantum Theory of Fields, New York: Interscience.
J. Wess (1960). "Investigation of the invariance group in the three fidamentai fields modei",
Nuovo Cimento, 15,52-72.
J. Wess and B. Zumino (1974). "Supergauge transformations in four dimensions", NucZeur
- Phy~ic.~
B, 70, 39-50.
L. Wessels (1977). "Schr6dinger3sroute to wave mechanics", Studies in the History and
Philosophy of Science, 10, 31-40.
H. Weyl (1919). "Eine neue Erweitening der Relativitatstheorie", A n d e n der Physik 59,
101-33.
H. Weyl (192 1a). "ber die phynkalischen Sim der enveiterten Relativitatstheorie'',
Physikalische Zeitsc-i!, 22, 473-80.
H. Weyl (1921b). "Electrcity and gravitation", Nature, 106,800-2.
H. Weyl (1929a). "Elektron und Gravitation. I", Zitschrij?f i Physik 56, 330-352.
Translated in L. O'Raifeartaigh (1997). Downing of Gauge ?ieory, 121-44.
H. Weyl (1929b)- "Gravitation and the electron", Proceedings of the NatiomL Academy of
Sciences, 15, 323-34. Reprhted in H.Weyl, Gesmnmelte Abhandlungert, Band 3, 217-28.
H.Weyl (1929~)- "Gravitationand the electron", Rice I'titute Pamphlet, 15, 323-34.
Reprinted in R Weyl, Gesummelte Abhandlungen, Baud 3, 229-244.
H. Weyl (193 la). Gnrppentheorie und Quanfenmechanik Leipzig: Hirzel. Revised edition
translated by H.P. Robertson ( 1 93 1 , 1950) as nie Theory of Groups and Quanum Mechanics,
New York: Dover,
H-Weyl (1938).
H. Weyl (1944). "David Hilbert and his mathematical work", Bulletin of the American
Mathematical Society, 50, 6 12-54.
H. Weyl (1952). Symmerry, Princeton: Princeton University Press.
G. C . Wick (1% 8 ) . "Range of nuclcar forces in Yukawa's theory", Nature, 142, 993-94.
G. C. Wick, A. S. Wigkman, and E. P. Wigner (1952). "The intrinsk pasity of elementary
particles", Physical Review, 88, 10 1-5.
A. S . Wightman (1989). "The general theory of quantized fields in the 1950sN, in L. M.
Brown, M. Dresden, and L. Hoddeson, eds. (1989). Pions to Quurh, 608-29.
E. P. Wimer (1926a). "mer nicht kombinierende Terme in der neueren Quantentheorie. 1".
Zeitschrij?fiu P h y s k 40, 492-500.
E. P. Wigner (1926b). "mer nicht kombinierende Terne in der neueren Quantentheorie. II",
Zeitschrzj? fur Physik 40, 883-92.
E. P-Wigner (1927a)- "Einige Folgemgen aus der Schrodingerschen heone iir die
Tennmukniren", Z e i t s c h i j2.r Physik, 43,62652.
546-59.
E. P. Wigner (1937). "On the consequences of the symmetry of the nuclear Hamiltonian on
the spectroscopy of nuclei", PhyszkaL Review, 51, 106-19.
E. P. Wigner (1 939). "On the unitary representations of the inhomogenous Lorentz group",
Annals of Mathematics, 40, 149-204.
E.P. Wigner (1 992). Collected Workr of Eugene Paul Wigner, Volumes 1, II, and VI, edited
by A. M. Weinberg, Berh: Springer.
K-G. Wilson (1969). "Non-Lagrangian models of current algebra", PhysicaZ Review, 179,
1499-1512.
C- N. Yang (1957). "Law of parity conservation and other symmetry laws of physics", Nobel
prke lecture, r e p ~ t e din Selected Popers (1945-1980), 236-46.
C. N. Yang (1974). "Integral formalisrn for gauge fields", Physical Review Letters, 33,
445-47.
C. N. Yang (1977). "Magnetic monopoles, fiber bunaes, and gauge fields", Annak of the
New York Academy of Sciences, 294, 86-97,
C. N. Yang (2980). "HermannWeyl's contribution to physcs", in K- Chandrasekharan, ed(1 980). Hermann Weyl, Centenary Lectures, 7-21,
C. N. Yang and R
M- Y o s b u r a (1978). ''Unifled gauge theories and the baryon number of the universe",
Physical Review Letters, 41, 281-84.
H . Yukawa (7 935). "On the interaction of elementary particles", Proceed<ngsof the PhysicoMathematiccd Sociep of Japan, 17, 48-57. Reprnted in D. M- Brink (1965). Nuclear
Forces, 2 14-24.
G.Zweig (1964). "An SU(3) mode1 for strong interaction symrnetry and its breaking", CERN
Report 84291TH412. Reprinted in DtB-Lichtenberg and S. P. Rosen, eds. (1980).
Developrnent in rhe @urk T h e o v of Hudrons, 22-1 01.